The Data Analyst II oversees activities related to data integrity, security and enhancement of the value of data. The Data Analyst II may direct the movement of data across multiple systems; oversee its validation and organization and make sure that data is available to appropriate people and systems within an organization. The Data Analyst II will also develop and execute reports that will be important assets of the practice as they continue to develop and evaluate data tools, repositories, and vocabularies. The Data Analyst II will support the Kronk Lab within the Institute for Health Equity Research (IHER) and the Queer Data Consortium (QDC), and is expected to collaborate within the consortium, the broader institute, and the Department of Population Health Science & Policy, supporting both funded and unfunded research projects, many of which relate to the development and maintenance of a LGBTQIA+ research database, lgbtDB, and bibliometric analyses related to that database. The Data Analyst II will also participate in projects relating to LGBTQIA+ representation in electronic health records (EHRs) and clinical datasets, as well an in broader linguistic analyses and initiatives related to health equity. In addition to data linking, mapping, analytics, and presentation, especially in relationship to established data and ethics standards and vocabularies, you will also be expected to assist in abstract, manuscript, and grant writing as needed. Duties may be modified from time to time depending on research direction and need, but will focus greatly on the development and maintenance of application programming interfaces (APIs) and broader data pipelining projects, as well as establishing and maintaining communications with other research and data centers, institutions, libraires, and archives. Day-to-day tasks will involve programming using Python, R, SQL, and SPARQL, and using ontologies and other knowledge organization systems (often represented using OWL and RDF) to parse through and organize broader swathes of data and datasets as part of ETL processes used to maintain larger research projects and make LGBTQIA+ more accessible, usable, searchable, and equitable. Such data often includes research abstracts and manuscripts, archival and library collections, sociopolitical and legal data, and population-related and spatiotemporal datasets. The Data Analyst II will be expected to write and test code, and make code available to other users and research groups, as well as to run programs that parse through and clean data in semi-automated and automated manners.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees