Research Assistant (Department of Health Policy & Management)

Johns Hopkins University, Baltimore, MD
Remote

About The Position

We are seeking a motivated, detail-oriented Research Assistant to support a research project focused on data extraction. The successful candidate will parse HTML and extract structured data from TXT, HTML, and PDF files using Python. This position offers the opportunity to apply technical skills in real-world research applications. The ultimate goal of the project is to create a high-quality dataset intended for public consumption. Extensive documentation and collaboration with colleagues conducting quality control checks will be integral parts of the process. The Research Assistant oversees data collection, data organization, and/or data management or similar functions and tasks for one or more research studies in support of a PI or a research team.

Requirements

  • Bachelor's Degree in a related field.
  • Additional education may substitute for required experience and additional related experience may substitute for required education beyond a high school diploma/graduation equivalent, to the extent permitted by the JHU equivalency formula.

Nice To Haves

  • Proficiency in Python programming, including libraries such as BeautifulSoup for HTML processing, unit testing frameworks, and object-oriented design.
  • Familiarity with data cleaning, preprocessing, and handling diverse file formats.
  • Strong analytical skills with attention to detail.
  • Ability to work independently and efficiently manage time.

Responsibilities

  • Run routine and ad hoc reports.
  • Use standard tools and computer programs to review data.
  • Assist with data cleaning measures to ensure accuracy of data and preparation of tables.
  • Lead basic activities such as data collection and data entry.
  • May lead specific tasks and develop processes to ensure study activities occur effectively and efficiently.
  • May conduct literature searches to support faculty in research efforts.
  • May design and format papers/publications.
  • May assist PIs in writing summaries of papers for release as policy briefs or other channels.
  • Parse and extract data from HTML and TXT files to generate structured datasets.
  • Develop and implement Python scripts to automate data extraction.
  • Refine and improve existing code to enhance efficiency and functionality.
  • Clean and preprocess extracted data for further analysis.
  • Write unit tests to ensure quality.
  • Document workflows, scripts, and processes comprehensively to ensure reproducibility and transparency.
  • Collaborate with other team members to ensure data quality through regular checks and reviews.
  • Contribute to project milestones and adhere to deadlines.
  • Other duties as assigned.