Part-Time Data Extraction Programming

The Pennsylvania State UniversityUniversity Park, FL
7h$17Remote

About The Position

The Population Research Institute is seeking applicants for part-time job of Data Extraction Programming, an upper-level undergraduate computer programmer or other qualified individual with interest in advancing skills in data extraction. Specifically, the candidate will develop Python or Java code that extracts data from printed population registry books that have been digitized + OCR text recognized. Data will be stored in a CSV file; the project already has defined possible data fields. Job duties to include: Desirable skills include coding experience in Python and/or Java (required), creative problem solving, ability to scrutinize own work and catch errors, experience annotating code, and ability to communicate work progress to supervisor with limited programming experience. While source materials generally present information systematically, the code must account for routine deviations and errors in the source materials. Requirements, qualifications, and/or competencies: Candidate will work independently with access to an experienced programmer for consultation when needed. Self-directed learning and resourcefulness are essential. Generative AI may be used strategically in consultation with supervisor but should not substitute for coding skills. Some familiarity with Amish and Mennonites is a plus but not required. Work is primarily remote and flexible. Candidate must be committed to working in a distraction-free environment. Occasional exceptions to remote work include: (1) candidate may be required periodically to digitize printed population registries using equipment at University Park campus, and (2) candidate will occasionally meet in-person with supervisor, even as Zoom, telephone, and email will be the primary modes of communication. Hours must be completed by August 17, 2026. While weekly hours are flexible, the candidate must work no fewer than four hours any given week. The position will be housed in Penn State’s Population Research Institute and supervised by postdoctoral scholar Dr. Cory Anderson . Position open immediately, and applications will be received until the position is filled. Please submit a list of programming-related experience, list of relevant coursework and final grades, and one reference who can speak to your programming experience and potential (e.g. advisor, instructor, or current/former employer).

Requirements

  • Coding experience in Python and/or Java (required)
  • Creative problem solving
  • Ability to scrutinize own work and catch errors
  • Experience annotating code
  • Ability to communicate work progress to supervisor with limited programming experience
  • Candidate will work independently with access to an experienced programmer for consultation when needed.
  • Self-directed learning and resourcefulness are essential.
  • Candidate must be committed to working in a distraction-free environment.
  • Hours must be completed by August 17, 2026.
  • Candidate must work no fewer than four hours any given week.
  • Please submit a list of programming-related experience, list of relevant coursework and final grades, and one reference who can speak to your programming experience and potential (e.g. advisor, instructor, or current/former employer).

Nice To Haves

  • Some familiarity with Amish and Mennonites is a plus but not required.

Responsibilities

  • Develop Python or Java code that extracts data from printed population registry books that have been digitized + OCR text recognized.
  • Store data in a CSV file.
  • Account for routine deviations and errors in the source materials.
  • Communicate work progress to supervisor with limited programming experience.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service