Technical Historical Records Linguist II - Armenian, Korean or German (Part-Time, SLC, UT-Hybrid)

The Church of Jesus Christ of Latter-day SaintsSalt Lake, UT
3dHybrid

About The Position

We are looking for an experienced detail-oriented individual with demonstrated native-level language expertise in Armenian (Eastern and Classical), Korean (Hanja and Hangul), and/or German (Latin and Gothic) language(s) , deep knowledge of historical genealogical documents, and demonstrated understanding of guidelines and principles for creating training data for machine learning systems to help us build high-quality training data for machine learning systems and review the work of others and mentor them to do likewise. This person’s work will help make historical records available in FamilySearch’s automation and machine learning platform.

Requirements

  • BA/BS Linguistics, Family History, Instructional Design, or other bachelor’s degree with related or equivalent experience required.
  • 2 years relevant or related experience or equivalent experience
  • Native level fluency in at least one of the following: Armenian: experience with Eastern and Classical Armenian Korean: experience with old Hanja characters and Hangul characters German: experience with Gothic and Latin scripts
  • Business level fluency in English
  • Demonstrated paleography skills to accurately decipher historical documents
  • Demonstrated linguistic skills to build NLP datasets
  • Demonstrated understanding of guidelines and best practices for creating different types of machine learning datasets from historical genealogical documents
  • Demonstrated ability to mentor and train others
  • Demonstrated ability to update instructional materials in a manner that is grammatically correct, concise, accurate, and easy to understand
  • Experience working with historical documents
  • Strong technical and analytical aptitude with a passion for data, efficiency, and accuracy
  • Independent worker who is self-motivated, dependable, detail oriented, responsible, self-disciplined, and a team player with a record of timely delivery of requests
  • Willingness to support several projects at one time, and to accept reprioritization as necessary in a fast paced, constantly evolving environment
  • Comfortable handling a high volume of work on a daily basis
  • High proficiency in Microsoft Office tools including: Word, PowerPoint, and Excel
  • Ability to quickly grasp technical concepts

Nice To Haves

  • Experience with additional languages a plus

Responsibilities

  • exercise demonstrated expertise in paleography, linguistics, technology, and historical records to build machine learning training datasets for historical genealogical documents in many languages.
  • review the work of peers and provide mentoring and timely feedback to ensure datasets are created accurately, model the truth of the underlying artifact, and follow project guidelines.
  • Assist in updating instructional materials and training others.
  • Use paleography skills to: Accurately decipher historical documents.
  • Annotate language data with linguistic information to build natural language processing (NLP) datasets.
  • Model the way people naturally read historical documents by creating hierarchies of relationships between areas of text.
  • Precisely map the layout of historical documents.
  • Curate large amounts of data.
  • Review datasets for errors and provide corrections in a timely manner.
  • Perform other data modeling activities and duties as assigned.
  • Enable FamilySearch’s automation efforts by meeting aggressive deadlines and accomplishing work assignments with consistently high output, quality, and accuracy, and helping others to do likewise.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service