The Church Of Jesus Christ Of Latter-Day Saints-posted 4 days ago
Part-time • Mid Level
Hybrid • Salt Lake City, UT
501-1,000 employees

We are looking for an experienced detail-oriented individual with native-level language expertise in Armenian, Japanese, and/or Romanian language(s) and deep knowledge of historical genealogical documents to build high-quality training data for machine learning systems. This person's work will help make historical records available in FamilySearch's automation and machine learning platform. This position can be done 100% remotely within the United States. If residing along the Wasatch Front, the expectation is to work at least 1 assigned day in office. As a Machine Learning Technical Historical Records Linguist II in the Records Product Group at FamilySearch you will be exercising your expertise in paleography, linguistics, technology, and historical records to build machine learning datasets from historical genealogical documents in many languages. You will enable FamilySearch's automation efforts by meeting aggressive deadlines and accomplishing work assignments with consistently high output, quality, and accuracy.

  • accurately decipher historical documents
  • annotate language data with linguistic information to build natural language processing (NLP) datasets
  • model the way people naturally read historical documents by creating hierarchies of relationships between areas of text
  • precisely map the layout of historical documents
  • curate large amounts of data
  • review datasets for errors and provide corrections in a timely manner
  • other data modeling activities and duties as assigned.
  • BA/BS Linguistics, Family History, or other bachelor's degree with related or equivalent experience required.
  • 2-3 years relevant or related experience or equivalent experience
  • Native level fluency in at least one of the following: Armenian, Japanese, Romanian
  • Business level fluency in English
  • Demonstrated paleography skills to accurately decipher historical documents
  • Demonstrated linguistic skills to build NLP datasets
  • Experience working with historical documents
  • Strong technical and analytical aptitude with a passion for data, efficiency, and accuracy
  • Independent worker who is self-motivated, dependable, detail oriented, responsible, self-disciplined, and a team player with a record of timely delivery of requests
  • Willingness to support several projects at one time, and to accept reprioritization as necessary in a fast paced, constantly evolving environment
  • Comfortable handling a high volume of work on a daily basis
  • High proficiency in Microsoft Office tools including: Word, PowerPoint, and Excel
  • Ability to quickly grasp technical concepts
  • Experience with additional languages a plus
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service