Meta-posted about 1 year ago
Full-time • Intern
Redmond, WA
Web Search Portals, Libraries, Archives, and Other Information Services

As a Linguistic Engineer Intern at Meta, you will contribute to the development of language and multimodal data systems for the Multimodal Assistant. This role involves collaborating with cross-functional teams to enhance product language needs and improve data-based models. You will leverage your analytical skills and linguistic expertise to design and develop language data infrastructure, ensuring consistent features across various languages and modalities. The internship lasts between twelve to sixteen weeks.

  • Provide linguistic expertise in syntax, semantics, pragmatics, dialog, ontology, and NLP.
  • Build datasets, pipelines, and models for ML applications.
  • Clearly communicate expertise with project stakeholders.
  • Identify best practices and improve procedures across NLP systems.
  • Identify linguistic needs and gaps within project ontologies and NLP systems.
  • Anticipate language-based problems before they occur.
  • Drive projects from conceptualization through launch and beyond with continual improvement and support.
  • Design and conduct data-driven experiments.
  • Currently has, or is in the process of obtaining, a PhD degree in Linguistics, Language Technologies, Computational Linguistics, Speech Science, or a related field.
  • Training in various areas of linguistics, including phonetics, phonology, morphology, syntax, semantics, pragmatics, discourse analysis, sociolinguistics, psycholinguistics, computational linguistics, and field work.
  • Basic familiarity with programming techniques and languages such as Praat, Python, SQL, PHP, Hack, JavaScript, and React.
  • Demonstrated ability to cooperate within smaller projects or teams.
  • Experience contributing to data experiments.
  • Experience with hierarchical structures and ontologies.
  • Experience with text/image/video labeling problems.
  • Experience forming internal team relationships and fostering external relations.
  • Actively pursuing an advanced degree in Linguistics, Language Technologies, Computational Linguistics, Speech Science, or a related field.
  • Experience with larger scripting projects that involve combining language data from different sources and computing complex metrics over large datasets.
  • Strong understanding of the relationship between data and machine learning models to increase linguists' impact on ML projects.
  • Familiarity with core data processing techniques and tooling, including version control, unit tests, and other programming best practices.
  • Fluency in two or more natural languages.
  • Intent to return to degree program after the completion of the internship.
  • Competitive salary with benefits (exact figures not specified).
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service