Senior Computational Linguist (USA)

Sigma GroupMadison, MS
10d

About The Position

Sigma AI is a global training data collection, preparation and annotation services company. With 30+ years of experience in the data annotation space, we support companies with the right mix of people, processes and technology to train smarter AI that serves humans better. We're looking a Senior Computational Linguist to collaborate with the Natural Language Processing team on the design and development of AI-based solutions for clients and grant-funded projects. Candidates should feel comfortable working in a young and multidisciplinary team where the environment is continuously changing.

Requirements

  • Master’s degree in computational Linguistics or NLP.
  • Over 5 years' experience in AI companies, working on text classification projects, NER, relationship detection, question-answering, machine translation, language modeling, automatic transcription and/or dialogue systems.
  • Experience in training machine learning models and neural networks.
  • Experience in using and fine-tuning transformer-based language models such as BERT and GPT.
  • Programming with Python.
  • Linux and Bash scripting.

Nice To Haves

  • Knowledge of audio, video and text annotation tools, active learning models and quality metrics.
  • Knowledge of Python NLP libraries: NLTK, Scikit-learn, Spacy, RASA and Transformers.
  • Acoustic analysis tools such as PRAAT.
  • Computer assisted translation (CAT) tools.
  • Descriptive and inferential statistics.
  • Experience in annotation projects.
  • Knowledge of linguistic typology.

Responsibilities

  • To collaborate with the Sigma AI team in the analysis of customer proposals, preparation of offers, carrying out of tests or pilots and training of the team of annotators or reviewers.
  • To configure annotation tools, automatically pre-annotate or generate data if possible, or implement active learning models to accelerate the annotation process, evaluate the quality of projects using quality metrics, and automatically detect errors.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service