Senior NLP Data Engineer

iManageChicago, IL
7dHybrid

About The Position

We offer a flexible working policy that supports a healthy balance between personal and professional well-being. This role requires in-office presence on Tuesdays & Thursdays to collaborate, connect, and learn from peers - while also maintaining the flexibility for meaningful work-life balance. Being a Senior NLP Data Engineer at iManage Means… You’re passionate about transforming unstructured text into meaningful insights that power AI and machine learning solutions. You thrive at the intersection of data engineering, AI and natural language processing, building the pipelines and datasets that fuel generative AI applications, agentic systems, advanced model fine tuning and other NLP-driven capabilities across iManage. As an NLP Data Engineer on the Applied AI team, you will design, build, and optimize large-scale text data pipelines that power AI/ML and Generative AI solutions for our customers. You’ll work with knowledge engineering, applied AI, and product teams to prepare, enrich, and integrate document data. Your work will be essential to enabling intelligent, AI-powered features across the iManage platform.

Requirements

  • A Bachelor’s degree or higher in Computer Science, Data Engineering, Applied Mathematics, Computational Linguistics, or a quantitative related field.
  • 4+ years of data engineering experience, with at least 2 years working with unstructured data in a business setting.
  • Strong proficiency in Python, PySpark, and data manipulation for large unstructured text datasets.
  • Strong understanding of NLP concepts such as tokenization, embeddings, semantic search, and experience with standard text libraries such as SpaCy, HuggingFace Datasets, NLTK.
  • Solid dataOps knowledge and experience orchestrating advanced NLP data pipelines using cloud based data infrastructure
  • Proficiency with Git and collaborative development frameworks
  • A passion for enabling AI capabilities through scalable, reliable data architecture.
  • Problem solving, creativity, curiosity, and a collaborative mindset.

Nice To Haves

  • Exposure to Microsoft Azure Services such as Fabric, ADLS, AI Foundry, Azure ML, MLflow
  • Experience with knowledge graph implementation for NLP applications
  • Experience working with data for the legal domain
  • Experience designing architectures for large-scale text corpora

Responsibilities

  • Designing, developing and maintaining scalable pipelines in MSFT Azure to ingest and transform large volumes of text data from multiple sources
  • Designing automated workflows for text normalization, deduplication, language identification, PII redaction and metadata enrichment
  • Building automated data validation processes to ensure accuracy and consistency
  • Supporting model fine-tuning, semantic search and Gen AI evaluations tuning through dataset curation, prompt dataset preparation, labeling coordination, and text quality validation
  • Partnering with the Applied AI team to gather data requirements and build data interfaces for developing and maintaining machine learning systems
  • Maintaining data lineage and following data privacy, security and governance best practices
  • Implementing data versioning and lineage tracking for machine learning experiments

Benefits

  • Join a supportive, experienced team with an inclusive, encouraging, and vibrant culture.
  • Have flexible work hours that allow me to balance my ‘me time’ with my work commitments.
  • Collaborate in a modern open plan workspace, with a gaming area, free snacks, drinks and regular social events.
  • Focus on impactful work, solving complex, real challenges utilizing the latest technologies and protocols.
  • Own my career path with our internal development framework. Ask us more about this!
  • Learn new skills and earn certifications with access to unlimited courses in LinkedIn Learning.
  • Join an innovative, industry leading SaaS company that is continuing to grow & scale!
  • Creating an inclusive environment where I can help shape the culture not just by fitting in, but by adding to it.
  • Providing a market competitive salary that is applied through a consistent process, equitable for all our employees, and regularly reviewed based on industry data.
  • Rewarding me with an annual performance-based bonus.
  • Offering comprehensive Health/Vision/Dental/Life Insurance, and a 401k Retirement Savings Plan with a company match up to 4%.
  • Giving access to HealthJoy, a healthcare concierge service, to help me maximize my health benefits.
  • Granting enhanced leave for expecting parents; 20 weeks 100% paid for primary leave, and 10 weeks 100% paid for secondary leave.
  • Providing me with a flexible time off policy to take the time off that I need. Be it for vacation, volunteering, celebrating holidays, spending time with family, or simply taking time to recharge and reset.
  • Caring for my mental health and well-being with multiple company wellness days and free access to the Healthy Minds app for mindfulness, meditation and more.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service