Senior AI Engineer, Unstructured AI

Collibra | New York, NY
$204,000 - $255,000 | Hybrid

About The Position

Joining Collibra’s Unstructured AI Team

  • Work at the forefront of context engineering - shaping how AI systems retrieve, structure, and leverage context to deliver accurate, high-quality results at scale.
  • Own end-to-end technical delivery of Unstructured AI systems - from feature prototype to stable production across enterprise environments.
  • Build and scale full-stack systems that ingest, process, and enrich large volumes of unstructured content from distributed enterprise silos (PDFs, contracts, reports, and other document types).
  • Collaborate with the Best: Work closely with xYC Founders to understand complex business challenges and deliver Deasy to solve them. Be part of a dynamic team where ideas flow freely and creativity thrives.
  • Learn and Lead: Stay ahead of the curve by engaging with the latest developments in machine learning and AI. Share knowledge and lead by example to maintain high building standards.

This is a hybrid role based in our New York office. Our hybrid model means you’ll work from the office at least two days each week. This setup helps us stay connected, work more closely together, and keep making progress as a team.

Requirements

  • Strong proficiency in Python (data processing, API development, and integrations).
  • Hands-on work with LLM-based and AI-driven enrichment models (e.g., classification, entity extraction, deduplication, PII detection).
  • Proven ability to deliver production-grade systems using Big Data frameworks (e.g., Spark) to handle data at scale.
  • Solid understanding of data pipelines, microservice architecture, and API design.
  • Experience ingesting and processing data from third-party enterprise sources (e.g., SharePoint/OneDrive, Salesforce, and SaaS-based knowledge bases).
  • Strong communication skills across technical and business teams.
  • Calm, structured decision-making under tight timelines or ambiguity.
  • Familiarity with metadata systems, data cataloging, or document AI workflows.
  • Knowledge of model evaluation best practices.
  • Experience with search relevance.
  • A bachelor’s degree or equivalent related working experience is required.

Responsibilities

  • Shipping complex systems under ambiguity - balancing speed and precision in real environments.
  • Writing and reviewing production-grade backend code (Python, FastAPI).
  • Building/deploying document-processing systems that handle large-scale, unstructured data environments.
  • Integrating data from diverse enterprise data sources (e.g., SharePoint, Salesforce, or internal APIs) to provide context for AI features.
  • Partnering across engineering, product, and sales teams, ensuring alignment from prototype to rollout.
  • Occasionally contributing to modern frontend development.

Benefits

  • In addition to base salary, we offer a competitive total rewards package, including bonus potential, equity for eligible roles, a Flex Fund monthly stipend, pension/401k plans, and more.
  • Collibra recognizes and values that everyone has different needs, interests, and life goals. We built our benefits program with flexibility in mind to support you and your loved ones through a diverse range of circumstances and life events. These flexible offerings sit on a foundation of competitive compensation, health coverage, and time off.
  • We create inclusion and belonging through how we onboard, meet, connect, engage, and communicate.
  • At Collibra, we’re proud to be an equal opportunity employer. We realize the key to creating a company with a world-class culture and employee experience comes from who we hire and creating a workplace that celebrates everyone.
  • With this, we proudly consider qualified applicants without regard to race, color, religion, creed, gender, national origin, age, disability, veteran status, sexual orientation, pregnancy, sex, gender identity, gender expression, genetic information, physical or mental disability, HIV status, registered domestic partner status, caregiver status, marital status, veteran or military status, citizenship status or any other legally protected category.
  • If you have a need that requires accommodation, let us know by completing our Accommodations for Applicants form.