Collibra • Posted about 2 months ago
$140,000 - $175,000/Yr
Full-time • Mid Level
Hybrid • New York, NY
1,001-5,000 employees

Joining Collibra’s Unstructured AI Team

In this role, you will:

  • Own end-to-end technical delivery of Unstructured AI deployments, from first prototype to stable production across enterprise environments.
  • Build and scale full-stack systems that process and enrich large volumes of unstructured content (PDFs, contracts, reports, and other document types).
  • Embed closely with customer and field teams to understand their metadata, governance, and security needs, guiding how Unstructured AI integrates into their broader Collibra stack.
  • Scope work, sequence delivery, and remove blockers early to ensure fast iteration cycles between product, research, and deployment teams.

This is a hybrid role based in our New York office. Our hybrid model means you’ll work from the office at least two days each week. This setup helps us stay connected, work more closely together, and keep making progress as a team.

Forward Deployed Engineers at Collibra are responsible for:

  • Balancing scope, speed, and quality by making clear trade-offs that keep pilots moving and convert them into production rollouts.
  • Codifying repeatable patterns from customer projects into reusable connectors, enrichment modules, or playbooks that accelerate future deployments.
  • Feeding field insights back to Product and Research, identifying opportunities to improve the product experience.
  • Keeping cross-functional teams aligned through clear communication, prioritization, and follow-through.
  • Shipped complex systems under ambiguity - balancing speed and precision in real customer environments.
  • Written and reviewed production-grade backend code (Python, FastAPI).
  • Built or deployed document-processing systems and are comfortable with CI/CD, monitoring, and debugging tools.
  • 2+ years of software engineering or technical deployment experience, ideally involving enterprise integrations, AI data processing, or customer-facing delivery.
  • Strong proficiency in Python (data processing, API development, and integrations).
  • Proven ability to deliver production-grade systems that process large-scale unstructured data (PDFs, text, documents).
  • Solid understanding of data pipelines, microservice architecture, and API design.
  • Experience with cloud infrastructure (AWS, GCP, or Azure), Infrastructure as Code (Terraform), and containerization (Docker / Kubernetes).
  • Experience with LLM-based or AI-driven enrichment models (classification, extraction, deduplication, PII detection).
  • Familiarity with metadata systems, data cataloging, or document AI workflows.
  • Background in data governance, sensitive data detection, or enterprise integrations (Collibra, Databricks, Snowflake, etc.).
  • A track record of codifying repeatable deployment patterns into tools, SDKs, or frameworks.
  • Knowledge of security, compliance, and model evaluation best practices.
  • A bachelor’s degree or equivalent work experience is required.
  • Capable of communicating clearly across engineering, product, and field teams, ensuring alignment from prototype to rollout.
  • Experienced in spotting risks early, course-correcting without friction, and modeling composure when delivery timelines are tight.
  • Someone who cares deeply about data quality, precision, and governance.
  • Willing to gain hands-on experience with modern frontend development.
  • Able to translate customer requirements into technical plans and deliver end-to-end.
  • Strong communication and stakeholder-management skills across technical and business teams.
  • Calm, structured decision-making under tight timelines or ambiguity.
  • equity ownership at every level
  • bonus potential
  • a Flex Fund monthly stipend
  • pension/401k plans