AI Data Infrastructure Engineer

Abaka AIPalo Alto, CA
2d$110,000 - $160,000

About The Position

We’re hiring an AI Data Infrastructure Engineer to build systems that power how large-scale datasets for LLM and multimodal models are discovered, evaluated, and scaled. This is a builder-first engineering role focused on designing LLM-powered agents, automation systems, and data pipelines. You’ll work on problems like: Automatically discovering new data sources across the internet Using LLMs and agents to evaluate and filter data sources at scale Building systems that significantly increase data throughput without increasing headcount This role sits at the intersection of data engineering, LLM systems, and applied AI infrastructure, and is ideal for someone who enjoys building from scratch and shipping fast.

Requirements

  • Strong technical foundation (engineering, scripting, systems, or data-focused background)
  • Experience building tools, automation, or pipelines from 0→1
  • Comfortable with Python, APIs, scraping, or backend workflows
  • Interest in LLMs, agents, or applied AI systems
  • Strong problem-solving ability and a builder mindset
  • Ability to operate independently in fast-paced, ambiguous environments

Nice To Haves

  • Experience with LLM frameworks or agent systems
  • Experience with large-scale data processing or distributed systems
  • Familiarity with automation tools, workflow builders, or AI-assisted development (e.g., Cursor)
  • Startup or high-growth environment experience

Responsibilities

  • Build LLM-powered agents and automation systems for data discovery and evaluation
  • Design and implement data pipelines for ingesting, filtering, and transforming large-scale datasets
  • Develop internal tools for data quality scoring, ranking, and selection
  • Experiment with scraping, APIs, and programmatic data collection at scale
  • Rapidly prototype and iterate on systems that improve data acquisition speed and quality
  • Collaborate closely with Data Engineering and Research teams to align data systems with model needs
  • Build scalable systems that increase data throughput and efficiency

Benefits

  • equity
  • health
  • dental
  • vision
  • PTO
  • flexible work schedule
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service