Platform Engineer - Reliability & Scale

LangChainSan Francisco, CA
3d$175,000 - $215,000

About The Position

Join our platform engineering team as we scale LangSmith and LangGraph Platform products. You'll architect and operate the critical systems that power our customers' AI observability and LangGraph app deployments, working directly with cutting-edge technologies at the intersection of AI and distributed systems.

Requirements

  • Experience: 5+ years building and operating production systems at scale
  • Database expertise: Production experience with OSS datastores (PostgreSQL, Redis)
  • Infrastructure expertise: Deep knowledge of Cloud Object Storage, Kubernetes, containerized infrastructure, cloud platforms (e.g. GCP)
  • Observability mastery: Hands-on experience with observability stacks (Datadog, Prometheus/Grafana, OpenTelemetry or similar)
  • Programming proficiency: Strong hands-on software engineering skills (Python, Go, Rust)
  • Operational mindset: "You build it, you run it, you own it" philosophy with the focus on sustainable practices

Nice To Haves

  • Knowledge of columnar file and memory formats
  • Proficiency with analytical databases
  • Background in high-growth startups
  • Previous experience in AI infrastructure

Responsibilities

  • Scale critical systems: Design and implement high throughput data-intensive systems supporting our flagship SaaS products (LangSmith and LangGraph Platform)
  • Drive reliability: Build monitoring, alerting, and automated recovery systems that maintain high uptime
  • Solve complex problems: Debug performance bottlenecks, optimize database queries, and architect solutions for distributed system challenges
  • Shape platform strategy: Influence technical decisions around infrastructure, tooling, and operational practices as we grow from startup to enterprise scale
  • Respond to incidents: Participate in on-call rotation with focus on post-incident learning, automation and prevention

Benefits

  • We offer competitive compensation that includes base salary, meaningful equity, and benefits such as health and dental coverage, flexible vacation, a 401(k) plan, and life insurance.
  • For team members in the EU and UK, we provide locally competitive benefits aligned with regional norms and regulations.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

101-250 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service