Apollo.io • Posted 2 months ago
$195,000 - $285,000/Yr
Full-time • Manager
501-1,000 employees

As an Engineering Manager for the Infrastructure team, you’ll lead the engineers responsible for keeping Apollo’s systems fast, reliable, and scalable as we serve millions of daily users and process billions of data points. You’ll work at the intersection of platform engineering, SRE, observability, and developer productivity, ensuring that our foundation can support Apollo’s AI-native evolution and rapid growth.

  • Build and lead a world-class infrastructure team focused on reliability, scalability, and performance.
  • Report directly to the CTO, and partner closely with Product, Data, and AI Platform leaders to ensure the underlying systems enable fast, safe, and confident iteration.
  • Drive best-in-class engineering practices for production uptime, performance, CI/CD, observability, incident management, and cost optimization.
  • Foster a culture of excellence, ownership, and continuous improvement — where engineers are empowered to innovate and ship fearlessly.
  • Help define Apollo’s next-generation infrastructure strategy — from cloud architecture to developer experience and AI-driven automation.
  • Lead, coach, and grow a distributed team of high-impact Infrastructure Engineers.
  • Partner with senior engineering leadership on strategic initiatives such as cloud migration, infrastructure scaling, platform reliability, and cost efficiency.
  • Define and implement modern operational excellence practices, including SLOs, error budgets, incident reviews, and performance monitoring.
  • Guide technical decision-making across key areas like Kubernetes, GCP, observability, networking, CI/CD, and IaC (Terraform, Ansible).
  • Collaborate with AI, Data, and Product Engineering teams to ensure infrastructure scalability for ML and AI-native workloads.
  • Run effective 1:1s, career development conversations, and quarterly performance reviews.
  • Support recruiting efforts to attract top engineering talent across time zones.
  • 5+ years of hands-on software or infrastructure engineering experience.
  • 2+ years of experience leading teams of senior and staff-level engineers in platform, SRE, or infrastructure domains.
  • Proven ability to design and operate large-scale distributed systems in cloud environments (preferably GCP or AWS).
  • Expertise with Kubernetes, Docker, Terraform, Ubuntu, and CI/CD pipelines.
  • Familiarity with observability tools (Grafana, Prometheus, ELK, Datadog, New Relic) and performance tuning.
  • Strong grounding in networking, security, and reliability principles.
  • Experience managing infrastructure costs, availability SLAs, and high-throughput systems at scale.
  • Experience with AI/ML infrastructure, data pipelines, MongoDB, Ruby on Rails, Ansible, or Elasticsearch.
  • Equity; company bonus or sales commissions/bonuses.
  • 401(k) plan.
  • At least 10 paid holidays per year.
  • Flex PTO and parental leave.
  • Employee assistance program and wellbeing benefits.
  • Global travel coverage.
  • Life/AD&D/STD/LTD insurance.
  • FSA/HSA and medical, dental, and vision benefits.