DevOps Engineer

BlitzyCambridge, MA
88d$140,000 - $180,000Onsite

About The Position

We're looking for an exceptional DevOps Engineer to architect and maintain the infrastructure that powers our revolutionary AI agent ecosystem. You'll be instrumental in building scalable, resilient systems that support both our cutting-edge AI platform and modern applications. This role offers the unique opportunity to work at the intersection of traditional DevOps and emerging AI infrastructure, creating systems that enable thousands of AI agents to collaborate seamlessly. As our DevOps Engineer, you'll take ownership of our entire infrastructure stack, from Kubernetes orchestration to AI agent deployment pipelines. You'll work directly with our engineering teams to ensure our platform can scale to support enterprise customers while maintaining the performance and reliability they demand.

Requirements

  • 5-8 years of DevOps/Infrastructure experience
  • Expert-level Python proficiency for automation and scripting
  • Deep Kubernetes expertise: deployment, scaling, troubleshooting, and optimization
  • Strong experience with Helm for application package management
  • Proven track record designing and implementing CI/CD pipelines
  • Hands-on experience with major cloud platforms (AWS, Azure, or GCP)
  • Terraform expertise for Infrastructure as Code
  • Strong Linux administration and containerization (Docker) skills
  • Experience with monitoring tools (Prometheus, Grafana, ELK stack)
  • Understanding of microservices architecture and distributed systems

Nice To Haves

  • CKA (Certified Kubernetes Administrator) or CKAD certification
  • Experience with MLOps tools (MLflow, Kubeflow, Ray, etc.)
  • Knowledge of AI/ML infrastructure requirements and optimization
  • Experience with GPU orchestration and management
  • API gateway and service mesh implementation (Istio, Linkerd)
  • GitOps experience (ArgoCD, Flux)
  • Experience scaling infrastructure for high-growth startups
  • Contributions to open-source infrastructure projects
  • Experience with multi-region, highly available deployments
  • Background in security and compliance (SOC2, HIPAA)

Responsibilities

  • You architect and implement robust Kubernetes infrastructure that scales effortlessly to support our growing AI agent ecosystem
  • You create sophisticated CI/CD pipelines that enable rapid, reliable deployment of both traditional services and AI agents
  • You develop Python-based automation that eliminates manual tasks and accelerates our development velocity
  • You design monitoring and observability systems that provide deep insights into both infrastructure and AI agent performance
  • You optimize our cloud infrastructure for cost-efficiency while maintaining enterprise-grade reliability
  • You collaborate effectively with development teams to improve developer experience and productivity
  • You proactively identify and resolve infrastructure bottlenecks before they impact customers
  • You establish infrastructure best practices that support our rapid growth
  • You build systems that can handle the unique challenges of AI workloads at scale
  • You maintain 99.9%+ uptime for critical production services

Benefits

  • Competitive Salary
  • Comprehensive health, dental, and vision insurance
  • 401(k) with company match
  • Flexible PTO policy
  • $5,000 annual professional development budget
  • Latest hardware and software tools
  • The opportunity to shape infrastructure for the future of software development
  • Work with cutting-edge AI technology and world-class engineers
  • Modern office in Cambridge's innovation hub
  • Regular team events and activities
  • The chance to solve novel infrastructure challenges at the intersection of DevOps and AI

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

1-10 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service