You will own the end-to-end reliability and performance of many of our most critical systems. Working in lockstep with Product and Engineering, you will design, build, and refine the platform that our application and AI features run on, from Kubernetes and databases through CI/CD and observability. You will focus on keeping our systems fast, reliable, and easy for developers to work with.

You will work on real infrastructure that supports features people use every day, including:

- Improving and iterating on our observability stack, which includes Kibana, Grafana, OTel, and Elastic.
- Improving database performance by analyzing slow and high-volume queries, tuning indexes, optimizing query patterns and timing, and recommending schema and code changes to keep QPS and latency low.
- Improving and upgrading Kubernetes, including deploying new services, improving resource utilization, tightening security, and standardizing deployment patterns across teams.
- Improving CI/CD pipelines for both backend and frontend services so engineers can ship quickly and safely, with clear feedback loops, fast build times, and reliable rollbacks.
- Enhancing the local developer experience so that running and debugging the app locally feels fast, consistent, and representative of production.
- Improving CI/CD and observability for our ML pipeline and models, bringing MLOps best practices into our existing infrastructure.
Job Type: Full-time
Career Level: Mid Level
Education Level: No Education Listed