At Navan, “It’s all about the user. All of them.” We’re passionate about providing a seamless one-stop experience for business travelers, no matter how they travel, where they stay, or where they’re going. We are constantly striving to make the most reliable and scalable systems possible to ensure that our services are available to our travelers when they need it most. With our exponential growth, we have many exciting challenges ahead and we’re looking for a passionate Site Reliability Engineer to join our team. As an SRE you will design and develop tooling, automation and infrastructure services that power the Navan services, used by thousands of travelers on a daily basis. You will work closely with development teams, release and productivity teams and security teams to identify customer needs and build innovative solutions to solve them. You will work across a vast array of systems and technologies, aiming to build an autonomous, monitored, fault-tolerant infrastructure that is optimized for both simplicity and uptime. You will collaborate with the backend and frontend engineering teams to ensure that product solutions are scalable, efficient, and reliable. You will design infrastructure to support our massive growth and work with the team to maintain the highest level of service. What You'll Do: Building a fast moving, high growth service. Navan is revolutionizing travel and expense services for the enterprise, and the product is evolving quickly. You are comfortable in a startup environment, enjoy seeing the product take shape, and have strong ownership of the success of your services. Designing, implementing and operating cloud infrastructure. You’re a fit for us if you think in terms of infrastructure as code, deployment pipelines, and building the guardrails to make going fast also going safely. Identifying reliability anti-patterns and solving them systemically. You dive deep into the data to evaluate the health of your systems, and you use it to improve visibility and reliability across the fleet of services. Finding and automating the toil out of our processes. You’d prefer to automate it entirely, or build a tool to empower your users rather than be the gatekeeper to the tool. Leveraging AI tools and platforms in your daily work to achieve autonomous operations, reduce toil, and improve system observability. Contributing to the definition and adoption of system reliability standards, including formalizing SLO/SLI frameworks, observability standards, and blameless post-mortem practices. Assisting in the adoption of AI-assisted developer tools and platforms to increase engineering productivity, enforce code quality standards, and enable real-time architectural validation.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed