The concept of Product Reliability Engineering (PRE) draws inspiration from the principles of SRE. At Criteo, PRE acts as the bridge between Product, Platform Engineering, and Infrastructure. The PRE group comprises eight global engineering teams with a common objective: to build the most reliable platform in AdTech.

As a Senior Site Reliability Engineer, you'll work closely with product engineering to dig into the code of our apps, systems, and pipelines and assess where optimization is needed most. You'll tell stories with meaningful monitoring and, hopefully, never be paged during your on-call rotation because we've worked hard to make our platform reliable. Speaking of on-call, with an example team of five, you're looking at only about 10 weeks a year, and your time is compensated! Not bad, right? You'll learn skills from other team members along the way and have opportunities to teach us! It's perfect for an engineer who likes shipping code and wants to be involved in all aspects of reliability, efficiency, and maintainability.

In the PRE Platform team, you will help design production-oriented platforms at scale and support reliability and scalability efforts for the Platform Factory group. In particular, you will help the Factory group's teams, which provide R&D developers with the tooling they need to build applications: CI and CD pipelines, code and app management, observability, identity management stacks, etc.

Our missions as a PRE team:
Engage in and improve the whole lifecycle of services, from inception and design through to deployment, operation, and refinement.
Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
Optimize and maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
Scale, automate, and evolve systems by pushing for changes that improve reliability and efficiency.
Practice incident response and blameless postmortems.

Python, Go, C#, Scala, Kubernetes, Mesos...
Job Type: Full-time
Career Level: Mid Level
Industry: Professional, Scientific, and Technical Services
Number of Employees: 1,001-5,000 employees