Sonio is seeking its first Site Reliability Engineer (SRE), who will also be its first engineer based in the US. The role owns the platform’s stability and releases, particularly during PST hours. The ideal candidate is a hybrid of system administrator and software engineer, able to manage infrastructure and understand the code running on it. The position offers high autonomy: it requires making critical decisions during incidents and keeping the production environment state-of-the-art, secure, and resilient.
The SRE will report to the Lead DevOps Engineer and will provide US coverage for releases and incidents, acting as first responder during PST hours. The role bridges infrastructure and code: the SRE will collaborate with the DevOps team on Kubernetes, Terraform, and AWS, and should be able to read and patch Elixir code. The SRE will drive incident response end-to-end, including triage, mitigation, and blameless post-mortems.
Key responsibilities include improving platform operability by defining SLOs, tuning alerts, and enhancing observability (metrics, logs, tracing). The SRE will also transfer operational knowledge from France to the US by creating runbooks and documenting procedures. Additionally, the SRE will support compliance and security in a regulated medical-device environment by maintaining HIPAA-aligned controls and an audit-ready infrastructure.
Job Type: Full-time
Career Level: Mid Level
Education Level: No Education Listed
Number of Employees: 11-50 employees