The Site Reliability Engineer (SRE) role is a founding member of US Cold’s SRE practice, aimed at transitioning the organization from reactive operations to engineered reliability. This position will focus on studying critical system failures, particularly the Phenix WMS and facility automation interfaces, and designing controls, automation, and observability to reduce incidents. Success will be measured by fewer false alerts, faster recovery, less manual intervention, and self-healing systems. The SRE will collaborate with application, infrastructure, and operations teams, and participate in on-call rotations and incident response. This is a hands-on role where improvements directly impact daily warehouse operations.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
501-1,000 employees