Sr. Engineer, Site Reliability - Retail Mobility Engineering

T-MobileFrisco, WA
$107,300 - $193,500Onsite

About The Position

This role ensures the reliability and resilience of digital infrastructure, enabling efficient software development and deployment. It focuses on automating processes and reducing manual effort to prevent operational incidents, improve system performance, and enhance overall operational efficiency. The role requires expertise in programming, scripting, incident response management, and a variety of technical tools to maintain system robustness, improve deployment quality, and drive continuous operational improvements. Success is measured by system stability, incident reduction, and ongoing gains in operational efficiency. As part of a Retail Mobility Engineering organization, this role supports a large-scale enterprise device management and application delivery ecosystem that enables critical retail operations. The Senior Site Reliability Engineer leverages automation, CI/CD practices, scripting, observability, and incident management expertise to improve reliability, scalability, and operational efficiency across a complex technology environment. This work directly impacts organizational stability and customer experience by ensuring the availability, performance, and reliability of critical systems.

Requirements

  • Bachelor's degree plus 3 years of related work experience, advanced degree with 1 year of related work experience, or a combination of education and experience deemed equivalent.
  • Degree in Computer Science, Engineering, or a related technical field.
  • 4 - 7 years working in operations or develops environments.
  • 4 - 7 years troubleshooting customer related issues and managing customer relationships.
  • 4 - 7 years developing software solutions using Python or similar programming languages.
  • Strong analytical and problem-solving skills with the ability to identify, troubleshoot, and resolve complex operational issues.
  • Ability to design, implement, and maintain reliable, scalable automation solutions across complex enterprise environments.
  • Strong understanding of Site Reliability Engineering (SRE), DevOps practices, system reliability, resiliency, and operational excellence.
  • Proficiency in Python, Bash, or similar scripting languages for automation, integration, and operational tooling.
  • Experience with deployment automation, configuration management, and infrastructure automation practices.
  • Strong understanding of incident management, root cause analysis, and preventive operational improvement methodologies.
  • Knowledge of enterprise secrets management solutions, including CyberArk, HashiCorp Vault, or similar platforms.
  • Experience implementing secure engineering practices and applying cybersecurity principles within software delivery and operational environments.
  • Experience with observability, monitoring, operational reporting, and deployment visibility solutions.
  • Working knowledge of cloud platforms, cloud infrastructure, containerized environments, and Kubernetes.
  • Ability to collaborate effectively with product, engineering, cybersecurity, operations, and support teams to deliver reliable technology solutions.
  • Strong verbal and written communication skills with the ability to communicate technical concepts to both technical and non-technical audiences.
  • Ability to balance operational support, project delivery, modernization initiatives, and continuous improvement efforts.
  • Demonstrated learning agility and ability to adapt to evolving technologies, tools, and business requirements.
  • Commitment to operational excellence, service reliability, automation, and continuous improvement.
  • At least 18 years of age
  • Legally authorized to work in the United States

Nice To Haves

  • Experience working in Site Reliability Engineering, DevOps, platform engineering, operations, or software development environments.
  • Experience troubleshooting production issues and supporting business-critical systems.
  • Experience developing automation solutions using Python, Bash, or similar programming languages.
  • Experience designing and implementing CI/CD pipelines using GitLab or similar platforms.
  • Experience automating deployment, configuration management, and operational workflows.
  • Experience integrating and automating CyberArk, HashiCorp Vault, or similar enterprise secrets-management platforms.
  • Experience administering Jamf Pro and/or supporting enterprise mobile device management environments.
  • Experience supporting mobile device lifecycle management, application deployment, policy management, and migration initiatives.
  • Experience applying cybersecurity best practices across software delivery and operational environments.
  • Experience contributing to technical design decisions and collaborating across multiple engineering teams.
  • Experience with cloud platforms, infrastructure automation, and containerized environments.
  • Experience with observability, monitoring, and operational reporting solutions.
  • Experience mentoring peers, sharing technical knowledge, and contributing to engineering best practices.
  • Jamf 100 and/or Jamf 200 Certification
  • AWS Certified DevOps Engineer
  • Certified Kubernetes Administrator (CKA)
  • Google Cloud Certified – Professional DevOps Engineer

Responsibilities

  • Automate processes to accelerate software development and deployment while minimizing manual interventions, including CI/CD pipelines, deployment automation, and endpoint management workflows where appropriate.
  • Design, build, and enhance automation solutions that improve operational efficiency, deployment consistency, and service reliability across complex enterprise environments.
  • Enhance system reliability and resilience by identifying issues and implementing preventive measures to reduce downtime and improve operational stability.
  • Conduct root cause analysis and collaborate with problem management teams to prevent incident recurrence and improve system operations.
  • Leverage programming, scripting, and incident response expertise to improve system robustness, deployment quality, and operational efficiency.
  • Implement and support secure automation practices, including secrets management, credential lifecycle automation, and integration with approved enterprise security platforms.
  • Partner with product, engineering, cybersecurity, and operations teams to design and implement scalable deployment and automation solutions.
  • Support modernization initiatives involving mobile device management platforms, application deployment automation, platform migrations, and operational process improvements.
  • Improve deployment visibility, monitoring, operational reporting, and observability capabilities to enhance traceability and operational awareness.
  • Apply problem-solving and analytical skills to prevent operational incidents and maintain system stability.
  • Continuously learn new skills and technologies to adapt to changing environments and drive innovation.
  • Perform other duties and projects as assigned.

Benefits

  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Medical insurance
  • Dental insurance
  • Vision insurance
  • Flexible spending account
  • Paid time off
  • Up to 12 paid holidays
  • Paid parental and family leave
  • Family building benefits
  • Back-up care
  • Enhanced family support
  • Childcare subsidy
  • Tuition assistance
  • College coaching
  • Short-term disability
  • Long-term disability
  • Voluntary AD&D coverage
  • Voluntary accident coverage
  • Voluntary life insurance
  • Voluntary disability insurance
  • Voluntary long-term care insurance
  • Mobile service & home internet discounts
  • Pet insurance
  • Access to commuter and transit programs
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service