T-Mobile-posted 7 days ago
Full-time • Mid Level
Frisco, TX
5,001-10,000 employees

At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees get the same big love we give our customers. All team members receive a competitive base salary and compensation package - this is Total Rewards. Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches. That’s how we’re UNSTOPPABLE for our employees! Are you ready to join the Un-carrier movement? This Sr Site Reliability Engineer ensures the reliability and resilience of digital infrastructure to support efficient software development and deployment. It involves automating processes and reducing manual effort to prevent operational incidents and improve system performance. The role requires expertise in programming, scripting, incident response management, and various technical tools to maintain system robustness. Success is measured by system stability, incident reduction, and continuous improvement in operational efficiency. The work directly impacts organizational stability and customer experience by maintaining high-performing and reliable systems. We are a team that encourages innovation and advocate an agile and open approach, truly working and playing in the Un-carrier way!

  • Enhances system reliability and resilience by identifying potential issues and implementing preventive measures.
  • Facilitates faster and more efficient software development and deployment by automating processes and reducing manual effort
  • Root Cause Analysis (RCA) review/participation to identify system issues and prevent incident recurrence, collaborating with Problem Management teams on Corrective and Preventive actions to enhance system reliability and performance, and identifying and prioritizing items for the Core SRE backlog to ensure continuous improvement in system operations and stability.
  • Prevents operational incidents by utilizing strong problem-solving and analytical skills.
  • Contributes to the robustness and efficiency of systems by leveraging expertise in programming and scripting languages, incident response management, and various tech tools.
  • Adapts to changing circumstances and drives innovation by continuously learning new skills and Technologies.
  • Also responsible for other Duties/Projects as assigned by business management as needed.
  • Bachelor's Degree plus 5 years of related work experience OR Advanced degree with 3 years of related experience (Required)
  • Acceptable areas of study include Computer Science, Engineering or related field (Required)
  • 4-7+ years - Working in operations or DevOps environments.
  • 4-7+ years - Troubleshooting customer related issues and managing customer relationships.
  • 4-7+ years - Developing software solutions using Python or similar programming languages.
  • Programming - Proficiency in programming and scripting languages such as Python and Bash. (Required)
  • Automation - Ability to automate processes and reduce manual effort. (Required)
  • Incident Management - Understanding of incident response management and operational support. (Required)
  • Experience with designing and maintaining CICD Pipelines. (Required)
  • Ability to learn new skills and technologies quickly and adapt to changing circumstances. (Required)
  • Understanding system reliability and resilience principles. (Required)
  • At least 18 years of age
  • Legally authorized to work in the United States
  • Ability to drive innovation and improve software development and deployment processes. (Preferred)
  • Experience with cloud native platforms. (Preferred)
  • AWS Certified DevOps Engineer: This certification validates technical expertise in provisioning, operating, and managing distributed application systems on the AWS platform. (Preferred)
  • Certified Kubernetes Administrator: This certification validates the skills required for day-to-day administration of Kubernetes environments. (Preferred)
  • Google Cloud Certified - Professional DevOps Engineer: This certification validates the ability to efficiently develop and deploy applications using Google Cloud technologies and to manage operations. (Preferred)
  • Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches.
  • employees in regular, non-temporary roles are eligible for an annual bonus or periodic sales incentive or bonus, based on their role.
  • Most Corporate employees are eligible for a year-end bonus based on company and/or individual performance and which is set at a percentage of the employee’s eligible earnings in the prior year.
  • medical, dental and vision insurance, a flexible spending account, 401(k), employee stock grants, employee stock purchase plan, paid time off and up to 12 paid holidays - which total about 4 weeks for new full-time employees and about 2.5 weeks for new part-time employees annually - paid parental and family leave, family building benefits, back-up care, enhanced family support, childcare subsidy, tuition assistance, college coaching, short- and long-term disability, voluntary AD&D coverage, voluntary accident coverage, voluntary life insurance, voluntary disability insurance, and voluntary long-term care insurance.
  • eligible employees can also receive mobile service & home internet discounts, pet insurance, and access to commuter and transit programs!
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service