Sr Engineers, Systems Reliability

T-Mobile, Frisco, TX
Hybrid

About The Position

At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees get the same big love we give our customers. All team members receive a competitive base salary and compensation package - this is Total Rewards. Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches. That’s how we’re UNSTOPPABLE for our employees!

T-Mobile is America’s supercharged Un-carrier, delivering an advanced 4G LTE and transformative nationwide 5G network that will offer reliable connectivity for all.

The Sr Engineers, Systems Reliability position is located in Frisco, TX and will utilize proficient knowledge and skill in emerging DevOps-centric automation tools and technologies for CI/CD, configuration management, etc. for production environments. Telecommuting is permitted, but the applicant must work from the worksite location at least 3-4 days per week. Up to 10% domestic travel is required for meetings and projects. Position duties and responsibilities include, but are not limited to, those listed under Responsibilities below.

Minimum requirements: Applicants must meet either the primary requirement (Master’s degree in Computer and information technology, Electrical and Computer Engineering, or related, and 6 years of relevant work experience) or the alternative requirement (Bachelor’s degree in Computer and information technology, Electrical and Communication Engineering, or related, and 8 years of relevant work experience), and must have experience in each of the skills listed under Requirements below. Applicants must also be at least 18 years of age and legally authorized to work in the United States.

Requirements

  • Primary: Master’s degree in Computer and information technology, Electrical and Computer Engineering, or related, and 6 years of relevant work experience.
  • Alternative: Bachelor’s degree in Computer and information technology, Electrical and Communication Engineering, or related, and 8 years of relevant work experience.
  • Design, develop, and deliver complex GitLab CI/CD pipelines for enterprise billing platforms.
  • Build and administer Kubernetes clusters using Conductor for application lifecycle management, with packaging via Helm and duck templates for infrastructure automation.
  • Develop custom tools in Shell, Perl, YAML, Jython and Python (including Boto3) to support zero-downtime deployments and operations.
  • Implement Infrastructure as Code with Terraform and AWS CloudFormation to provision infrastructure across AWS, PCF, Google and Azure cloud platforms.
  • Develop AWS Lambda functions to migrate historical billing information from RDS to S3.
  • Support and administer Skava-based ecommerce platforms, Java/J2EE, and REST APIs, including deployment, scaling, and operational troubleshooting in production.
  • Provision and manage relational and NoSQL databases, including PostgreSQL, MySQL, Oracle, and MongoDB (Atlas); develop and optimize SQL scripts for billing workflows and for generating monthly consumer and business reports.
  • Develop scripts and controls to enforce access management using Azure AD and prevent public exposure of secrets using GitGuardian, T-Vault, and CyberArk, ensuring compliance with cybersecurity standards.
  • Automate Windows system administration and deployment processes using PowerShell; create and maintain Power BI reports and dashboards.
  • Expert-level experience in implementing and managing observability platforms like Splunk, AppDynamics, and Grafana, with a focus on developing real-time dashboards and actionable alerts for microservice health, API latency, and system fault detection.
  • At least 18 years of age
  • Legally authorized to work in the United States

Responsibilities

  • Perform environment management, automated server provisioning, and pipeline configuration (VMs).
  • Deliver software to improve the availability, scalability, latency, and efficiency of T-Mobile’s services.
  • Craft, manage, and use dashboards for continuous monitoring and health checks of applications and the underlying infrastructure; improve the quality of services using monitoring feedback for the production environment.
  • Contribute to future improvement of software delivery processes and operations, e.g., cloud enablement, use of microservices with containerization.
  • Mentor and guide other Systems Reliability Engineers, Software Engineers, and vendor resources as needed.

Benefits

  • Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches.
  • Employees in regular, non-temporary roles are eligible for an annual bonus or periodic sales incentive or bonus, based on their role.
  • Most Corporate employees are eligible for a year-end bonus based on company and/or individual performance, which is set at a percentage of the employee’s eligible earnings in the prior year.
  • Certain positions in Customer Care are eligible for monthly bonuses based on individual and/or team performance, while Retail and Business Sales roles are eligible for monthly or quarterly sales incentives.
  • EVERY employee at T-Mobile is eligible for an Annual Stock Grant.
  • We cover all of the bases, offering medical, dental and vision insurance, a flexible spending account, 401(k), employee stock grants, employee stock purchase plan, paid time off and up to 12 paid holidays - which total about 4 weeks for new full-time employees and about 2.5 weeks for new part-time employees annually - paid parental and family leave, family building benefits, back-up care, enhanced family support, childcare subsidy, tuition assistance, college coaching, short- and long-term disability, voluntary AD&D coverage, voluntary accident coverage, voluntary life insurance, voluntary disability insurance, and voluntary long-term care insurance.
  • Eligible employees can also receive mobile service & home internet discounts, pet insurance, and access to commuter and transit programs!