Senior System Engineer

AT&TAlpharetta, GA
Onsite

About The Position

The Senior System Engineer will be responsible for ensuring the stability, performance, and continuous improvement of web applications and services. This role involves leading the response to production issues, identifying and troubleshooting problems, implementing immediate fixes, and ensuring minimal downtime in compliance with SLAs. The engineer will also build alerting systems, monitoring tools, and dashboards for proactive issue identification. Strong analytical and technical skills are required to diagnose and resolve complex production issues, with a focus on quick impact mitigation and collaboration with development teams for long-term solutions. Responsibilities include creating and maintaining comprehensive documentation for system architecture, configuration, deployment procedures, and troubleshooting guides. The role also involves proactively detecting problems, analyzing trends, assessing impacts, and managing escalated issues. Collaboration with development teams to define and validate non-functional requirements, monitor application performance using APM tools, and ensure thorough knowledge transfer of system changes are key aspects of this position. The Senior System Engineer will provide on-call support for agent-facing applications, including homegrown J2EE applications and SaaS platforms like Salesforce and MuleSoft.

Requirements

  • Requires a Bachelor’s degree, or foreign equivalent degree in Computer Science or Information Technology.
  • 5 years of progressive post-baccalaureate experience in the job offered or 5 years of progressive post-baccalaureate in a related occupation.
  • Architecting and developing web applications.
  • Utilizing observability tools such as Dynatrace, AppDynamics, Splunk, ELK, MuleSoft Any Point, Quantum Metric, and Catchpoint to create alerts, dashboards, reports, and synthetic monitoring.
  • Working with integration technologies and API Gateways, including MuleSoft and WebLogic.
  • Working with object-oriented programming languages such as Java, J2EE technologies, JavaScript, and frameworks such as Spring.
  • Using automation tools and scripting languages (Python, Shell).
  • Utilizing containerization technologies (Docker, Kubernetes) and cloud services (Azure).
  • Working with DevOps practices and tools (CI/CD pipelines, Git, Jenkins).
  • Understanding network protocols, load balancing, and security principles.
  • Using database SQL queries and building Linux shell scripts on demand.

Responsibilities

  • Ensure stability, performance, and continuous improvement of web applications and services.
  • Lead the response to production issues, identify and troubleshoot problems, implement immediate fixes, and ensure minimal downtime in compliance with SLAs.
  • Build alerting systems, monitoring tools, and dashboards to proactively identify issues.
  • Apply strong analytical, technical and functional skills to diagnose and resolve complex production issues, focusing on quick impact mitigation and work with development teams to implement long-term solutions.
  • Create and maintain comprehensive documentation for system architecture, configuration, deployment procedures, and troubleshooting guides.
  • Proactively detect problems, analyze trends and patterns, assess impacts and manage escalated issues to ensure timely resolution.
  • Conduct blameless postmortems and after-action reviews to identify failure patterns, document lessons learnt and implement remediation to enhance application resilience.
  • Collaborate with development teams to define and validate non-functional requirements during design and development phases, ensuring compliance before production deployment.
  • Monitor application performance using APM tools such as Dynatrace and ELK.
  • Work closely with product development teams to ensure thorough knowledge transfer of system changes prior to operationalization.
  • Provide on-call support for agent-facing applications, including homegrown J2EE applications and SaaS platforms such as Salesforce and MuleSoft.

Benefits

  • Medical/Dental/Vision coverage
  • 401(k) plan
  • Tuition reimbursement program
  • Paid Time Off and Holidays (based on date of hire, at least 23 days of vacation each year and 9 company-designated holidays)
  • Paid Parental Leave
  • Paid Caregiver Leave
  • Additional sick leave beyond what state and local law require may be available but is unprotected
  • Adoption Reimbursement
  • Disability Benefits (short term and long term)
  • Life and Accidental Death Insurance
  • Supplemental benefit programs: critical illness/accident hospital indemnity/group legal
  • Employee Assistance Programs (EAP)
  • Extensive employee wellness programs
  • Employee discounts up to 50% off on eligible AT&T mobility plans and accessories, AT&T internet (and fiber where available) and AT&T phone

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Number of Employees

5,001-10,000 employees

© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service