Expert Site Reliability Engineer

Harris ComputerColorado Springs, CO
$95,000 - $110,000Remote

About The Position

As a Site Reliability Engineer (SRE) at Altera, you will be responsible for ensuring the reliability, scalability, and performance of our hosted healthcare platforms. This role blends software and systems engineering to enhance service availability, automate operations, and improve the customer experience. You will act as a technical leader in monitoring, troubleshooting, incident response, and continuous improvement across our cloud and hybrid environments.

Requirements

  • 7+ years of experience supporting enterprise applications, infrastructure, or cloud environments.
  • Strong experience with APM tools such as LogicMonitor, AppDynamics, Azure Monitor, SentryOne, Dynatrace, Datadog, or New Relic.
  • Deep knowledge of Windows Server administration, IIS, .NET applications, Windows Clustering, MSMQ, Event Logs, and PerfMon.
  • Strong SQL Server experience, including performance tuning, query optimization, blocking analysis, and Always On Availability Groups.
  • Experience with Azure cloud environments and a solid understanding of networking fundamentals (DNS, TCP/IP, load balancing, firewalls).
  • Familiarity with ServiceNow (or other ITSM platforms) and ITIL principles.

Nice To Haves

  • Scripting with PowerShell, Python, or similar languages.
  • Infrastructure as Code (Terraform, ARM Templates, Bicep).
  • CI/CD pipelines and deployment automation (Azure DevOps, GitHub Actions).
  • Experience with Kubernetes and containerized workloads.
  • Experience implementing SLOs, SLIs, and Error Budgets.
  • Experience in a healthcare technology or patient care environment.

Responsibilities

  • Maintain and improve the reliability, availability, and performance of our production environments.
  • Lead the investigation and resolution of complex application, database, and infrastructure issues.
  • Participate in incident management, conduct root cause analysis (RCA), and contribute to post-incident reviews to prevent future occurrences.
  • Define and measure Service Level Indicators (SLIs) and Objectives (SLOs) to meet our service commitments.
  • Develop proactive monitoring and alerting strategies to identify and resolve issues before they impact customers.
  • Automate operational tasks using scripting and Infrastructure-as-Code (IaC) to improve efficiency.
  • Partner with engineering and cloud teams to refine deployment, monitoring, and support processes.
  • Provide technical leadership during major incidents and act as a key escalation point for critical issues.

Benefits

  • Competitive compensation and benefits package
  • Meaningful perks
  • Flexibility
  • A culture that values people, curiosity, and having fun while doing great work.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service