Senior Infrastructure Solutions Engineer

Exeter FinanceIrving, TX
$123,500 - $178,100

About The Position

Owns the operational health, stability, and reliability of enterprise infrastructure platforms, ensuring high availability, performance, and compliance across compute, storage, virtualization, cloud, and network environments. Operating within a run-focused model, this role is accountable for incident response, system performance, and continuous operational improvement. Serves as the senior technical escalation point for complex infrastructure issues, leading rapid resolution, root cause analysis, and long-term remediation planning. Partners closely with engineering and architecture teams to transition new capabilities into stable operations, ensuring solutions are supportable, observable, and aligned with enterprise standards. Drives automation, observability, and AI-enabled operational practices to improve efficiency, reduce manual intervention, and enhance overall service reliability.

Requirements

  • Bachelor’s degree or educational experience in related field preferred, or relevant work experience.
  • 6–10+ years of experience in infrastructure operations and engineering within enterprise environments, including 24/7 production support and responsibility for uptime, incident response, and service performance.
  • Strong hands-on experience with virtualization (VMware, Hyper-V), cloud infrastructure (Azure preferred), and enterprise storage and networking technologies.
  • Proven experience with automation and scripting (PowerShell, Python, Terraform, or similar), along with familiarity with observability, monitoring platforms, and SLA/SLO-driven operations.
  • Experience working within an ITIL-based operational model (Incident, Problem, Change) and managing complex infrastructure systems across compute, storage, and network domains.
  • In-depth knowledge of security practices, disaster recovery strategies, and performance optimization.
  • Strong analytical and problem-solving skills with the ability to resolve complex infrastructure challenges.
  • Strong leadership and communication skills, with the ability to collaborate across engineering, security, and business teams.
  • Excellent organizational and project management skills, with the ability to manage multiple high-priority operational activities.

Nice To Haves

  • Exposure to AI-enabled operations, automation platforms, or advanced operational tooling preferred.

Responsibilities

  • Own operational performance and reliability of infrastructure platforms, including SLA/SLO adherence, uptime, and service stability.
  • Lead complex incident resolution and drive root cause analysis and problem management, ensuring thorough diagnostics, cross-team collaboration, and long-term remediation to prevent recurrence.
  • Operate and maintain enterprise infrastructure environments, including virtualization (VMware/Hyper-V), cloud platforms (Azure), storage, and networking.
  • Partner with engineering/build teams to transition new platforms and changes into production operations, while supporting infrastructure modernization efforts and ensuring operational readiness, supportability, and alignment with enterprise standards.
  • Build and automate infrastructure workflows using scripting, configuration management, or orchestration frameworks (e.g., PowerShell, Python, Terraform) to reduce manual effort, improve consistency, and enhance system reliability.
  • Advance observability and monitoring practices, including alert tuning, telemetry, and performance visibility to improve signal-to-noise ratio and enable proactive issue detection.
  • Perform performance and capacity analysis, identifying trends and advising leadership on resource planning and technology investments.
  • Provide technical leadership across teams, mentoring engineers, strengthening troubleshooting capabilities, and reinforcing operational best practices.
  • Collaborate with vendors and service providers, represent Infrastructure Operations in cross-functional discussions, and support integration of automation and AI-enabled operational capabilities to improve incident response, analysis, and overall platform efficiency.

Benefits

  • competitive salary
  • benefits
  • training and development opportunities
  • team member recognition and awards
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service