Senior Systems Engineer

Hospital for Special SurgeryNew York, NY
Onsite

About The Position

Serves as an enterprise-wide subject-matter expert across infrastructure platforms and services (on-premises and cloud), providing authoritative technical direction for day-to-day operations and strategic initiatives. Demonstrated experience remediating infrastructure and application vulnerabilities across on-premises and cloud environments, partnering with owners to prioritize risk and meet defined remediation timelines. Experience automating vulnerability remediation workflows (e.g., patch orchestration, configuration compliance, reporting, and exception handling) using scripting and/or Infrastructure as Code to improve speed, consistency, and auditability. Owns and governs technical standards, reference architectures, and engineering guardrails to ensure reliability, scalability, security, and supportability across all supported technologies. Leads and coordinates complex incident response and root cause analysis, acting as the senior escalation point for cross-domain outages and recurring problems Provides next-level support and technical mentorship to systems analysts, engineers, and partner teams; elevates team capability through coaching, runbooks, and knowledge transfer. Leads technical assessments, solution designs, and implementations for major initiatives, ensuring end-to-end operability (monitoring, backup/recovery, patching, capacity, security, and support processes). Partners with stakeholders to capture requirements, translate business needs into technical outcomes, and define support models and service-level expectations for new and existing services. Drives continuous improvement of operational processes and tooling, including automation, monitoring/observability, configuration management, and documentation practices. Provides deep technical expertise across systems, cloud, networking, and security to resolve operational issues while enabling engineering activities and modernization efforts. Maintains and improves standard operating procedures, policies, and technical documentation to ensure consistent, auditable, and supportable operations. Defines requirements and recommends hardware, software, and cloud service changes (upgrades, updates, lifecycle actions), balancing risk, cost, performance, and security. Identifies design gaps, technical debt, and systemic risks; proposes and drives remediation plans that improve stability and reduce operational burden. Measures and improves system performance, capacity, and availability; recommends and implements optimizations based on data and operational telemetry. Collaborates cross-functionally with business units and technical teams to evaluate and implement technologies and capabilities in alignment with agreed-upon business requirements and security/compliance needs. Produces and maintains current-state and future-state architecture diagrams, service maps, and technical artifacts that enable effective operations and troubleshooting. Creates clear, detailed technical documentation (connectivity, dependencies, data flows, and operational

Requirements

  • 10-12 years of experience in an enterprise setting, maintaining and supporting physical, virtual, on-premises, and cloud-based systems, applications, and tools.
  • Advanced knowledge of physical and virtual systems, Microsoft Windows Server, Active Directory, Microsoft Exchange, VMware, and Citrix (full technology suite), including NetScaler.
  • Strong cloud architecture skills, including AWS (preferred) and Microsoft Azure, with experience designing hybrid (on-premises + cloud) solutions.
  • Working knowledge of automation and scripting (PowerShell required; Python preferred) and operations practices (e.g., configuration management and Infrastructure as Code).
  • Familiarity with AI-enabled IT operations (AIOps) concepts and tools (e.g., using AI to accelerate troubleshooting, monitoring, and operational workflows).
  • Advanced knowledge of networking fundamentals and protocols (DNS, DHCP, TCP/IP, routing/switching, load balancing) and security principles.
  • Experience with security tools and capabilities such as EDR/XDR, SIEM/SOAR, vulnerability management, identity and access management (IAM)/MFA, and privileged access management (PAM).
  • Advanced knowledge of systems and application security, hardening, and patching.
  • Advanced knowledge of Microsoft 365 apps as well as Microsoft Visio and Microsoft Project.
  • Knowledge of Linux and/or UNIX is a plus.
  • Knowledge of storage systems (SAN, NAS) on-premises and/or cloud-based.
  • Knowledge of backup and recovery software and tools.

Nice To Haves

  • Experience in a medical/hospital environment is preferred.
  • Relevant certifications are preferred and should align to enterprise architecture and operations across cloud, security, networking, and core platforms. Examples include cloud (AWS preferred e.g., AWS Certified Solutions Architect; Microsoft Azure e.g., Azure Solutions Architect Expert), security (e.g., CISSP, CCSP), networking (e.g., CCNP), virtualization (e.g., VMware VCP), and architecture frameworks (e.g., TOGAF).
  • IT service management certification (e.g., ITIL) is a plus.
  • Python preferred
  • Knowledge of Linux and/or UNIX is a plus.

Responsibilities

  • Serves as an enterprise-wide subject-matter expert across infrastructure platforms and services (on-premises and cloud), providing authoritative technical direction for day-to-day operations and strategic initiatives.
  • Demonstrated experience remediating infrastructure and application vulnerabilities across on-premises and cloud environments, partnering with owners to prioritize risk and meet defined remediation timelines.
  • Experience automating vulnerability remediation workflows (e.g., patch orchestration, configuration compliance, reporting, and exception handling) using scripting and/or Infrastructure as Code to improve speed, consistency, and auditability.
  • Owns and governs technical standards, reference architectures, and engineering guardrails to ensure reliability, scalability, security, and supportability across all supported technologies.
  • Leads and coordinates complex incident response and root cause analysis, acting as the senior escalation point for cross-domain outages and recurring problems.
  • Provides next-level support and technical mentorship to systems analysts, engineers, and partner teams; elevates team capability through coaching, runbooks, and knowledge transfer.
  • Leads technical assessments, solution designs, and implementations for major initiatives, ensuring end-to-end operability (monitoring, backup/recovery, patching, capacity, security, and support processes).
  • Partners with stakeholders to capture requirements, translate business needs into technical outcomes, and define support models and service-level expectations for new and existing services.
  • Drives continuous improvement of operational processes and tooling, including automation, monitoring/observability, configuration management, and documentation practices.
  • Provides deep technical expertise across systems, cloud, networking, and security to resolve operational issues while enabling engineering activities and modernization efforts.
  • Maintains and improves standard operating procedures, policies, and technical documentation to ensure consistent, auditable, and supportable operations.
  • Defines requirements and recommends hardware, software, and cloud service changes (upgrades, updates, lifecycle actions), balancing risk, cost, performance, and security.
  • Identifies design gaps, technical debt, and systemic risks; proposes and drives remediation plans that improve stability and reduce operational burden.
  • Measures and improves system performance, capacity, and availability; recommends and implements optimizations based on data and operational telemetry.
  • Collaborates cross-functionally with business units and technical teams to evaluate and implement technologies and capabilities in alignment with agreed-upon business requirements and security/compliance needs.
  • Produces and maintains current-state and future-state architecture diagrams, service maps, and technical artifacts that enable effective operations and troubleshooting.
  • Creates clear, detailed technical documentation (connectivity, dependencies, data flows, and operational

Benefits

  • The base pay scale for this position is $167,500.00 - $255,250.00. In addition, this position will be eligible for additional benefits consistent with the role.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service