Serves as an enterprise-wide subject-matter expert across infrastructure platforms and services (on-premises and cloud), providing authoritative technical direction for day-to-day operations and strategic initiatives. Demonstrated experience remediating infrastructure and application vulnerabilities across on-premises and cloud environments, partnering with owners to prioritize risk and meet defined remediation timelines. Experience automating vulnerability remediation workflows (e.g., patch orchestration, configuration compliance, reporting, and exception handling) using scripting and/or Infrastructure as Code to improve speed, consistency, and auditability. Owns and governs technical standards, reference architectures, and engineering guardrails to ensure reliability, scalability, security, and supportability across all supported technologies. Leads and coordinates complex incident response and root cause analysis, acting as the senior escalation point for cross-domain outages and recurring problems Provides next-level support and technical mentorship to systems analysts, engineers, and partner teams; elevates team capability through coaching, runbooks, and knowledge transfer. Leads technical assessments, solution designs, and implementations for major initiatives, ensuring end-to-end operability (monitoring, backup/recovery, patching, capacity, security, and support processes). Partners with stakeholders to capture requirements, translate business needs into technical outcomes, and define support models and service-level expectations for new and existing services. Drives continuous improvement of operational processes and tooling, including automation, monitoring/observability, configuration management, and documentation practices. Provides deep technical expertise across systems, cloud, networking, and security to resolve operational issues while enabling engineering activities and modernization efforts. Maintains and improves standard operating procedures, policies, and technical documentation to ensure consistent, auditable, and supportable operations. Defines requirements and recommends hardware, software, and cloud service changes (upgrades, updates, lifecycle actions), balancing risk, cost, performance, and security. Identifies design gaps, technical debt, and systemic risks; proposes and drives remediation plans that improve stability and reduce operational burden. Measures and improves system performance, capacity, and availability; recommends and implements optimizations based on data and operational telemetry. Collaborates cross-functionally with business units and technical teams to evaluate and implement technologies and capabilities in alignment with agreed-upon business requirements and security/compliance needs. Produces and maintains current-state and future-state architecture diagrams, service maps, and technical artifacts that enable effective operations and troubleshooting. Creates clear, detailed technical documentation (connectivity, dependencies, data flows, and operational
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior