Infrastructure Manager

MANSON WESTERN LLCTorrance, CA
4dHybrid

About The Position

The Infrastructure Manager is responsible for the reliability, stability, and operational effectiveness of WPS's hybrid infrastructure environment, including cloud, on-premises systems, and core business platforms. This role leads the Systems Engineer and DevOps Engineer and ensures that infrastructure and DevOps work is well-managed, visible, and supportable across the team. The Infrastructure Manager focuses on operational ownership, system reliability, and reducing dependency on individual contributors or external vendors. The ideal candidate is a hands-on infrastructure leader with strong experience in hybrid environments and team leadership, and enough technical curiosity to learn, document, and support DevOps workflows as a functional backstop when needed.

Requirements

  • Bachelor's degree in Information Systems, Computer Science, or related field preferred.
  • 7–10+ years of experience in infrastructure or IT operations roles across hybrid environments.
  • 2+ years of people management or team leadership experience.
  • Strong experience with hybrid infrastructure environments (cloud + on-prem), including AWS, Azure, or equivalent.
  • Experience managing Microsoft 365 and identity platforms (Entra ID / Active Directory), including lifecycle management, conditional access policies, and endpoint compliance.
  • Experience with monitoring, patching, backups, and disaster recovery processes.
  • Working knowledge of cloud platforms (AWS, Azure, or similar).
  • Solid understanding of networking fundamentals including firewalls, VPCs, VPNs, DNS, and load balancing.
  • Sufficient technical aptitude to understand DevOps workflows, actively learn and document processes alongside the DevOps engineer, and serve as a functional backstop to maintain operational continuity when the DevOps engineer is unavailable or workload requires additional support.
  • Familiarity with DevOps environments and deployment workflows (CI/CD, automation), with the ability to support and collaborate effectively — not required to be a DevOps practitioner.
  • Experience improving operational reliability through documentation, process improvement, and cross-training.
  • Experience maintaining secure, compliant systems aligned with SOC 2, HIPAA, or similar frameworks.
  • Strong communication and collaboration skills with the ability to work effectively across teams.
  • Strong analytical and troubleshooting skills with a focus on continuous improvement.

Responsibilities

  • Own the day-to-day reliability and performance of infrastructure systems across cloud and on-prem environments.
  • Manage hybrid infrastructure including servers, networking, storage, Microsoft 365, and identity platforms (Entra ID, Exchange Online, SharePoint, Intune), including lifecycle management, conditional access policies, and tenant configuration standards.
  • Establish and maintain operational standards for monitoring, patching, alerting, and incident response.
  • Lead infrastructure incident response and ensure issues are resolved quickly and completely, including after-hours escalation support as needed.
  • Develop and maintain disaster recovery readiness, including backups, runbooks, recovery procedures, and testing.
  • Identify and eliminate operational fragility by ensuring infrastructure, processes, and vendor-delivered work are documented, measurable, and supportable by the internal team.
  • Lead and develop a small infrastructure and DevOps team.
  • Establish clear ownership, accountability, and communication across team members.
  • Ensure knowledge is documented and shared to eliminate single points of failure and reduce dependency on individual contributors or external vendors.
  • Manage and support team members in problem-solving and technical growth in alignment with the company's core values.
  • Provide oversight and support for DevOps processes including CI/CD pipelines and deployment workflows.
  • Ensure DevOps work is documented, maintainable, and understood by the broader team.
  • Act as an escalation point for infrastructure-related issues impacting deployments.
  • Collaborate with the DevOps engineer to improve reliability and transparency of deployments.
  • Partner with Security & Compliance to maintain secure configurations and meet regulatory requirements.
  • Support vulnerability remediation, system hardening, patching strategy, and access control improvements.
  • Ensure infrastructure aligns with SOC 2, HIPAA, GDPR, CCPA, and other applicable standards.
  • Implement network and firewall configuration changes based on security requirements defined by the Information Security & Compliance team.
  • Manage relationships with vendors, cloud providers, hardware/software vendors, licensing, and managed service partners.
  • Ensure vendor work is clearly scoped, delivered, and aligned with business needs.
  • Conduct capacity planning and ensure systems scale with business needs.
  • Identify and reduce operational risk, inefficiencies, and unnecessary dependencies.
  • Develop and maintain runbooks, documentation, and operational processes.
  • Drive cloud and infrastructure cost optimization through resource right-sizing, usage analysis, and automation.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service