Cloud Operations Manager

Work Truck Solutions · Chico, CA
1d · $120,000 - $193,000 · Remote

About The Position

The Cloud Operations Manager is responsible for the health, scalability, security, and cost-effectiveness of the organization's Azure cloud infrastructure. This role acts as a modern System Administrator and a team leader, ensuring system availability and managing the complexities of cloud infrastructure, compliance, and disaster recovery. The manager is expected to build, mentor, and lead a high-performing team of Cloud Operations Engineers.

Requirements

  • Proven experience managing infrastructure on major cloud platforms (AWS, Azure, or GCP), with at least 2 years in a leadership or managerial capacity.
  • Expertise in Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
  • Strong understanding of network security, IAM, and compliance frameworks.
  • Demonstrated ability to reduce cloud costs through FinOps principles.
  • Experience in designing and testing Disaster Recovery and High Availability architectures.
  • Proficiency in scripting languages for operational automation.
  • Familiarity with tools like CloudWatch, Datadog, Jenkins, or similar systems.
  • A focus on system availability as the primary metric (target uptime: 99.99%).
  • Excellent communication, delegation, and personnel management skills.

Responsibilities

  • Infrastructure Management & Provisioning: Oversee all cloud infrastructure and resources, including provisioning, regular patch management, and proactive capacity planning.
  • Monitoring & Incident Response: Establish comprehensive system observability and maintain alerting infrastructure; serve as the escalation point for major incidents, drive resolution, and champion thorough Root Cause Analysis (RCA).
  • Security & Compliance: Define and maintain a robust security posture by enforcing Identity & Access Management (IAM), completing security audits, ensuring data encryption, and managing audit logs for regulatory compliance.
  • Cost Optimization: Actively track cloud spend against budgets, direct the team in right-sizing and waste elimination, and optimize rates through reserved instances and savings plans as part of a FinOps strategy.
  • Disaster Recovery & Continuity: Direct the implementation and regular testing of comprehensive disaster recovery and business continuity plans, including backup management and maintaining a High Availability (HA) architecture across multiple zones.
  • Automation & Tooling: Guide the team in building and maintaining automation and tooling, implementing Infrastructure as Code (IaC) practices, developing self-service provisioning portals, and scripting repetitive operational tasks.
  • Team Leadership: Manage a team of Cloud Operations Engineers, including recruitment, performance reviews, professional development, and day-to-day work prioritization.
  • Strategy & Roadmap: Define the strategic vision, roadmap, and operational goals for the Cloud Operations function in alignment with overall business objectives.
  • Process Improvement: Develop and enforce operational procedures, standards, and best practices to ensure reliable and efficient cloud infrastructure management.
  • Cross-Functional Collaboration: Act as the primary liaison between Cloud Operations, Development, and Product teams to ensure alignment on platform needs and release readiness.

Benefits

  • Work on meaningful projects that shape the future of the commercial vehicle industry.
  • Competitive salary.
  • Fully remote Monday-Friday work week.
  • Comprehensive medical, dental, and 401(k) benefits, with complimentary life insurance.
  • Paid Time Off (PTO) and holidays.
  • Flexible scheduling, subject to manager’s approval.
  • Opportunity to work with a supportive and innovative team.