Cloud Systems Engineer

SOLV Energy
San Diego, CA
Hybrid

About The Position

The Cloud Systems Engineer will manage, maintain, and support SOLV Energy’s Azure and AWS cloud-based infrastructure to ensure consistent, reliable, and secure operations while maximizing the value of subscribed systems and services. This is a hands-on technologist role that reports to the IT Infrastructure and Cloud Manager and is responsible for all aspects of the company’s cloud-based computing infrastructure and services. This role is hybrid, with regular in-office presence in San Diego, CA. Specific location details and expectations will be discussed during the interview process.

Requirements

  • Bachelor’s degree in Information Technology, related technology field, or equivalent combination of education and experience
  • 10+ years overall IT infrastructure experience, including 5+ years building and operating production workloads in Azure or AWS at enterprise scale
  • Expertise in Azure or AWS cloud systems oversight, performance tuning, and administration.
  • Experience with and knowledge of the Linux operating system
  • Strong knowledge of cloud security technologies and best practices
  • Expertise with Infrastructure as Code: design and security, configuration management, integration, deployment, performance monitoring and tuning, automation of infrastructure.
  • Expertise with ARM templates and Terraform to enable automation.
  • Expertise with Entra ID Administration
  • Expertise with networking, including network protocols and services.
  • Proficiency in scripting languages such as PowerShell, Python, and Bash for automation
  • Expertise with deployment techniques (and tools) in a distributed environment.
  • Strong oral and written communication skills with a high degree of comfort with varying types of audiences
  • Emotional intelligence, flexible work style, and excellent diplomatic skills across all levels of an organization
  • Expertise designing and delivering complex solutions on time and with expected quality.
  • Expertise supporting various compliance and regulatory frameworks.
  • Advanced skills in performance tuning and optimization of cloud resources
  • Ability to multi-task, establish priorities, work independently, manage time, and deliver on commitments.
  • Hands‑on experience operating enterprise monitoring and alerting platforms
  • Practical experience automating operational tasks using scripting languages such as PowerShell, Python, and Bash, and orchestration tools such as Ansible or AWX.
  • Familiarity with AI‑assisted features in monitoring or automation tools
  • Applicants must be legally authorized to work in the U.S. without requiring employer sponsorship now or in the future.

Responsibilities

  • Assess the current Azure and AWS environments and create a continuous improvement roadmap.
  • Define cloud systems strategy to maximize the return on IT investments, while meeting or exceeding uptime and performance expectations.
  • Drive the adoption of best practices in cloud architecture and operations, ensuring high standards of performance and security.
  • Design, plan, and implement Azure and AWS based systems and services in support of functional, storage, compute, data integration, and systems security initiatives.
  • Collaborate with SecOps and IT Operations teams to ensure that new or updated solutions and services comply with the enterprise cyber security standards.
  • Proactively identify and apply system updates to prevent issues, strengthen security, tune performance, automate tasks, and manage costs.
  • Monitor Azure and AWS systems operations and address alerts, anomalies, and issues.
  • Develop and support Disaster Recovery, Backup, and retention policies on Azure and AWS platforms.
  • Maintain zero trust endpoint security with tools such as Microsoft Defender
  • Develop and implement training programs for team members to enhance their skills and knowledge in Azure technologies
  • Lead project planning and execution, ensuring timely delivery of cloud solutions and adherence to project timelines
  • Manage cloud networking and security, and monitor system logging.
  • Develop and maintain IT policies and procedures.
  • Ensure compliance with IT General Controls and provide the information needed to support audits and substantiate process and controls compliance.
  • Own and maintain enterprise monitoring and alerting platforms, including Zabbix and cloud‑native tools, to provide clear visibility into the health, performance, capacity, and availability of Azure and AWS environments.
  • Build and support automation workflows using scripting and orchestration tools such as Ansible/AWX and operational runbooks to reduce manual effort, improve reliability, and streamline day‑to‑day operations.
  • Identify and adopt practical AI‑assisted features within monitoring and automation tools to improve anomaly detection, alert quality, and operational insights, while ensuring decisions and remediation remain under engineering control.

Benefits

  • medical
  • dental
  • vision
  • basic life and disability insurance
  • 401(k) plan
  • vacation
  • sick and holiday pay