Cloud Engineer II

Clearwater Analytics Holdings Inc.Boise, ID
47d

About The Position

Clearwater Analytics is looking for a motivated and detail‑oriented Cloud Engineer II to help maintain and enhance the reliability, performance, and scalability of our industry‑leading cloud platforms. You will be part of a high‑impact team that excels in cloud operations, observability, and continuous improvement. In this role, you will contribute to modernizing our technology stack to align with industry best practices, optimize application launch speed and efficiency, and participate in on‑call rotations and incident response. You will work closely with teams across the organization to deliver seamless, scalable, and consistent cloud computing experiences that power our business at global scale.

Requirements

  • Hands-on experience with AWS, Azure, or GCP (AWS preferred)
  • Experience in Kubernetes, container runtimes (containerd, Docker), and related cloud native ecosystem tools.
  • Proficiency with Terraform, CloudFormation, or other IaC frameworks.
  • Strong understanding of monitoring, alerting, and observability tools (Dynatrace, Prometheus, Grafana, etc.)
  • Proficiency in scripting/automation (Python, Bash, PowerShell)
  • Ability to adapt easily to change, contribute while working as a team or individually
  • Strong problem-solving and analytical skills
  • Bachelor's/master's degree in engineering or a related field.
  • 2-8 years of experience

Responsibilities

  • Monitor and optimize large-scale cloud infrastructure to maintain peak performance, reliability, and availability.
  • Implement and uphold best practices for cloud operations and observability, including metrics, logging, and distributed tracing.
  • Collaborate with engineering teams to modernize and migrate workloads to industry‑standard containerization, orchestration, and capacity management platforms.
  • Design, build, and enhance automation pipelines to improve deployment speed, stability, and scalability.
  • Troubleshoot and resolve production incidents quickly, applying effective root cause analysis to prevent recurrence.
  • Collaborate across DevOps, Security, and Product teams to design resilient architectures.
  • Develop and maintain tools and dashboards that provide clear visibility into resource utilization, efficiency, and cost optimization for compute environments.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

Number of Employees

1,001-5,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service