Software Engineer, Observability

Core WeaveSunnyvale, CA
73d$109,000 - $145,000Hybrid

About The Position

CoreWeave is the AI Hyperscaler, delivering a cloud platform of cutting edge services powering the next wave of AI. Our technology provides enterprises and leading AI labs with the most performant, efficient and resilient solutions for accelerated computing. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe. CoreWeave was ranked as one of the TIME100 most influential companies of 2024. As the leader in the industry, we thrive in an environment where adaptability and resilience are key. Our culture offers career-defining opportunities for those who excel amid change and challenge. If you're someone who thrives in a dynamic environment, enjoys solving complex problems, and is eager to make a significant impact, CoreWeave is the place for you. Join us, and be part of a team solving some of the most exciting challenges in the industry. CoreWeave powers the creation and delivery of the intelligence that drives innovation.

Requirements

  • 2-5 years of experience in Software Engineering, Site Reliability Engineering, DevOps, or a related field.
  • Proficiency in at least one programming or scripting language (e.g., Python, Go).
  • Experience working in Kubernetes, containerization, and microservices architectures.
  • Experience being on call, triaging and escalating (when appropriate) production issues.
  • History of consuming observability systems at scale.
  • Excellent problem-solving, analytical, and communication skills.

Nice To Haves

  • Experience running a production observability database or tool (e.g. ClickHouse, Elastic, Loki, Victoria Metrics, Prometheus, Thanos, OpenTelemetry, and/or Grafana).
  • Familiarity with infrastructure-as-code tools like Terraform.
  • Exposure to modern testing frameworks and progressive deployment strategies.
  • Hands-on experience using data-streaming systems for observability pipelines.

Responsibilities

  • Design, build and maintain logging, tracing, and/or metrics platforms with moderate supervision.
  • Develop and refine monitoring and alerting to enhance system reliability.
  • Assist engineers across CoreWeave in developing effective usage patterns for Observability systems.
  • Manage production and pre-production clusters, building tools to enable development teams to follow best practices.

Benefits

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service