Platform Engineer

ZooxFoster City, CA
2d

About The Position

The SW HIL (hardware in the loop) team ensures that the testing environment for the system and subsystem improves in uptime, reliability, and feature set. To aid the adoption of our testing framework for use by other teams, the SW HIL team collaborates in test development and maintains key tests that exercise the functionality of the testing framework. We are seeking a highly motivated, energetic, self-starting Platform Engineer to join our Systems Reliability and Stability team. This position will be responsible for ensuring the upkeep, design, and maintenance of various engineering services. You will oversee and collaborate with other teams to ensure the high uptime of our robot testing platforms while measuring and improving their stability, accuracy, and usability.

Requirements

  • Bachelor's Degree in Engineering, Computer Science, Math, or related field
  • 5+ years supporting in-production services, on-call rotations, and SRE responsibilities
  • Proficient in Python or Golang
  • Experience building and managing infrastructure services
  • Experience writing code for system-level automation, building resilient infrastructure, CI/CD pipelines
  • Proficiency with testing frameworks, hands-on at the system/integration level
  • Experience with cloud infrastructure (AWS), container orchestration (EKS/Kubernetes)
  • Linux system administration experience, troubleshooting, and performance tuning

Nice To Haves

  • Built software services, wrote APIs for backend services, owned and managed full-stack applications
  • Experience with CI toolchains such as Bamboo, Bazel, and test frameworks such as Pytest
  • Experience writing and managing infrastructure using IaC tools such as Terraform, Ansible, and Salt
  • Familiarity with Python test automation frameworks and test fixture design
  • Experience in provisioning and managing ephemeral test environments

Responsibilities

  • Responsible for measuring and maintaining the uptime of various services critical to the development of autonomous vehicles, such as testing and validation of on-vehicle software for hardware platforms.
  • Involved with all phases of rolling out various services, from design, deployment, operations, support, automation, and continuous improvement
  • Work with systems handling large volumes of data and data processing pipelines while performing compute-intensive tasks on CPUs and GPUs

Benefits

  • paid time off (e.g. sick leave, vacation, bereavement)
  • unpaid time off
  • Zoox Stock Appreciation Rights
  • Amazon RSUs
  • health insurance
  • long-term care insurance
  • long-term and short-term disability insurance
  • life insurance
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service