DevOps/IT Engineer

Summer Robotics
Campbell, CA
Onsite

About The Position

We are seeking a DevOps/IT Engineer to join our team. This role involves maintaining and improving our cloud infrastructure, CI/CD pipelines, and AI/ML infrastructure. You will be responsible for scaling our Jenkins infrastructure, managing Docker images, and supporting our AI/ML training environments. Additionally, you will provide IT support to our engineering team, troubleshoot issues, and potentially build internal tools. The ideal candidate is comfortable with in-office collaboration, possesses strong problem-solving skills, and has experience in fast-paced R&D or robotics/AI environments.

Requirements

  • Proficiency with AWS services (EC2, S3, IAM, ECR, VPC, autoscaling).
  • Good knowledge of Terraform and Infrastructure as Code methodologies.
  • Hands-on experience maintaining Jenkins CI/CD pipelines.
  • Experience with C++ compilation toolchains (understanding build systems, not necessarily writing C++).
  • Strong Docker knowledge.
  • General IT infrastructure knowledge (networking basics, system administration, Linux environments).
  • Full-stack experience for building internal tools.
  • Comfortable with in-office collaboration (5 days/week in Campbell, CA).
  • Comfortable supporting multiple engineers daily, including rapid troubleshooting.
  • Strong problem-solving skills and ability to autonomously improve existing systems.
  • Ability to document processes, propose improvements, and work cross-functionally with software teams.

Nice To Haves

  • Experience with NVIDIA Jetson boards (flashing, OS preparation, infrastructure validation).
  • Experience working in fast-paced R&D or robotics/AI environments.

Responsibilities

  • Maintain and improve the Jenkins CI/CD infrastructure.
  • Scale Jenkins with on-demand workers using AWS ECS and Terraform.
  • Maintain and evolve custom Docker images based on NVIDIA CUDA for AMD64 (x86-64) and Jetson (ARM-based) targets.
  • Improve CI/CD caching strategies to significantly reduce Docker build times.
  • Maintain IaC for training AI/ML models using Terraform and SageMaker AI.
  • Optional: Integrate with a dashboard for training orchestration and monitoring, such as TensorBoard or Weights & Biases.
  • Support lab operations by preparing, installing, and maintaining workstations and Jetson boards.
  • Assist engineers when blocked by DevOps, CI/CD, IT, or cloud-related issues.
  • Optional: Build small internal web dashboards or automation tools.