Senior Platform Engineer

ARMPittsburgh, PA
55dHybrid

About The Position

The ARM (Advanced Robotics for Manufacturing) Institute is a national public-private partnership dedicated to accelerating the adoption of robotics and artificial intelligence (AI) in U.S. manufacturing to strengthen national competitiveness and security. Based in Pittsburgh, Pennsylvania, the ARM Institute is one of the Manufacturing USA Institutes and is sponsored by the Department of Defense (DoD). We’re a small, collaborative team seeking a Senior Platform Engineer to own and operate the data, machine learning, and deployment infrastructure that powers next-generation robotics R&D. Our platform connects robotics and AI research with American manufacturers, enabling real-world adoption and impact. In this role, you’ll build and maintain open-source tooling for data pipelines, model development, automated deployment, and scalable compute, while supporting manufacturers as they adopt and integrate the platform. We’re looking for someone who is adaptable, curious, and comfortable working across a broad and evolving platform. You’ll have opportunities to learn new skills, explore new cutting-edge technologies, pivot between projects, and step in wherever the team needs support. Through this work, you’ll help strengthen the American manufacturing base by enabling small- and medium-sized manufacturers to adopt robotics and AI. Please note: This is a hybrid role with 3 days in office, 2 days remote, in our Pittsburgh office.

Requirements

  • Bachelor’s degree in computer science, engineering, or a related technical field.
  • Minimum of 5 years of experience in DevOps, MLOps, platform engineering, or data engineering.
  • Strong experience with Docker and dev containers, GitHub Actions, and Linux debugging.
  • Hands-on experience with Ansible or similar automation tools.
  • Experience using MLflow, ClearML, Weights & Biases (W&B), or comparable MLOps frameworks.
  • Familiarity with monitoring and observability tools such as Grafana, Prometheus, OpenTelemetry (OTel), or similar stacks.
  • Experience managing cloud and on-premises compute and storage and building reliable distributed services.
  • Comfortable performing light physical activity and using standard office equipment.
  • Willingness and ability to travel up to 20%.

Nice To Haves

  • Master’s degree in a relevant field.
  • Experience with robotics or industrial automation.
  • Contributions to open-source projects.

Responsibilities

  • Build and maintain data pipelines, storage, and ingestion workflows for robotics datasets.
  • Develop and operate an end-to-end MLOps platform (experiment tracking, model registry, packaging, training/eval automation, deployment).
  • Own CI/CD and automation using GitHub Actions, Ansible, and containerized dev environments.
  • Help maintain a library of curated Docker containers and base images following industry best practices for security and performance.
  • Implement telemetry/monitoring stacks for remote debugging and system health.
  • Collaborate with robotics engineers, data scientists, and industry partners to integrate and transfer platform components.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service