Senior DevOps Engineer

ProsciaMiddle City West, PA
13d

About The Position

As a Senior DevOps Engineer, you'll be a key contributor to Proscia's platform—helping build the infrastructure that enables pathologists to diagnose patients more accurately and efficiently. Our product powers high-resolution virtual microscopy and AI-assisted workflows at massive scale. If you enjoy solving hard reliability and performance problems in data-intensive systems, this role will challenge you and offer uncommon growth opportunities. You'll work closely with engineering and technical operations teams to ensure our infrastructure, CI/CD pipelines, and observability stack support high-performance, compute-heavy workloads. You'll also serve as a bridge between development and operations, helping teams ship quickly without compromising reliability or security.

Requirements

  • Deep expertise with Kubernetes, including Helm chart development and lifecycle management.
  • Strong containerization experience, including Docker and Docker Compose.
  • Proficiency with Terraform (or comparable IaC tooling) for production infrastructure.
  • Hands-on experience operating workloads on AWS and/or Azure.
  • Proven CI/CD ownership and implementation experience, ideally with GitHub Actions.
  • Strong background in observability tooling such as OpenTelemetry, DataDog, Prometheus, Grafana, or equivalents.
  • Experience supporting large-scale data pipelines and storage systems—especially for large image files or compute-heavy workloads.
  • Working knowledge of private artifact repositories (e.g., AWS CodeArtifact, Artifactory, GitHub Packages).
  • Excellent problem-solving skills, good judgment under pressure, and the ability to prioritize effectively in dynamic environments.

Nice To Haves

  • Experience with distributed file systems or high-throughput storage solutions.
  • Familiarity with production AI/ML deployment patterns (model serving, data versioning, pipeline orchestration).
  • Knowledge of database performance tuning for large datasets.
  • Experience in industries with similar large-scale imaging or data-intensive workflows.
  • Exposure to regulated medical-device or other compliance-heavy development processes.

Responsibilities

  • Serve as the embedded operations advocate within engineering teams building applications for digital pathology and AI workflows.
  • Design, deploy, and maintain Kubernetes-based container orchestration platforms using Helm, ensuring scalability for large datasets and compute-intensive services.
  • Develop and maintain CI/CD pipelines (GitHub Actions) that support rapid iteration, testing, and safe production releases.
  • Optimize infrastructure for storage, transfer, and processing of very large files and high-throughput AI workloads.
  • Expand application observability through metrics and traces (OpenTelemetry/DataDog), and strengthen logging and alerting practices.
  • Use Infrastructure-as-Code (Terraform) to provision and evolve required cloud resources alongside engineering teams.
  • Partner with developers to streamline deployment of AI models and data pipelines into production.
  • Manage artifacts in private repositories to ensure secure, reliable distribution across environments.
  • Improve resiliency, recovery workflows, and operational readiness across services.
  • Collaborate with cross-functional partners to balance competing priorities in a fast-moving, innovation-driven environment.

Benefits

  • In addition to competitive pay, we ensure everyone on our team is supported with savings, schedule, and insurance options that promote long-term health and personal growth.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

101-250 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service