Senior DevOps Engineer

DealpathNew York, NY
Hybrid

About The Position

As a Senior DevOps Engineer, you will be a critical member of our infrastructure team based in New York, providing earlier-hours coverage and support for our cloud-based platform as we scale into a multi-shard environment. You will work closely with software engineers to deploy code at scale, drive SRE practices, and lead infrastructure reliability, observability, and security initiatives. This is a high-visibility, highly independent role that requires strong ownership. You will be instrumental in expanding our logging, monitoring, and observability tooling, and will also play a key part in setting up our Google Cloud Platform (GCP) infrastructure as we expand our AI capabilities.

Requirements

  • B.S. in Computer Science or equivalent experience.
  • 8+ years of DevOps/SRE experience in a production environment.
  • Deep experience deploying and managing infrastructure on Amazon Web Services (EC2, S3, RDS, EKS, Opensearch and related services).
  • Strong understanding of networking fundamentals, Linux systems administration, and modern web architectures (HTTP, REST).
  • Proficiency in shell scripting and at least one scripting language (Python, Ruby, etc.).
  • Hands-on experience building and maintaining logging, monitoring, alerting, and observability systems.

Nice To Haves

  • Experience with Google Cloud Platform (GCP) — particularly IAM, networking, and managed services.
  • Familiarity with Gemini Enterprise Agent Platform or Vertex AI infrastructure setup.
  • Experience scaling infrastructure to support AI/ML workloads.
  • Database administration experience (PostgreSQL, MySQL, or similar).
  • Experience with automation and configuration management tools such as Ansible, Chef, or Puppet.
  • Experience with multi-shard or multi-tenant SaaS architectures.

Responsibilities

  • Provide infrastructure support and on-call coverage from New York, ensuring east-coast availability during early business hours.
  • Manage and scale our multi-shard AWS environment, maintaining high availability, performance, and security.
  • Own SRE responsibilities: incident response, reliability engineering, and improving system uptime and resilience.
  • Build and expand logging, monitoring, alerting, and observability tooling across our cloud infrastructure.
  • Work with engineering teams to ensure new features are deployed quickly, safely, and easily.
  • Spearhead network security and compliance initiatives, implementing fine-grained access controls.
  • Lead the setup and configuration of GCP infrastructure, including permissions, IAM, and AI platform services (Gemini Enterprise Agent Platform / Vertex AI).
  • Design and implement automation and configuration management to streamline deployment processes.

Benefits

  • Medical, dental, and vision insurance.
  • Health Savings Account (HSA) & Flexible Spending Account (FSA) options.
  • 401(k) retirement plan.
  • Paid Parental Leave.
  • Flexible Time Off (FTO) policy.
  • Commuter benefits program.
  • Monthly wellness reimbursement to support physical and mental well-being.
  • Equity plan.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service