DevOps Engineer

Two95 International Inc.Princeton, NJ

About The Position

We are seeking a DevOps Engineer to strengthen our software delivery and infrastructure operations capability. This role is responsible for designing, implementing, and maintaining scalable infrastructure, automation frameworks, and CI/CD pipelines that enable engineering teams to deliver secure and reliable software at speed. The ideal candidate operates comfortably at the intersection of software engineering, cloud infrastructure, and operational reliability, with a strong focus on automation, observability, and continuous improvement.

Requirements

  • 3 to 6+ years experience in DevOps, Site Reliability Engineering, or Infrastructure Engineering.
  • Strong experience with several of the following:
  • Cloud Platforms: Azure and AWS
  • Infrastructure as Code: Terraform
  • Containers & Orchestration: Docker, Kubernetes
  • Experience operating production Kubernetes clusters
  • CI/CD Platforms: Jenkins, Argo CD, Azure DevOps
  • Configuration Management: Ansible
  • Monitoring & Observability: Prometheus, Grafana, ELK / OpenSearch, Datadog or similar tools
  • Scripting & Development: Python, Bash
  • Version Control: Git and Git-based workflows

Nice To Haves

  • Experience implementing DevSecOps pipelines
  • Knowledge of zero-trust security models

Responsibilities

  • Design, deploy, and maintain cloud-native infrastructure using Infrastructure as Code (IaC) practices.
  • Manage and optimize environments across development, staging, and production.
  • Implement high availability, scalability, and fault tolerance in platform architecture.
  • Support containerized workloads using Docker and Kubernetes.
  • Build and maintain CI/CD pipelines to automate build, testing, and deployment workflows.
  • Integrate security, testing, and compliance controls into the pipeline (DevSecOps practices).
  • Reduce manual operational work through automation and scripting.
  • Implement monitoring, alerting, and observability solutions.
  • Maintain platform reliability through incident response, root cause analysis, and continuous improvement.
  • Define and track SLOs, SLIs, and operational KPIs.
  • Work closely with development teams to optimize deployment pipelines and runtime environments.
  • Support developers in adopting cloud-native and containerization best practices.
  • Contribute to internal documentation and knowledge sharing.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service