Stage 4 Solutions - posted 9 months ago
Senior
Remote • Palo Alto, CA
Professional, Scientific, and Technical Services

We are seeking a Staff Infrastructure Engineer to design, deploy, and optimize infrastructure supporting the data acquisition and observability stack. This role focuses on Infrastructure as Code (IaC), platform management, and performance tuning in high-scale distributed environments. This is a 12-month, 40 hr/week remote contract, with possible extension or conversion to FTE. It is a W2 role as a Stage 4 Solutions employee. Health benefits and 401K are offered.

  • Architect, build, and manage scalable infrastructure to support data pipelines and observability services.
  • Develop and maintain Infrastructure as Code (IaC) solutions (Terraform, Pulumi, Ansible).
  • Optimize platform performance for low-latency, high-throughput workloads.
  • Deploy and manage Kubernetes clusters and containerized services.
  • Implement automated deployment, CI/CD, and system monitoring solutions.
  • Ensure networking, security, and system reliability across cloud and bare-metal environments.
  • Collaborate with software engineers on system integration and data engineers on storage and schema management.
  • Bachelor's degree or higher in Computer Science, Engineering, or another related technical field.
  • 10+ years in infrastructure engineering, platform deployment, or DevOps managing high-performance data pipelines.
  • Expertise in Infrastructure as Code (IaC) tools (Terraform, Pulumi, Ansible).
  • Hands-on experience with Kubernetes, container orchestration, and service mesh architectures.
  • Strong understanding of high-performance networking for data pipelines.
  • Experience with streaming/messaging platforms (Kafka, Pulsar, Flink, RabbitMQ).
  • Strong background in real-time data pipelines and observability stacks (OpenTelemetry, Prometheus, Grafana, ELK).
  • Deep knowledge of high-performance databases (Postgres, InfluxDB, Prometheus, Elasticsearch, Cassandra).
  • Proficiency in cloud platforms (AWS, GCP, Azure) and on-prem infrastructure.
  • Experience with observability tools (Prometheus, Grafana, OpenTelemetry, ELK stack).
  • Knowledge of CI/CD workflows and automated deployment strategies.
  • Able to work effectively across a number of different teams and functions.
  • Excellent communication skills.
  • Health benefits
  • 401K