About The Position

As a Senior AI DevOps and Cloud Infrastructure Engineer, you will be critical in building and maintaining the scalable, reliable, and secure cloud infrastructure for our AI initiatives. Your primary focus will be on GCP deployment architecture, implementing infrastructure-as-code principles, and establishing robust CI/CD pipelines and release cycles. You will ensure the highest standards of reliability, scalability, and security for our AI systems. A key responsibility will be translating compliance requirements into automated deployment rules, proactively safeguarding our solutions against prompt injection, and ensuring zero-downtime launches for applications scaling to thousands of concurrent users. Your deep GCP skills, combined with expertise in Terraform and CI/CD automation, will be essential in optimizing compute resources and leveraging event-driven architectures.

Requirements

  • Deep GCP skills, including GKE, CloudRun, and IAM.
  • Expertise in Terraform and CI/CD automation.
  • Proficiency in Python and React JS (full stack capability is a plus).
  • Experience with Google ADK/Vertex AI.
  • Strong understanding of GCP Compute FinOps & Resource Optimization.
  • Knowledge of event-driven architectures.

Nice To Haves

  • Proven ability to engineer deployment pipelines that automate compliance and security checks.
  • Experience in securing LLM solutions and ensuring zero-downtime scaling.

Responsibilities

  • Design and implement GCP deployment architectures using infrastructure-as-code principles.
  • Establish and optimize CI/CD pipelines and release cycles for AI applications.
  • Ensure high reliability, scalability, and security of AI infrastructure.
  • Translate compliance requirements into automated deployment rules, generating security and model risk evidence.
  • Proactively identify and mitigate vulnerabilities, including red-teaming and securing LLM solutions against prompt injection.
  • Lead zero-downtime launches and ensure the infrastructure scales seamlessly to support thousands of concurrent users.
  • Optimize GCP compute FinOps and resource utilization.
  • Implement and manage event-driven architectures for AI workflows.

Benefits

  • A diverse and inclusive environment that embraces change, innovation, and collaboration
  • A hybrid working model, allowing for in-office / work from home flexibility, generous vacation, personal and volunteer days
  • Employee Resource Groups support an inclusive workplace for everyone and promote community engagement
  • Competitive compensation packages including health and wellbeing benefits, retirement savings plans, parental leave, and family building benefits
  • Educational resources, matching gift and volunteer programs
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service