Kafka DevOps Engineer

CbSan Francisco, CA
Onsite

About The Position

We are seeking a skilled Kafka DevOps Engineer to build, manage, and support Kafka and NoSQL platforms in production environments. This role involves designing and implementing scalable platform architectures, developing automation tools, integrating AI/GenAI capabilities, and ensuring the reliability, availability, and security of our systems. The ideal candidate is a self-driven engineer who can take ownership of complex platform challenges, build innovative solutions, communicate effectively, and operate with minimal supervision in a fast-paced production environment.

Requirements

  • 8+ years of overall IT industry experience.
  • 5+ years of hands-on experience with Kafka or NoSQL technologies.
  • Strong programming skills in Python and/or Java, with a focus on automation and tooling.
  • Experience with CI/CD pipelines and Infrastructure as Code (IaC) tools such as Git, CloudFormation, and Terraform.
  • Experience with at least one cloud platform: AWS, Azure, or Kubernetes-based environments.
  • Experience building AI-powered solutions, MCP Servers, Agentic AI systems, or GenAI-based automation tools.
  • Strong Linux/Unix administration and troubleshooting experience.
  • Excellent analytical, debugging, problem-solving, verbal, and written communication skills.
  • Amazon Web Services (AWS) No 2-5 Years Is Required
  • Amazon Web Services S3 (AWS S3) No 2-5 Years Is Required
  • Amazon Web Services EKS (AWS EKS) No 2-5 Years Is Required
  • Apache Kafka No 2-5 Years Is Required
  • Artificial Intelligence No At least 1 year Is Required
  • AWS-EC2 No 2-5 Years Is Required
  • GitHub No 2-5 Years Is Required
  • Kubernetes No 2-5 Years Is Required
  • NoSQL No 5-10 Years Is Required
  • python No 5-10 Years Is Required

Nice To Haves

  • Experience with DevOps and Site Reliability Engineering (SRE) practices.
  • Strong production support, incident management, issue triaging, and root cause analysis experience.
  • Experience with Docker and Kubernetes administration, deployment, and performance tuning.
  • Knowledge of security best practices, vulnerability management, CVE analysis, and monitoring cloud/system/device logs.
  • Experience designing self-service platforms and operational automation solutions.

Responsibilities

  • Build, manage, and support Kafka and NoSQL platforms in production environments.
  • Design, implement, and maintain scalable platform architectures and deployment solutions.
  • Develop and maintain automation tools for infrastructure provisioning, monitoring, and operational workflows.
  • Integrate AI/GenAI capabilities into operational tools and platform management processes.
  • Design and implement CI/CD pipelines and Infrastructure as Code solutions.
  • Execute and manage code deployments across development, testing, staging, and production environments.
  • Troubleshoot and resolve platform, infrastructure, and application issues across all environments.
  • Monitor system performance, reliability, availability, and security, and drive continuous improvements.
  • Collaborate with development, operations, and architecture teams to improve platform efficiency and developer productivity.
  • Drive operational excellence through automation, observability, reliability engineering, and proactive issue resolution.

Benefits

  • Company parties
  • Health insurance
  • Opportunity for advancement
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service