AI Platform Engineer

X-Energy, Rockville, MD
Hybrid

About The Position

This role is responsible for designing, implementing, and maintaining the cloud infrastructure and CI/CD systems that power X-energy's AI-native application platform (APEX). The AI Platform Engineer will work with the AI and Application Development team to accelerate deployment velocity, ensure system reliability, and enable the rapid delivery of AI-infused capabilities across engineering, manufacturing, regulatory, and deployment processes. This role requires expertise in containerization, cloud infrastructure automation, and modern DevOps practices to support X-energy's mission of becoming an AI-first organization and setting the industry standard for nuclear deployment speed and operational excellence.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, Engineering, or related field is required
  • 10+ years of relevant experience
  • 3+ years of hands-on experience with Docker containerization including building, testing, and deploying production applications
  • Proven experience administering GitLab CI/CD systems, including runner setup, pipeline configuration, and troubleshooting
  • Strong proficiency with Amazon Web Services (particularly ECS, ECR, and VPC) and with Terraform infrastructure-as-code
  • Experience managing AWS data services such as DocumentDB, OpenSearch, Redis, Aurora Postgres, or DynamoDB
  • Proficiency with Linux and/or macOS command-line environments
  • Demonstrated experience with release engineering and deployment automation
  • Knowledge of monitoring and observability tools (Datadog preferred)
  • Understanding of AI/ML concepts and LLM architectures
  • Proficiency with Claude Code or similar AI-assisted development tools
  • Strong problem-solving skills and ability to work independently
  • Excellent communication and collaboration skills
  • Ability to work a hybrid schedule, in the Rockville, MD office on Tuesdays, Wednesdays, and Thursdays

Responsibilities

  • Design, build, and maintain containerized applications using Docker, including image building, testing, versioning, and optimization for production deployment
  • Develop and maintain GitLab CI/CD pipelines, including runner configuration, pipeline optimization, monitoring dashboards, and automated testing workflows
  • Architect and manage AWS infrastructure using Terraform, including ECS (Elastic Container Service), ECR (Elastic Container Registry), VPC, EC2, ALB/NLB, and other cloud services
  • Administer and optimize AWS data services including DocumentDB, OpenSearch, Redis, Aurora Postgres, DynamoDB, and S3
  • Implement and maintain comprehensive monitoring, alerting, and observability solutions using Datadog and CloudWatch
  • Manage security and compliance requirements including ACM (AWS Certificate Manager) for certificate management and implementing security best practices across all infrastructure
  • Lead release engineering efforts, including versioning strategies, deployment automation, and rollback procedures
  • Collaborate with development teams to optimize application performance, troubleshoot production issues, and implement infrastructure improvements
  • Leverage Claude Code and AI tools to accelerate infrastructure development and maintenance tasks
  • Apply knowledge of LLMs and AI systems to support the platform's AI-native architecture
  • Maintain professional demeanor and behavior at all times in all forms of communication
  • Execute core tasks and responsibilities with minimal supervision in a fast-paced, team-oriented environment
  • Participate in on-call rotation to ensure system reliability and rapid incident response
  • Perform other duties as assigned by manager

Benefits

  • 401(k) plan with an employer match
  • Medical/Dental/Vision Insurance
  • Life and Disability Insurance
  • Paid Time Off
  • Tuition Reimbursement/Professional Development policy