Site Reliability Engineer

MetaPhase ConsultingWashington, DC
Hybrid

About The Position

MetaPhase is seeking a hands-on Site Reliability Engineer with a minimum of three or more (3+) years of experience to support a federal learning management system. This role is for an engineer who can build, deploy, monitor, troubleshoot, and improve production software while working across application code, AWS cloud infrastructure, CI/CD, and operations. This is an AI-native engineering role. The Site Reliability Engineer will use ChallengeAI by MetaPhase to accelerate development, maintenance, troubleshooting, documentation, testing, deployment, and production support. The environment includes modern web technologies such as React, TypeScript, PostgreSQL, GitHub Actions, and AWS.

Requirements

  • 3+ years of experience in software development, site reliability engineering, DevOps, cloud engineering, platform engineering, infrastructure engineering, or production application support
  • Hands-on experience building, deploying, supporting, or maintaining modern web applications
  • Experience with React, TypeScript, JavaScript, or comparable modern front-end technologies
  • Strong working knowledge of AWS, including core cloud concepts related to compute, networking, IAM, storage, monitoring, logging, security, and deployment operations
  • Experience with Git, GitHub, CI/CD concepts, and modern software development practices
  • Experience with AI-native development practices and enthusiasm for using generative AI to improve engineering productivity and software quality
  • Ability to troubleshoot across application, database, cloud, authentication, networking, and deployment layers
  • Must be located in the Washington, DC metro area and available to work from MetaPhase headquarters in Reston, Virginia and travel to client offices in Tysons, Virginia as needed
  • U.S. Citizenship is required
  • Must be eligible for Public Trust clearance and DHS Entry on Duty (EOD)

Nice To Haves

  • 5+ years of experience in software development, SRE, DevOps, cloud engineering, or production operations in a federal environment supporting FedRAMP-aligned cloud systems
  • Experience with learning management systems, training platforms, public-facing web applications, or SaaS products
  • Strong GitHub profile, portfolio, personal project, technical blog, demo, or other proof that you build and ship real software

Responsibilities

  • Support the delivery, deployment, operations, maintenance, and continuous improvement of a federal learning management system
  • Build, maintain, troubleshoot, and improve application functionality using React, TypeScript, JavaScript, PostgreSQL, and related modern web tools
  • Use ChallengeAI by MetaPhase as part of daily development, maintenance, troubleshooting, testing, documentation, and operations workflows
  • Support AWS deployment, configuration, monitoring, security, logging, backup, recovery, and cost-aware operations
  • Support GitHub-based development workflows, including pull requests, GitHub Actions, automated checks, and release coordination
  • Maintain and improve CI/CD pipelines, test automation, deployment scripts, rollback procedures, and operational runbooks
  • Monitor application health, performance, errors, uptime, logs, and security events to identify and resolve issues before they affect users
  • Participate in on-call support and provide occasional after-hours support for production incidents, releases, and urgent operational issues
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service