AI Engineer III - Blue Ring

BLUE ORIGINReston, VA
1dOnsite

About The Position

At Blue Origin, we envision millions of people living and working in space for the benefit of Earth. We’re working to develop reusable, safe, and low-cost space vehicles and systems within a culture of safety, collaboration, and inclusion. Join our team of problem solvers as we add new chapters to the history of spaceflight! The role is part of the In-Space Systems business unit, which is focused on addressing two of the most compelling challenges in spaceflight today: space infrastructure and increasing mobility on-orbit. Blue Ring is Blue Origin's multi-mission space mobility platform, and this team is building the next generation of intelligent ground systems that will redefine how satellite fleets are operated—moving beyond manual monitoring and reactive processes. We are seeking an AI Engineer III to join the Blue Ring software team. You will help develop and deliver the AI and machine learning capabilities embedded in Blue Ring's ground system, spanning multi-agent monitoring, predictive anomaly detection, conversational knowledge retrieval, and AI-assisted decision-making that enable a small operations team to manage an entire satellite fleet. This role requires a hands-on engineer with deep expertise across the full AI/ML development lifecycle, from building multi-agent systems and retrieval-augmented generation pipelines to training and deploying time-series ML models on real spacecraft telemetry.

Requirements

  • Able to work onsite in one of our Kent, WA, Renton, WA, or Reston, VA offices.
  • Bachelor's or Master's degree in Computer Science, Machine Learning, Electrical Engineering, or a related field
  • 5+ years of professional software development experience with a focus on AI/ML applications in production environments
  • Proficiency in Python and at least one AI/ML framework (PyTorch, TensorFlow)
  • Hands-on experience building and deploying multi-agent systems, agentic AI workflows, or LLM-based applications in production
  • Experience designing and operating RAG pipelines, including vector databases, embedding models, and retrieval strategies
  • Experience with time-series ML models (RNNs, LSTMs, Transformers) for anomaly detection or forecasting on sensor/telemetry data
  • Experience with cloud platforms (AWS preferred), containerization (Docker, Kubernetes), and CI/CD pipeline implementation
  • Knowledge of professional software engineering practices including code reviews, source control, automated testing, and operational excellence
  • Strong analytical and problem-solving skills with attention to detail
  • Excellent written and verbal communication skills for documentation and cross-team collaboration

Nice To Haves

  • Experience with AWS Bedrock, SageMaker, or equivalent managed AI/ML services
  • Experience with Databricks or similar platforms for large-scale data engineering and ML model training
  • Familiarity with spacecraft operations, satellite telemetry, or aerospace systems
  • Experience with reinforcement learning, including multi-agent RL, deep Q-networks, or policy gradient methods
  • Experience with graph neural networks for network optimization or topology-aware problems
  • Background in model compression, quantization, or edge deployment for resource-constrained environments
  • Experience with real-time data streaming systems (Kafka, Kinesis) and high-frequency data ingestion
  • Understanding formal verification, safety-critical systems, or graduated autonomy frameworks
  • Experience building observability and audit systems for AI decision pipelines in regulated or mission-critical domains
  • Research publications in machine learning, multi-agent systems, or applied AI, and/or open-source contributions

Responsibilities

  • Multi-Agent System Development: Implement and optimize multi-agent systems that coordinate across telemetry analysis, anomaly detection, flight dynamics, and mission planning domains
  • Build and maintain agent orchestration frameworks that integrate with LLM services and ground system services for reliable agent execution with observability and memory
  • Develop and deploy RAG pipelines over flight manuals, operations procedures, design documents, and historical log files
  • Manage vector database infrastructure, embedding pipelines, and retrieval strategies to deliver accurate answers to operator queries
  • Implement human-in-the-loop decision workflows where agents present investigation plans and remediation options for operator approval
  • Develop confidence scoring and validation systems for AI recommendations
  • Deploy containerized AI services on AWS with infrastructure-as-code, CI/CD pipelines, observability, and automated testing
  • Support on-call rotations and drive root-cause analysis for production issues
  • Model Development (Post-training and Time-series Analysis)
  • Develop anomaly detection models that identify telemetry deviations beyond threshold-based monitoring
  • Train and deploy sequence-based models for failure prediction on telemetry data
  • Build automated model retraining pipelines that incorporate operator feedback and new flight data
  • Explore compressed models for resource-constrained deployment scenarios

Benefits

  • Medical, dental, vision, basic and supplemental life insurance, paid parental leave, short and long-term disability, 401(k) with a company match of up to 5%, and an Education Support Program.
  • Paid Time Off: Up to four (4) weeks per year based on weekly scheduled hours, and up to 14 company-paid holidays.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service