AI Engineer III - Blue Ring

BLUE ORIGIN•Reston, VA

1d•Onsite

About The Position

At Blue Origin, we envision millions of people living and working in space for the benefit of Earth. We’re working to develop reusable, safe, and low-cost space vehicles and systems within a culture of safety, collaboration, and inclusion. Join our team of problem solvers as we add new chapters to the history of spaceflight! The role is part of the In-Space Systems business unit, which is focused on addressing two of the most compelling challenges in spaceflight today: space infrastructure and increasing mobility on-orbit. Blue Ring is Blue Origin's multi-mission space mobility platform, and this team is building the next generation of intelligent ground systems that will redefine how satellite fleets are operated—moving beyond manual monitoring and reactive processes. We are seeking an AI Engineer III to join the Blue Ring software team. You will help develop and deliver the AI and machine learning capabilities embedded in Blue Ring's ground system, spanning multi-agent monitoring, predictive anomaly detection, conversational knowledge retrieval, and AI-assisted decision-making that enable a small operations team to manage an entire satellite fleet. This role requires a hands-on engineer with deep expertise across the full AI/ML development lifecycle, from building multi-agent systems and retrieval-augmented generation pipelines to training and deploying time-series ML models on real spacecraft telemetry.

Requirements

Able to work onsite in one of our Kent, WA, Renton, WA, or Reston, VA offices.
Bachelor's or Master's degree in Computer Science, Machine Learning, Electrical Engineering, or a related field
5+ years of professional software development experience with a focus on AI/ML applications in production environments
Proficiency in Python and at least one AI/ML framework (PyTorch, TensorFlow)
Hands-on experience building and deploying multi-agent systems, agentic AI workflows, or LLM-based applications in production
Experience designing and operating RAG pipelines, including vector databases, embedding models, and retrieval strategies
Experience with time-series ML models (RNNs, LSTMs, Transformers) for anomaly detection or forecasting on sensor/telemetry data
Experience with cloud platforms (AWS preferred), containerization (Docker, Kubernetes), and CI/CD pipeline implementation
Knowledge of professional software engineering practices including code reviews, source control, automated testing, and operational excellence
Strong analytical and problem-solving skills with attention to detail
Excellent written and verbal communication skills for documentation and cross-team collaboration

Nice To Haves

Experience with AWS Bedrock, SageMaker, or equivalent managed AI/ML services
Experience with Databricks or similar platforms for large-scale data engineering and ML model training
Familiarity with spacecraft operations, satellite telemetry, or aerospace systems
Experience with reinforcement learning, including multi-agent RL, deep Q-networks, or policy gradient methods
Experience with graph neural networks for network optimization or topology-aware problems
Background in model compression, quantization, or edge deployment for resource-constrained environments
Experience with real-time data streaming systems (Kafka, Kinesis) and high-frequency data ingestion
Understanding formal verification, safety-critical systems, or graduated autonomy frameworks
Experience building observability and audit systems for AI decision pipelines in regulated or mission-critical domains
Research publications in machine learning, multi-agent systems, or applied AI, and/or open-source contributions

Responsibilities

Multi-Agent System Development: Implement and optimize multi-agent systems that coordinate across telemetry analysis, anomaly detection, flight dynamics, and mission planning domains
Build and maintain agent orchestration frameworks that integrate with LLM services and ground system services for reliable agent execution with observability and memory
Develop and deploy RAG pipelines over flight manuals, operations procedures, design documents, and historical log files
Manage vector database infrastructure, embedding pipelines, and retrieval strategies to deliver accurate answers to operator queries
Implement human-in-the-loop decision workflows where agents present investigation plans and remediation options for operator approval
Develop confidence scoring and validation systems for AI recommendations
Deploy containerized AI services on AWS with infrastructure-as-code, CI/CD pipelines, observability, and automated testing
Support on-call rotations and drive root-cause analysis for production issues
Model Development (Post-training and Time-series Analysis)
Develop anomaly detection models that identify telemetry deviations beyond threshold-based monitoring
Train and deploy sequence-based models for failure prediction on telemetry data
Build automated model retraining pipelines that incorporate operator feedback and new flight data
Explore compressed models for resource-constrained deployment scenarios

Benefits

Medical, dental, vision, basic and supplemental life insurance, paid parental leave, short and long-term disability, 401(k) with a company match of up to 5%, and an Education Support Program.
Paid Time Off: Up to four (4) weeks per year based on weekly scheduled hours, and up to 14 company-paid holidays.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume