About The Position

AWS Infrastructure Services is responsible for the design, planning, delivery, and operation of all AWS global infrastructure, ensuring customers have continuous access to the innovation they rely on. This role is within a diverse team focused on challenging problems in the supply chain and data center operations. The Software Development Manager for the Eva Data Center Assistant team will lead a team of Software Development Engineers in building AWS data center's agentic GenAI platform. This platform powers AI-assisted operations across global data center infrastructure. The manager will own the technical vision and strategic roadmap for the Eva platform, driving investments in agentic AI systems, full-stack serverless engineering, and search/knowledge systems. This leadership will shape a next-generation AI/ML platform for a large, globally distributed user base, focusing on orchestrating physical work processes, automating decision-making, and enhancing operational efficiency. The role involves championing platform thinking to build reusable primitives, APIs, and extensible components for use by numerous teams within the Data Center Community. The manager will drive the design and delivery of production-grade agentic systems, including LLM orchestration, tool-calling patterns, agent frameworks, and intelligent workflow automation. Collaboration with cross-functional stakeholders such as data center operations, controls engineering, product management, and peer engineering teams is crucial for translating operational needs into scalable AI-powered solutions. The role also includes establishing and improving engineering practices like code reviews, CI/CD, progressive deployment, observability, and operational readiness for ML/AI systems. Additionally, the manager will own hiring strategy and talent development, building a high-performing team with expertise in generative AI, distributed systems, and full-stack development, and communicating platform strategy and impact to senior leadership.

Requirements

  • 3+ years of engineering team management experience
  • 7+ years of working directly within engineering teams experience
  • Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations
  • Experience partnering with product or program management teams
  • 3+ years of developing large-scale, multi-tiered distributed software systems using distributed programming experience

Nice To Haves

  • Experience delivering products against plan in a fast-paced, multi-disciplined, distributed-responsibility and often ambiguous environment
  • Experience in recruiting, hiring, mentoring/coaching and managing teams of Software Engineers to improve their skills, and make them more effective, product software engineers
  • Knowledge of ML, NLP, Information Retrieval and Analytics
  • Experience working with fast-moving, high-performance teams and driving innovative solutions tailored to unique business environments
  • Experience leading teams building AI/ML or generative AI systems in production, including LLM-based applications, agentic architectures, or RAG systems

Responsibilities

  • Lead and mentor a team of SDEs building and operating the Eva agentic GenAI platform, fostering a culture of ownership, innovation, and operational excellence
  • Own the end-to-end technical roadmap for Eva, balancing investments across agentic AI capabilities, platform infrastructure, frontend experiences, search/knowledge systems workstreams
  • Drive the architecture and delivery of agentic AI systems including LLM orchestration, prompt engineering, skills, harness, tool-calling patterns, semantic search, and agent frameworks (e.g., Strands)
  • Lead the development of full-stack serverless solutions leveraging AWS Lambda, API Gateway, DynamoDB, EventBridge, CDK, and related services to deliver scalable, production-grade platform capabilities
  • Own the design of search and knowledge systems including vector embeddings, hybrid retrieval, document processing pipelines, and semantic chunking to power Eva's intelligent responses
  • Build and evolve platform primitives and reusable components that enable dozens of teams across the Data Center Community to build AI-powered capabilities on top of Eva
  • Partner with data center operations, controls engineering, product management, and peer engineering teams to identify high-impact use cases and translate them into platform features
  • Establish and enforce engineering excellence including CI/CD pipeline design, progressive deployment, synthetic monitoring, observability (CloudWatch, X-Ray, OpenTelemetry), and operational readiness reviews
  • Own hiring, performance management, and career development for the team, building a diverse pipeline of engineers with expertise in GenAI, distributed systems, and full-stack development
  • Communicate platform strategy, project status, and business impact to senior leadership, driving alignment on priorities and resource allocation

Benefits

  • Sign-on payments
  • Restricted stock units (RSUs)
  • Health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • Paid time off
  • Parental leave
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service