Full Stack Software Engineer (AI Infrastructure), Level 3

Intezra
Columbia, MD
$193,000 - $235,000

About The Position

We are seeking a senior Full Stack Software Engineer (SWE3) to lead the design, development, and operation of enterprise-scale AI infrastructure platforms. In this role, you will own critical components of the AI platform, with a focus on scalable inference services and the broader ecosystem of AI-enabled applications. This position combines hands-on technical leadership with people leadership responsibilities. You will guide a small, integrated engineering team within a larger AI platform organization while remaining deeply involved in architecture, implementation, and operational excellence. This role is ideal for engineers who enjoy balancing technical depth with leadership, influence, and cross-team coordination.

Requirements

  • Extensive experience designing, building, and operating large-scale production systems.
  • Deep expertise in systems integration across diverse technologies and platforms.
  • Hands-on experience with cloud engineering, preferably AWS.
  • Advanced proficiency with Kubernetes administration and deployment patterns.
  • Strong Python programming skills.
  • Experience implementing and scaling observability solutions, including APM, OpenTelemetry, Grafana, and Prometheus.
  • Proven ability to lead technical initiatives and influence organizational change.
  • Experience developing and enforcing technical policies and governance frameworks.
  • Excellent communication, stakeholder management, and leadership skills.
  • Ability to balance hands-on engineering with leadership, coordination, and strategic responsibilities.

Nice To Haves

  • Experience with AI inference serving technologies such as vLLM, LiteLLM, or similar platforms.
  • Previous experience with agentic frameworks such as LangChain.
  • Knowledge of vector databases and embedding systems.
  • Experience with distributed systems or high-performance computing environments.
  • Demonstrated track record of driving technical and cultural change in engineering organizations.

Responsibilities

  • Design, implement, and optimize infrastructure supporting AI model inference at scale.
  • Lead the development and maintenance of production AI services and applications, including retrieval-augmented generation (RAG), autonomous agents, and emerging AI technologies.
  • Serve as technical lead for AI infrastructure initiatives, coordinating work across integrated engineering teams.
  • Provide people leadership through regular one-on-ones, coaching, feedback, and professional development support.
  • Act as the primary point of contact for operational and administrative coordination related to the team.
  • Navigate ambiguous and complex problem spaces, defining scalable and maintainable solutions.
  • Establish and evolve technical standards, policies, and governance frameworks where gaps exist.
  • Drive adoption of new tools, technologies, and best practices across engineering teams.
  • Implement and oversee monitoring, logging, and observability solutions for AI services.
  • Ensure high availability, reliability, performance, and security of AI platform components.
  • Communicate technical strategy and status effectively to stakeholders at multiple organizational levels.

Benefits

  • Three CareFirst medical plans available; Intezra pays up to 100% of healthcare premiums and up to 100% of deductibles (based on plan selection) for employees and dependents
  • Intezra pays 100% for CareFirst Dental and Vision plans for employees and dependents
  • 401(k): 15% company contribution (no match required)
  • PTO: 160 hours, increasing with seniority
  • 12 Floating Holidays
  • 4 Code Red Days