Senior Infrastructure Engineer

Vast.aiLos Angeles, CA
68d$180,000 - $300,000

About The Position

Vast.ai’s cloud powers AI projects and businesses all over the world. We are democratizing and decentralizing AI computing—reshaping our future for the benefit of humanity. We are a growing and highly motivated team dedicated to an ambitious technical plan. Our structure is flat, our ambitions are bold, and leadership is earned by shipping excellence. We seek engineers with strong intrinsic drive, a true passion for advancing the state of the art, and a mix of architecture, coding, and communication skills. As a Senior Infrastructure Engineer, you will help design and scale the core systems that power Vast.ai’s global GPU marketplace. You’ll work closely with our founders and core engineering team to extend the underlying compute infrastructure — from GPU provisioning and scheduling to billing, orchestration, and marketplace dynamics. We’re looking for someone who has previously built large-scale infrastructure platforms — systems with similarities to Vast.ai, or distributed compute orchestration frameworks.

Requirements

  • Experience building high-throughput backend systems or compute clouds
  • Familiarity with Docker, or custom scheduling frameworks
  • Understanding of GPU provisioning, driver management, and workload scheduling
  • Implemented or integrated usage-based billing and account credit systems
  • Knowledge of dynamic pricing, spot instances, or supply-demand balancing mechanisms
  • Experience designing secure, multi-tenant systems in cloud environments
  • Strong programming skills in Python and C++; ability to write performant, maintainable, well-architected code
  • Comfortable designing schemas and queries for large-scale data systems (PostgreSQL preferred)

Nice To Haves

  • Experience with GPU security, virtualization, or zero-trust compute isolation
  • Prior startup experience or end-to-end product ownership

Responsibilities

  • Improve the backend systems that power Vast.ai’s compute marketplace
  • Integrate GPU provider onboarding, usage tracking, billing, and orchestration APIs
  • Develop scalable infrastructure for workload scheduling and resource management
  • Optimize pricing and marketplace logic for efficiency and transparency
  • Benchmark, profile, and harden systems for performance, reliability, and fault tolerance
  • Collaborate with product and infrastructure teams to shape the future of decentralized compute

Benefits

  • Comprehensive health, dental, vision, and life insurance
  • 401(k) with company match
  • Meaningful early-stage equity
  • Onsite meals, snacks, and close collaboration with founders and tech leads
  • Ambitious, fast-paced startup culture where initiative is rewarded
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service