AI Infrastructure Engineer

Meshy LLC
Hybrid

About The Position

This role sits at the intersection of platform engineering, site reliability, and applied ML systems. The function owns the reliability, scalability, and operability of Meshy's AI model-serving stack, along with core engineering infrastructure. The team operates conventional production infrastructure (CI/CD, build systems, deployment, runtime environments) and develops a model-serving platform that connects the models developed by our Research Team to product-facing backend systems. The position is systems-heavy, production-oriented, and focused on turning experimental model artifacts into robust, observable, and cost-efficient services.

Requirements

  • Linux fundamentals
  • Networking fundamentals
  • Experience with Kubernetes
  • Experience with incident response
  • Experience with observability tools
  • Strong software engineering ability in Go or Python
  • Ability to reason about performance tradeoffs and measure before optimizing

Responsibilities

  • Own production reliability: availability, latency, error budgets, incident response, postmortems, and follow-ups
  • Build/maintain observability: metrics, logs, traces, alerting, SLOs/SLIs, dashboards
  • Improve deployment safety: CI/CD, rollout strategies (canary/blue-green), automated rollback, runbooks
  • Capacity planning + cost control: GPU/CPU sizing, autoscaling, queue/backpressure management, cost attribution
  • Security + compliance: secrets management, least privilege, patching, vulnerability response
  • Disaster recovery + operational readiness: backups, failover plans, game days
  • Develop and maintain the GPU inference serving stack (APIs, schedulers, workers, batching, caching)

Benefits

  • Competitive salary, equity, and benefits package.
  • Opportunity to work with a talented and passionate team at the forefront of AI and 3D technology.
  • Flexible work environment, with options for remote and on-site work.
  • Opportunities for fast professional growth and development.
  • An inclusive culture that values creativity, innovation, and collaboration.
  • Unlimited, flexible time off.
  • Stock options available for core team members.
  • 401(k) plan for employees.
  • Comprehensive health, dental, and vision insurance.
  • The latest and best office equipment.