Systems Administrator

TEKsystemsSanta Clara, CA
Onsite

About The Position

The team supports the bring-up, testing, validation, release, and lifecycle management of non-production systems. They inherit newly built lab or data center environments after build-out and ensure systems remain stable, operational, and test-ready. Platforms supported include Enterprise (DGX, workstations, liquid cooled enterprise systems), Tegra (Robotics and Automotive), and Compute (build/compile servers; largely remote capable). Engineers are expected to work hands-on with hardware, debug issues across hardware, OS, firmware, networking, and automation, and own systems end-to-end in failure-prone, pre-production environments. There are four roles available, not all identical: two baseline platform engineers, one engineer with stronger automation, debugging depth, or project leadership, and one with a specialized focus potentially around liquid cooling or mechanical/system integration. The team values complementary skill sets.

Requirements

  • GPU
  • Linux
  • liquid cooling
  • System administrator
  • Python
  • Intermediate Level experience
  • On-site presence is required
  • System ownership
  • Hands-on work with hardware

Responsibilities

  • Own the bring up, validation, and lifecycle management of pre-production systems across Enterprise platforms (DGX, workstations, liquid cooled enterprise systems), Tegra platforms (Robotics, Automotive), and Compute systems used for software build and compilation.
  • Perform advanced troubleshooting across hardware (boards, GPUs, CPUs, interconnects), operating systems (Linux variants and other OS environments), firmware/flashing/board enablement, and networking & connectivity issues.
  • Debug systems that are intentionally non-production and failure-prone, ensuring issues are caught internally rather than by customers.
  • Use scripting, tooling, and automation to validate platform health, accelerate bring up and testing workflows, and reduce manual, repetitive tasks.
  • Contribute to SOPs and standardized processes in partnership with Infra/SRE and Data Center Operations teams.
  • Perform hands-on work on hardware, including installing, configuring, and validating systems.
  • Coordinate with DC Ops for heavy lifting, racking, stacking, and cabling.
  • Follow strict handling procedures for pre-production, high-value hardware.
  • Interface closely with internal partners (IPP, enterprise teams, validation, software).

Benefits

  • Medical, dental & vision
  • Critical Illness, Accident, and Hospital
  • 401(k) Retirement Plan – Pre-tax and Roth post-tax contributions available
  • Life Insurance (Voluntary Life & AD&D for the employee and dependents)
  • Short and long-term disability
  • Health Spending Account (HSA)
  • Transportation benefits
  • Employee Assistance Program
  • Time Off/Leave (PTO, Vacation or Sick Leave)

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

501-1,000 employees

© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service