Data Center Manager (Dallas, TX)

LambdaDallas, TX
Onsite

About The Position

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute as ubiquitous as electricity and give everyone the power of superintelligence. One person, one GPU. If you'd like to build the world's best AI cloud, join us. Note: This position requires presence in our Dallas, TX Data Center 5 days per week.

Requirements

  • Have 5+ years experience with critical infrastructure systems supporting data centers, such as power distribution, air flow management, environmental monitoring, capacity planning, DCIM software, structured cabling, and cable management
  • Have basic understanding of Linux administration
  • Have experience in setting up networking appliances (Ethernet and InfiniBand) across multiple data center locations
  • Are someone who pays attention to detail and has the ability to follow instructions
  • Are action-oriented and have a strong willingness to learn
  • Have a desire to mentor other team members and help them reach their full potential

Nice To Haves

  • Experience with troubleshooting and theoretical knowledge the following network layers, technologies, and system protocols: TCP/IP, OSPF, SNMP, SSL, HTTP, FTP, SSH, Syslog, DHCP, DNS, RDP, NETBIOS, IP routing, Ethernet, switched Ethernet, 802.11x, NFS, and VLANs
  • Experience with working in large-scale distributed data center environments
  • Experience working with auditors to meet all compliance requirements (ISO/SOC)
  • Experience Supermicro & Nvidia hardware
  • Previous data center team management experience

Responsibilities

  • Manage and lead a team of data center technicians
  • Maintain high availability, reliability, and security in the data center environment
  • Ensure new server, storage and network infrastructure is properly racked, labeled, cabled, and configured
  • Troubleshoot hardware and software issues in some of the world’s most advanced systems
  • Document data center layout and network topology in DCIM software
  • Work with supply chain & manufacturing teams to ensure timely deployment of systems and project plans for large-scale deployments
  • Assess current and future state data center requirements based on growth plans and technology trends
  • Manage a parts depot inventory and track equipment through the delivery-store-stage-deploy-handoff process in each of our data centers
  • Create installation standards and documentation for placement, labeling, and cabling to drive consistency and discoverability across all data centers
  • Oversee deployments and day-to-day operations of the data center
  • Maintain uptime for assets and infrastructure, and ensure customer SLAs are met
  • Participate in technical discussions and provide expertise on data center integration and deployment strategies
  • Understand power/cooling requirements as well as cabling needs required within data center space to support high performance infrastructures.
  • Work closely with cross-functional teams, including Hardware Engineering, Software Engineering, Supply Chain, Customer Experience and Sales, to align data center solutions with business goals
  • Ensure the data center complies with Lambda’s standards and policies

Benefits

  • We offer generous cash & equity compensation
  • Health, dental, and vision coverage for you and your dependents
  • Wellness and commuter stipends for select roles
  • 401k Plan with 2% company match (USA employees)
  • Flexible paid time off plan that we all actually use
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service