Production Systems Engineer

MetaMenlo Park, CA
3d$118,000 - $170,000

About The Position

Meta is seeking a forward thinking, experienced candidate to join the Hardware Design, Release to Production (HDRTP) team as a Foundation Labs Engineer. Our mission is to imagine, build and deploy industry-leading hardware systems to fuel Meta's products and services for billions of people around the world. Within our Labs, you'll have the opportunity to help deliver AI technology that is fueling Meta's products and services for billions of people around the world. Our team works in a fast-paced environment where change is constant, every day brings new challenges and collaboration is key.

Requirements

  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Experience communicating across multiple groups and types of work
  • Experience influencing cross functional teams
  • Experience managing technical projects with a high degree of ambiguity
  • 2+ years experience in one or more of the following core areas: Capacity Planning, Networking, Project Management, Tooling and Automation, Hardware Design, Systems Administration, Hardware Validation (NPI), or Data Center Operations
  • 2+ years of experience managing servers in a large-scale distributed environment

Nice To Haves

  • Hardware Debug Experience

Responsibilities

  • Drive labs participation in program design, test, phase exit, and retrospective efforts
  • Complex, open-ended troubleshooting and diagnostics for new hardware platforms
  • Troubleshoot, repair, document, and provide feedback for Linux-based data center hardware platforms
  • Work closely with remote hardware design and validation teams, and vendors to deploy and manage new server, storage, and networking products in the data center infrastructure
  • Test and troubleshoot new hardware products and components with minimal documentation and direction
  • Manage full lifecycle for lab hardware assets from provisioning through decommissioning
  • Identify, characterize, and root cause hardware failures and error conditions
  • Collaborate with hardware teams by running small scale experiments, collecting data, and providing feedback on failure symptoms for lab and production servers
  • Drive cross-functional coordination & communication with other data center operations teams
  • Lead efforts to deliver operational and serviceability feedback on new hardware platforms
  • Serve as a local point of contact and subject matter expert regarding Lab activities and new hardware to operations staff
  • Maintain an efficient, orderly hardware test lab operation within the production data center

Benefits

  • bonus
  • equity
  • benefits
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service