Data Center Engineer

Roblox, Ashburn, VA
Onsite

About The Position

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there. A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

As a Data Center Technician II, you'll independently manage and prioritize host repair efforts for multiple datacenters, perform initial troubleshooting for ambiguous hardware and network issues, and own well-defined projects with guidance from senior engineers while helping us scale our Core/Edge Data Centers and hardware infrastructure at a time of incredible growth for our business.

Requirements

  • 3+ years of experience working in large-scale Data Center Infrastructure environments, including planning, executing, and documenting repairs in the server and networking domains.
  • Extensive experience installing, monitoring, and maintaining server and network equipment. This includes brand new server and network provisioning.
  • In-depth knowledge of data center environments, servers, and network equipment.
  • Proven experience executing on multiple tasks simultaneously.
  • Proficiency with server out‑of‑band management tools to perform initial troubleshooting on servers, including when the operating system is not fully available.
  • Proficiency with Linux/Unix or Windows command-line tools to collect logs, run diagnostics, and perform initial troubleshooting on servers and network devices.
  • You have installed various equipment that commonly resides in the data center environment and are able to lift 75 pounds occasionally.

Responsibilities

  • Manage and prioritize your ticket queue according to defined priorities, performing initial troubleshooting for server and network issues, and escalating clearly when problems fall outside standard procedures.
  • Maintain the Core Data Center and hardware infrastructure to meet the large scale and real-time requirements of our Imagination Platform™ to ensure our community has an awesome experience anywhere in the world. This includes all aspects of the server, network infrastructure, power, and environmental life cycles.
  • Collaborate across regions to track and mitigate systemic issues preventing hosts from returning to service.
  • Identify and solve recurring operational problems through root cause analysis, and propose improvements to runbooks, SOPs, and MOPs to prevent recurrence.
  • Contribute data, feedback, and requirements to partners building automation, ensuring that automation reflects real-world operational workflows.
  • Coordinate with peers to establish and uphold best practices related to breakfix, install, decom and all other aspects of datacenter operations.
  • Influence and improve the development platform, infrastructure, standards (Runbooks, SOPs, MOPs), and methods to ensure that the goals of scalability and high availability can be achieved.
  • Leverage partnerships across teams to ensure prompt expansion and recovery of hardware capacity.
  • Actively participate in continuous improvement and ongoing learning within the engineering team.
  • Assist in coordinating vendors and ensuring the quality of outsourced projects.
  • Participate in the on-call rotation for our critical infrastructure.
  • Travel: International and domestic travel may be required 25% of the time.

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Education Level: No Education Listed
  • Number of Employees: 1,001-5,000 employees
