Sr. Hardware Debug Engineer/Rack Solution (27402)

Super Micro Computer, Inc.
San Jose, CA

About The Position

Supermicro Computer is looking for an experienced Sr. Hardware Debug Engineer on the Rack Solutions team to help introduce and promote rack solution products to potential customers. The individual must have a deep understanding of rack solutions and market trends, and must promote SMC rack solutions that precisely fit customers' needs. The ideal candidate can identify potential customers, promote in-house solutions, and build solid relationships with customers to ensure customer satisfaction. The candidate will also perform system troubleshooting and debugging to improve system quality and production yield rate.

Requirements

  • Bachelor's or Master's degree in Electrical Engineering, Computer Engineering, or equivalent
  • 8+ years of experience in server HW debugging
  • Hardware Expertise: In-depth knowledge of server hardware components, architectures (such as x86 and ARM, plus NVIDIA and AMD GPUs), and technologies
  • Troubleshooting & Problem Solving: Possessing strong analytical and problem-solving skills to diagnose and resolve hardware-related issues efficiently
  • Diagnostic Tools Proficiency: Familiarity with NVIDIA DCGM and Field Diagnostics is a plus
  • Scripting Skills: Proficiency in scripting languages like Python and Bash for automating tasks and streamlining workflows
  • Circuit Design & Analysis: Understanding of analog and digital circuit design principles
  • Networking & Security Concepts: Familiarity with network infrastructure, protocols, and security best practices for server environments
  • Communication & Collaboration: Effectively communicating technical information with diverse teams and individuals
  • Attention to Detail: Ensuring precision and accuracy in troubleshooting and repair processes
  • Adaptability & Continuous Learning: Staying updated with emerging technologies and adapting to new challenges in the field

Responsibilities

  • Diagnosis and troubleshooting: Running automated diagnostic tests, interpreting results, analyzing logs, and systematically pinpointing hardware issues within servers and rack systems
  • Hardware repair and replacement: Implementing fixes or escalating complex problems to specialized engineering teams
  • System optimization: Working to ensure optimal thermal performance, signal integrity, and overall efficiency of server hardware
  • Collaboration: Working closely with various teams, including production, test, software engineering, and others, to ensure seamless integration and problem resolution
  • Documentation: Creating and maintaining detailed documentation of hardware designs, troubleshooting procedures, and test results
  • Staying Current: Continuously learning about new technologies and industry trends to stay ahead of the curve in a rapidly evolving field

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Industry: Computer and Electronic Product Manufacturing
  • Number of Employees: 5,001-10,000
