Staff Reliability Engineer

ZT SystemsSecaucus, NJ
122d$105,000 - $154,000

About The Position

Our reliability team is responsible to evaluate, develop, design, and implement software and product reliability test regimens to ensure ZT products of the highest quality are delivered to our customers. We are looking for a passionate Reliability Engineer with exceptional knowledge/ experience developing and manufacturing scalable infrastructures. You will be working with the latest technologies that go into building a hyperscale cloud services.

Requirements

  • Minimum B.S. in Electrical Engineering, Computer Science/Engineering, or Software development.
  • 2+ years of relevant work experience.
  • Knowledge of computer systems/hardware structure, as well as switch/network interfaces.
  • Knowledge and/or experience with programming languages like Python or Unix (Bash and/or PowerShell).
  • Knowledge of statistical & probability techniques and reliability modeling.
  • Ability to communicate, collaborate and lead cross-functionally to resolve issues.

Nice To Haves

  • Fundamental knowledge of Computer Architecture, Server architecture at the block level, and Hardware/Firmware/OS interactions.
  • Working knowledge of PCBA (printed circuit board assembly) design, fabrication, and validation testing.
  • Experience using tools such as ReliaSoft & JMP statistical software packages.
  • Working knowledge of electronic components/devices and their failure modes & failure mechanism.
  • Knowledge of industry standards, IPC, JEDEC, Telcordia, and MIL-STD.

Responsibilities

  • Use Design for Reliability principles to ensure cloud hardware meets specified use-conditions and stresses.
  • Act as the internal consultant on all reliability matters and interface with program management, vendors, and design engineering.
  • Support the Software/script development needs of the reliability team.
  • Create or revise reliability engineering guidelines to improve product field performance.
  • Use principles of performance evaluation and prediction to improve reliability and maintainability of Cloud Infrastructure servers.
  • Identify, collect, analyze, and manage various types of data to minimize failures and improve product performance.
  • Develop scripts that represent the expected environment and operational conditions.
  • Collaborate with other development functional teams and internal stakeholders regarding Design for Reliability principles.

Benefits

  • Competitive base salary.
  • Performance-based annual bonus eligibility.
  • 401(k) retirement savings plan with generous company match.
  • Tuition reimbursement for eligible education programs.
  • Comprehensive medical, dental, and vision coverage.
  • Mental health resources and employee wellness support programs.
  • Company-paid life and disability insurance.
  • Generous paid time off (PTO) and company-paid holidays.
  • Parental leave and family care support programs.
  • Structured training programs and on-the-job learning opportunities.
  • Matching gifts and volunteer programs.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service