Firmware Triage Lead

Pure Storage Inc.Santa Clara, CA
69dOnsite

About The Position

We're in an unbelievably exciting area of tech and are fundamentally reshaping the data storage industry. Here, you lead with innovative thinking, grow along with us, and join the smartest team in the industry. This type of work-work that changes the world-is what the tech industry was founded on. So, if you're ready to seize the endless opportunities and leave your mark, come join us. THE ROLE You will be the central owner and analytical leader for the firmware triage process, ensuring the swift and accurate resolution of complex, deep-level storage issues across our industry-leading product lines. This is a critical technical leadership position where your primary mission is to drive product quality improvements by leading root-cause identification, coordinating cross-functional engineering efforts, and delivering data-driven insights to executive stakeholders. You will bridge advanced technical debugging with operational excellence to safeguard our customer experience.

Requirements

  • Domain Expertise in Storage Firmware: Strong, in-depth technical knowledge of SSD technology, NAND flash media characteristics, and the Flash Translation Layer (FTL) within embedded storage systems.
  • Advanced Debugging and Root Cause Analysis: Highly proficient ability to conduct deep technical failure analysis using standard embedded debugging tools, event logs, and crash dumps to isolate complex firmware defects.
  • Technical Process Leadership: Proven track record of leading, managing, and optimizing a complex technical triage or debug process across multiple geographically distributed engineering teams.
  • Data and Analytics Fluency: Expert capability in using analytics platforms (e.g., Jira filters/dashboards) to process large datasets, extract meaningful quality metrics, and drive data-informed decisions and communications.
  • Scripting for Automation: Solid experience utilizing Python or similar scripting languages to develop tools for issue reproduction, test automation, and efficient data analysis.

Responsibilities

  • Own the End-to-End Triage Workflow: Define, manage, and optimize the daily triage process, acting as the primary point of control to prioritize, assign, and track all incoming firmware defects, specifically within the Direct Flash Module (DFM) and SSD layers.
  • Drive Failure Analysis and RCA: Lead the technical investigation into critical firmware failures, utilizing your expertise in embedded systems, NAND flash characteristics, and debugging tools to identify the fundamental root cause and ensure fixes are targeted and effective.
  • Synthesize and Present Product Quality Insights: Design, maintain, and leverage Jira dashboards and analytics platforms to transform raw defect data into actionable reports, clearly communicating current product health, key quality trends, and risk exposure to engineering and product leadership.
  • Streamline Engineering Feedback Loops: Apply expertise in CI/CD systems (like Jenkins) and scripting (Python) to automate issue reproduction, failure injection, and streamline the feedback process between triage, development, and validation to accelerate time-to-resolution.
  • Influence Product Improvement: Continuously collaborate with Engineering Managers to refine triage priorities based on customer impact and business needs, and propose systemic process and technical improvements to prevent recurrence of critical failures.

Benefits

  • flexible time off
  • wellness resources
  • company-sponsored team events

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Computer and Electronic Product Manufacturing

Education Level

No Education Listed

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service