This role involves leading the firmware triage process for storage products, conducting root cause analysis of complex firmware failures, and driving product quality improvements through data analysis, process optimization, and cross-functional collaboration.
Key Responsibilities
Own and optimize the firmware triage workflow for storage products
Lead failure analysis and root cause investigation of firmware issues
Develop and present product quality insights using analytics tools
Automate issue reproduction and failure injection using scripting and CI/CD systems
Collaborate with engineering teams to refine triage priorities and improve processes
Requirements
Strong, in-depth technical knowledge of SSD technology, NAND flash media characteristics, and the Flash Translation Layer (FTL) within embedded storage systems.
Highly proficient ability to conduct deep technical failure analysis using standard embedded debugging tools, event logs, and crash dumps to isolate complex firmware defects.
Proven track record of leading, managing, and optimizing a complex technical triage or debug process across multiple geographically distributed engineering teams.
Expert capability in using analytics platforms such as Jira filters and dashboards to process large datasets, extract meaningful quality metrics, and drive data-informed decisions and communications.
Solid experience utilizing Python or similar scripting languages to develop tools for issue reproduction, test automation, and efficient data analysis.
Experience defining, managing, and optimizing the end-to-end firmware triage workflow, including prioritizing, assigning, and tracking all incoming firmware defects within the Direct Flash Module (DFM) and SSD layers.
Ability to lead failure analysis and root cause analysis (RCA) for critical firmware failures, utilizing expertise in embedded systems, NAND flash characteristics, and debugging tools.
Experience designing, maintaining, and leveraging dashboards and analytics platforms to communicate product health, quality trends, and risk exposure to engineering and product leadership.
Proven technical process leadership in leading, managing, and optimizing a complex technical triage or debugging process across multiple engineering teams.
Educational background or equivalent experience demonstrating expertise in storage firmware, embedded systems, or related fields.
Ability to automate issue reproduction, failure injection, and streamline feedback processes between triage, development, and validation teams using scripting and CI/CD systems like Jenkins.
Benefits & Perks
Salary range: 211,000 - 317,000 USD
Potential for incentive pay and equity
Flexible time off
Wellness resources
Company-sponsored team events
Ready to Apply?
Join Pure Storage and make an impact in renewable energy