This role leads the firmware triage process for storage products, conducts root cause analysis of complex firmware failures, and provides data-driven insights to improve product quality and reliability at a storage technology company.
Key Responsibilities
Own and manage the firmware triage workflow for storage products.
Lead technical investigations and root cause analysis of firmware failures.
Develop and maintain dashboards and analytics to monitor product quality and trends.
Automate issue reproduction, failure injection, and feedback processes using scripting and CI/CD tools.
Collaborate with engineering teams to prioritize triage efforts and implement process improvements.
Requirements
In-depth technical knowledge of SSD technology, NAND flash media characteristics, and the Flash Translation Layer (FTL) within embedded storage systems.
Strong proficiency in deep technical failure analysis, using standard embedded debugging tools, event logs, and crash dumps to isolate complex firmware defects.
Proven track record of leading, managing, and optimizing a complex technical triage or debug process across multiple geographically distributed engineering teams.
Expertise in using analytics tools such as Jira filters and dashboards to process large datasets, extract meaningful quality metrics, and drive data-informed decisions and communications.
Solid experience utilizing Python or similar scripting languages to develop tools for issue reproduction, test automation, and efficient data analysis.
Experience owning the end-to-end triage workflow: defining, managing, and optimizing the daily triage process, and prioritizing, assigning, and tracking all incoming firmware defects within the Direct Flash Module (DFM) and SSD layers.
Ability to lead Failure Analysis and Root Cause Analysis (RCA) by investigating critical firmware failures and identifying fundamental root causes to ensure targeted and effective fixes.
Experience designing, maintaining, and leveraging Jira dashboards and analytics platforms to transform raw defect data into actionable reports, communicating current product health, key quality trends, and risk exposure to engineering and product leadership.
Ability to automate issue reproduction and failure injection, and to streamline feedback processes between triage, development, and validation, using CI/CD systems such as Jenkins and scripting in Python.
Strong, in-depth technical knowledge of storage firmware, including SSD technology, NAND flash media characteristics, and the Flash Translation Layer (FTL).
Demonstrated ability to lead, manage, and optimize technical processes related to firmware triage or debugging across multiple engineering teams.
Excellent data and analytics fluency, with the ability to process large datasets and extract meaningful quality metrics for decision-making.
Educational background or equivalent experience in a relevant technical field such as Electrical Engineering, Computer Engineering, or Computer Science.
Benefits & Perks
Salary range: 211,000 - 317,000 USD
Potential eligibility for incentive pay and/or equity
Flexible time off
Wellness resources
Company-sponsored team events
Ready to Apply?
Join Pure Storage and make an impact in storage technology.