The role involves designing and leading the development of a global automation platform for data storage infrastructure, focusing on orchestration, reliability, and scalability of complex hardware and software systems to support innovative product development.
Key Responsibilities
Design and oversee the development of a global automation platform and unified orchestration layer managing infrastructure across physical and virtual environments
Ensure infrastructure reliability by eliminating systemic bottlenecks and maintaining 99.9% availability of provisioning workflows
Define the technical roadmap for hardware lifecycle management and integration with next-generation hardware and operating systems
Partner with SRE and Product teams to establish standards for environment requests and streamline CI/CD pipelines
Mentor engineering teams on distributed systems design, code quality, and complex root-cause analysis
Requirements
Proven ability to architect and scale infrastructure management tools, with a deep understanding of the trade-offs between consistency, availability, and partition tolerance in distributed systems.
Advanced knowledge of the Linux kernel, boot processes including PXE and iPXE, and virtualization/containerization technologies such as KVM, QEMU, and Docker.
Expert-level proficiency in Python or Go, with the ability to design robust APIs and scalable microservices that interact directly with physical and virtual hardware.
A track record of solving complex technical challenges that span the entire stack, from low-level hardware and networking to high-level software orchestration.
Ability to define and oversee the development of a unified orchestration layer that manages state, scheduling, and resource allocation across physical and virtual environments.
Experience in designing infrastructure that ensures 99.9% availability of provisioning workflows for global engineering teams.
Experience in defining long-term strategies for hardware lifecycle management and ensuring automation stacks seamlessly integrate with next-generation hardware and evolving operating systems.
Experience partnering with SRE and Product teams to establish company-wide standards for environment requests, reducing time-to-test and streamlining CI/CD pipelines.
Ability to cultivate a high-performance engineering culture by setting code quality standards and coaching engineers on distributed systems design and root-cause analysis.
Location requirement: Ability to work primarily in-office at the OFFICE_LOCATION in compliance with company policies, unless on PTO, work travel, or other approved leave.
Benefits & Perks
Flexible time off
Wellness resources
Company-sponsored team events
Ready to Apply?
Join Pure Storage and make an impact in renewable energy