The role involves designing and building automation tools, developer platforms, and high-availability systems to improve software delivery, reliability, and scalability for data storage products, working closely with engineering teams to streamline development processes and ensure system robustness.
Key Responsibilities
Design and build developer platforms and automation services to improve software delivery reliability, scalability, and velocity
Architect and implement high-availability solutions, disaster recovery, failover, and scaling strategies
Develop automation tooling and frameworks in Python, Go, or Rust to enhance developer productivity and system reliability
Extend and optimize CI frameworks for complex, distributed build and test environments
Apply Linux/Unix fundamentals to design resilient systems, debug issues, and optimize performance
Embed security into CI/CD systems through access control, patch automation, and system hardening
Drive capacity planning and system optimization to ensure efficient infrastructure utilization
Build observability and debugging tools for metrics, logs, and traces to facilitate troubleshooting
Standardize monitoring, logging, and incident management practices across engineering teams
Collaborate with developers to integrate CI/CD systems into the full development lifecycle
Requirements
Eight (8) years of software engineering experience, with a strong background in designing scalable architectures and building reliable systems end-to-end.
Deep expertise with CI/CD platforms and modern automation practices for large-scale systems.
Proficiency in one or more modern programming languages such as Python, Rust, with a proven track record of building automation, developer tooling, or scalable services.
Strong Linux/Unix fundamentals, including advanced troubleshooting, debugging, and performance tuning.
Hands-on experience with containerization and orchestration technologies such as Kubernetes (preferred), Docker Swarm, Nomad, or similar.
Proven experience in observability, including designing and integrating monitoring, logging, tracing, and alerting into CI/CD pipelines and production systems.
Ability to design and implement high-availability solutions, disaster recovery, failover, and scaling strategies.
Experience embedding security into CI/CD systems by design, including access control, patch automation, and system hardening.
Experience in capacity planning and system optimization to ensure efficient utilization of infrastructure and resources.
Ability to perform risk analysis and mitigation through proactive vulnerability assessments and automation of remediation processes.
Experience building observability and debugging tools to surface metrics, logs, and traces for troubleshooting and root cause analysis.
Ability to standardize and advance monitoring, logging, and incident management practices across engineering teams.
Experience collaborating closely with developers to integrate CI/CD systems into the full development lifecycle, ensuring fast, reliable, and secure code delivery.
Benefits & Perks
Salary range: 175,000 - 263,000 USD
Work environment: primarily in-office at Santa Clara
Incentive pay and/or equity eligibility
Flexible time off
Wellness resources
Company-sponsored team events
Ready to Apply?
Join Pure Storage and make an impact in renewable energy