Pure Storage logo

FY26 Q4 Site Reliability Engineer

Pure Storage
Prague, Czech Republic
Full Time
Posted December 4, 2025
Apply Now

Application opens on company website

Job Description

The Site Reliability Engineer will ensure the performance, stability, and reliability of mission-critical cloud infrastructure and services through automation, monitoring, incident response, and collaboration with development teams to improve system architecture and scalability.

Key Responsibilities

  • Ensure performance and stability of mission-critical infrastructure and services through monitoring, incident response, and root cause analysis.
  • Design and implement automation and orchestration solutions to improve operational efficiency and reduce manual errors.
  • Collaborate with development teams to integrate reliability principles into service architecture for high availability and scalability.
  • Develop and enhance observability tools by configuring monitoring, metrics collection, and alerting systems.
  • Adopt and integrate modern cloud operations technologies, including Infrastructure as Code, container orchestration, and high-availability solutions.

Requirements

  • Demonstrated ability to write production-quality code using languages such as Python, Go, Java, C, or C++, including experience with software design, implementation, and maintenance.
  • At least 3 years of experience as a Site Reliability Engineer (SRE) or DevOps supporting globally distributed SaaS services.
  • A systematic and data-driven problem-solving approach, coupled with strong communication skills and a deep sense of ownership for critical production services.
  • A solid understanding of enterprise systems performance analysis and debugging, with the ability to leverage metrics and data to drive system improvements.
  • Ability to establish and maintain service reliability for core cloud platforms and infrastructure by implementing monitoring, incident response, root cause analysis (RCA), and resolution in a 24x7 environment.
  • Experience participating in transforming operational practices by designing and implementing automation and orchestration solutions for manual cloud service operations and deployment to enhance efficiency and reduce human error.
  • Experience partnering cross-functionally with development teams to integrate Site Reliability Engineering principles early in the development lifecycle, including defining improvements to service architecture that support high availability, scalability, and adherence to SLAs.
  • Experience building and evolving observability stacks by configuring service health monitoring, collecting and reporting key metrics, and establishing alerting systems to monitor system performance and health.
  • Experience driving adoption of modern cloud operations technologies, including exploring and integrating tools for Infrastructure as Code (IaC), container orchestration, and high-availability (HA) solutions to optimize reliability and scalability.

Benefits & Perks

Compensation/salary range (not specified in the posting)
Work schedule: primarily in-office in Prague, with flexibility for PTO, work travel, or approved leave
Work environment perks: flexible time off, wellness resources, company-sponsored team events
Additional benefits: support for growth and development, inclusive and diverse workplace culture, accommodations for disabilities

Ready to Apply?

Join Pure Storage and make an impact in renewable energy

Stay Updated on Sustainability Jobs

Get the latest renewable energy jobs and career tips delivered to your inbox.

More jobs at Pure Storage

Pure Storage logo

ServiceNow Solutions Architect

Pure Storage
NEW
Santa Clara
Full Time
2d
$165k-248k
Pure Storage logo

Senior Manager, Domestic Tax

Pure Storage
NEW
Santa Clara
Full Time
2d
$176k-265k
Pure Storage logo

Software Engineer, DRaaS

Pure Storage
NEW
Prague
Full Time
2d