The role involves ensuring the reliability, availability, and scalability of cloud services through automation, monitoring, and incident management, primarily within a cloud-first infrastructure, to support the company's data storage solutions and drive operational excellence.
Key Responsibilities
Ensure the reliability and availability of core cloud services through operational frameworks, monitoring, and automation.
Lead incident response and perform root cause analysis to ensure rapid recovery and prevent recurring issues.
Architect and implement automation and Infrastructure-as-Code to streamline deployments and service management.
Collaborate with product and engineering teams to influence service architecture and embed SRE best practices.
Develop and enhance observability systems, including metrics, logging, tracing, and alerting, for system health visibility.
Requirements
Strong experience designing, operating, and improving highly available cloud services, including deep understanding of service uptime, Service Level Objectives (SLOs), and production operational excellence.
Expertise with public cloud platforms such as AWS, Azure, or GCP, and hands-on experience with cloud-native architectures.
Proficiency in Infrastructure-as-Code and automation using tools such as Terraform, Ansible, CloudFormation, Puppet, or similar.
Practical experience running containerized environments and orchestration systems such as Kubernetes.
Ability to build and operate observability stacks, including metrics, logging, tracing, and actionable alerting, using tools such as ELK, Prometheus, or OpenTelemetry.
Experience managing on-call processes using tools like PagerDuty.
Strong programming skills in languages such as Python, Go, Java, Ruby, or similar.
Deep understanding of Linux systems and networking fundamentals.
Knowledge of modern software delivery practices.
Ability to work from the Prague office in an in-office environment, in compliance with company policies.
Benefits & Perks
Flexible time off
Wellness resources
Company-sponsored team events
Opportunities for growth and development
Inclusive and supportive work environment
Ready to Apply?
Join Pure Storage and make an impact in renewable energy