The Cloud Platform Engineer is responsible for designing, implementing, and maintaining multi-cloud infrastructure, ensuring high availability, security, and performance, while automating processes and supporting internal engineering teams to deliver high-quality products.
Key Responsibilities
Design and implement Cloud Governance at scale, enforce security baselines, and automate infrastructure provisioning using Infrastructure-as-Code tools.
Ensure continuous availability and performance of multi-cloud environments by maintaining systems that meet SLAs and SLOs.
Create and implement tools and processes to improve cloud cost efficiency and resource utilization.
Manage the Cloud infrastructure lifecycle, including deployments, upgrades, incident, problem, and capacity management.
Collaborate with project teams to ensure smooth delivery of critical infrastructure projects.
Develop documentation and runbooks to streamline operations and maintenance of Cloud infrastructure.
Requirements
At least 5 years of essential experience as a Cloud Engineer or SRE engineer with a focus on AWS services including EC2, VPC, TransitGateway, S3, CloudWatch, and System Manager.
Deep expertise in Kubernetes, with experience troubleshooting complex issues in production environments; knowledge of EKS and AKS is preferred.
Proficiency in Terraform for managing the full Cloud infrastructure lifecycle and demonstrable experience with the AWS governance framework including Organization, SCP, and Control Tower for baseline enforcement.
Experience with multi-cloud environments, specifically designing or maintaining enterprise-grade Azure landing zones, including Virtual Machines, AKS, Azure AD, and Azure Policy.
Proficiency in a modern cloud programming language such as Python or Go, and experience building and maintaining cloud-native applications.
Strong experience with modern DevOps tools including ArgoCD, Docker, and GitHub.
Hands-on experience utilizing observability and monitoring platforms such as Prometheus, Grafana, Thanos, or ELK for debugging and performance analysis.
Ability to work in a 24x7 on-call rotation and lead incident management, demonstrating systematic problem-solving, analytical, and collaboration skills.
Experience collaborating with global teams and the ability to attend meetings in the later afternoon or occasionally evening hours.
Willingness to work from the Prague, Czech Republic office in compliance with company policies, unless on PTO, work travel, or other approved leave.
Benefits & Perks
Compensation/salary range (not specified in the posting)
Work schedule: 24x7 on-call rotation, occasional evening meetings
Work environment perks: flexible time off, wellness resources, company-sponsored team events
Additional benefits: support for growth and development, inclusive and diverse workplace culture, accommodations for disabilities
Ready to Apply?
Join Pure Storage and make an impact in renewable energy