As a Reliability Engineer, you will manage and improve Everpure's infrastructure, internal tooling, and production services. The role focuses on designing, maintaining, and troubleshooting enterprise systems, enhancing system resiliency, and automating operations to support innovative data storage solutions.
Key Responsibilities
Design, operate, maintain, and troubleshoot enterprise systems such as databases, message queues, APIs, and distributed applications.
Establish and practice incident response procedures and conduct blameless postmortems to prevent recurrence of issues.
Support services through system design, software development, capacity planning, and launch reviews.
Scale and evolve systems using automation and scripting to improve operational management, reliability, and velocity.
Collaborate with cross-functional teams to deliver high-quality customer outcomes.
Requirements
Demonstrated coding ability in a functional or object-oriented programming language, with a preference for Python and Go.
Demonstrated experience in the design, implementation, delivery, and maintenance of large-scale, distributed software systems.
Ability to join a 24x7 on-call rotation that uses a follow-the-sun model: pager duty from 8am to 8pm local time, for approximately 1 week every 2-3 months.
Systematic problem-solving approach, strong communication skills, and a sense of ownership and drive.
Experience in analyzing performance and troubleshooting distributed systems.
Minimum of 5 years of experience as a Site Reliability Engineer, DevOps Engineer, or Infrastructure Engineer.
Strong understanding of Unix/Linux operating systems.
Experience working with Infrastructure as Code automation tools such as ArgoCD, Ansible, Terraform, or CloudFormation.
Ability to prioritize tasks independently, set goals, and follow through to completion.
Experience with containers and container orchestration systems, in particular Kubernetes.
Expertise with hybrid environments spanning bare metal and public cloud, with AWS preferred.
Benefits & Perks
Flexible time off
Wellness resources
Company-sponsored team events