As part of a Data Center DevOps team, you will work in a local datacenter, interfacing directly with the datacenter operations team to provide post-repair support that ensures hardware is validated and returned to the production fleet. As an autonomous go-getter, you will be expected to take initiative and provide leadership as the representative of the Infrastructure and Shared Services (ISS) team in the datacenter.
• Orchestrate Fleet Reliability: Own the end-to-end validation and recovery of server, storage, and network hardware, ensuring assets are returned to the production fleet with high confidence and minimal downtime.
• Engineer Automated Workflows: Design and implement scalable automation for repetitive tasks using Ansible and Python to eliminate manual bottlenecks in the deployment lifecycle.
• Drive Root Cause Resolution: Lead deep-dive troubleshooting for complex hardware and software failures, performing thorough analysis to prevent systemic issues and improve overall fleet health.
• Technical Documentation & Standards: Architect comprehensive technical runbooks and infrastructure diagrams (LucidChart/Visio) that standardize complex processes for the global DevOps and R&D communities.
• Infrastructure Mentorship: Act as a domain expert, providing technical guidance and cross-training to Datacenter Operations teams to elevate service delivery SLAs and operational excellence.
• We are primarily an in-office environment; therefore, you will be expected to work from the Dallas, TX office in compliance with Pure’s policies, unless you are on PTO, work travel, or other approved leave.
• Systems Expertise: Advanced hands-on proficiency in Linux administration and deep knowledge of enterprise hardware (Cisco, Brocade, Supermicro), including BIOS configurations and storage networking.
• Automation & Scripting: Demonstrated ability to write production-grade scripts (Python) and manage infrastructure via configuration management platforms like Ansible or Puppet.
• Operational Agility: Proven track record of managing high-volume ticketing queues (Jira) and driving complex technical projects to completion in a fast-paced datacenter environment.
• Collaborative Problem Solving: Exceptional communication skills with the ability to translate complicated hardware failures into actionable insights for both local operations and remote engineering teams.
• Modern Infrastructure Tooling: Experience working with virtualization (VMware/ESXi), containerization (Docker), and automated deployment tools to support a hybrid infrastructure model.
Salary ranges are determined based on role, level and location. For positions open to candidates in multiple geographical locations, the base salary range is reflective of the labor market across the applicable locations.
This role may be eligible for incentive pay and/or equity.
There is no application deadline; we accept applications on an ongoing basis until the position is filled.