As a Senior Software Engineer in Production Engineering, you will own the continuous integration and test-orchestration platform empowering the FlashArray and FlashBlade teams to predictably ship high-quality products. This developer-first role offers real operational impact by allowing you to design, build, and run the critical systems behind code merges and release operations. You will eliminate manual processes through automation, leveraging data and observability to enhance reliability, throughput, and time to signal. By collaborating closely with cross-functional engineering teams, you will directly accelerate Everpure’s innovation and elevate our overall developer experience.
• Design, implement, and maintain advanced continuous integration (CI) and test-orchestration services, determining architectures that provide reliable, high-quality feedback for FlashArray and FlashBlade engineers.
• Develop developer-facing automation (such as CLIs and APIs) and encode quality gates directly into release systems to eliminate manual processes and streamline workflows.
• Drive cross-functional engineering initiatives to improve system reliability, utilizing advanced observability tools and metrics to reduce the time-to-resolve for production incidents.
• Serve as a technical leader in on-call rotations and incident triage, independently directing mitigation strategies and ensuring robust root-cause resolutions across the infrastructure.
• We are primarily an in-office environment and therefore, you will be expected to work from the Prague office in compliance with Everpure's policies, unless you are on PTO, or work travel, or other approved leave.
• Demonstrated expertise in software, production, or site reliability engineering, with a proven ability to design, build, and lead complex backend services and CI/release infrastructure across multiple teams.
• Deep understanding of software engineering fundamentals, distributed systems, and Linux/Unix environments, enabling you to independently evaluate variable factors and debug complex issues across OS, network, storage, and application layers.
• Advanced proficiency in modern programming languages (such as Python or Go) alongside strong scripting and automation skills to build robust pipelines-as-code and orchestration tools.
• Exceptional problem-solving capabilities and composure during high-impact incidents, combined with the ability to influence, collaborate, and determine methods and procedures for cross-functional engineering teams.
#LI-ONSITE