Senior Site Reliability Engineer (AI)
ZartisRemote
Full Time
Posted February 5, 2026
Not Specified
Remote
Apply Now
Application opens on company website
Job Description
A Senior Site Reliability Engineer responsible for designing, building, and maintaining AWS infrastructure, driving reliability improvements, and supporting AI/ML platform components to ensure operational excellence for a digital fitness platform.
Key Responsibilities
- Take end-to-end ownership of the company's core infrastructure, including incident response, uptime practices, and operational maturity.
- Design, build, and maintain AWS infrastructure using best-practice architecture and service selection.
- Drive reliability improvements through observability practices such as metrics, logs, tracing, and alerting.
- Lead capacity planning efforts to ensure scalable and resilient platform performance.
- Implement and enforce Infrastructure as Code standards using Terraform.
- Support deployment and operations of AI-enabled platform components like proxies, vector stores, and inference services.
- Collaborate with cross-functional teams to develop and deliver digital solutions for the fitness industry.
Requirements
- More than 8 years of experience in Site Reliability Engineering or Infrastructure-focused DevOps roles.
- Deep understanding of distributed systems and how to operate them reliably in production.
- Expertise across core AWS services including EC2, Load Balancers, ECS, VPC, IAM, S3, Secrets Manager, and related foundational services.
- Strong experience with Infrastructure as Code using Terraform, ensuring all infrastructure is fully managed through code.
- Hands-on experience with AI ML platform infrastructure such as Bedrock, inference tooling, and vector stores.
- Observability-first approach with hands-on experience implementing monitoring and tracing systems (metrics, logs, tracing, alerting).
- Strong production mindset with the ability to think in terms of resilience, failure modes, and operational safety.
- Solid capacity planning skills and experience scaling platforms responsibly.
- Solid knowledge of SRE best practices and industry standards.
Benefits & Perks
100% remote work
Work from Home allowance with monthly financial support
Access to technical training during work hours, including online courses, books, conferences, and events
Mentoring program for both mentors and mentees
Access to Zartis Wellbeing Hub Kara Connect with sessions from mental health professionals, nutritionists, physiotherapists, and fitness coaches
Participation in multicultural work environment with online team-building events, webinars, parties, and contests
Ready to Apply?
Join Zartis and make an impact in renewable energy
Stay Updated on Sustainability Jobs
Get the latest renewable energy jobs and career tips delivered to your inbox.
Job Alerts
Get notified about new sustainability jobs
More jobs at Zartis
Senior Full-Stack Engineer (React, NestJS)
Zartis
Remote
Full Time
Jan 14
Lead QA Engineer
Zartis
Remote
Full Time
Jan 15
Lead Data Scientist
Zartis
Remote
Full Time
Jan 23
More jobs in Remote
IT Support Specialist
AlertMedia
NEW
Remote
Full Time
21h
Customer Success Manager, Growth
Affinity
NEW
VISA
Remote
Full Time
2d
Creative Video Producer, Brand
Affinity
NEW
Remote
Full Time
2d
$119k-137k