Senior Site Reliability Engineer (AI)

Zartis
Remote
Full Time
Posted February 5, 2026

Job Description

The Senior Site Reliability Engineer is responsible for designing, building, and maintaining AWS infrastructure, driving reliability improvements, and supporting AI/ML platform components to ensure operational excellence for a digital fitness platform.

Key Responsibilities

  • Take end-to-end ownership of the company's core infrastructure, including incident response, uptime practices, and operational maturity.
  • Design, build, and maintain AWS infrastructure using best-practice architecture and service selection.
  • Drive reliability improvements through observability practices such as metrics, logs, tracing, and alerting.
  • Lead capacity planning efforts to ensure scalable and resilient platform performance.
  • Implement and enforce Infrastructure as Code standards using Terraform.
  • Support the deployment and operation of AI-enabled platform components such as proxies, vector stores, and inference services.
  • Collaborate with cross-functional teams to develop and deliver digital solutions for the fitness industry.

Requirements

  • More than 8 years of experience in Site Reliability Engineering or Infrastructure-focused DevOps roles.
  • Deep understanding of distributed systems and how to operate them reliably in production.
  • Expertise across core AWS services including EC2, Load Balancers, ECS, VPC, IAM, S3, Secrets Manager, and related foundational services.
  • Strong experience with Infrastructure as Code using Terraform, ensuring all infrastructure is fully managed through code.
  • Hands-on experience with AI/ML platform infrastructure such as Bedrock, inference tooling, and vector stores.
  • Observability-first approach with hands-on experience implementing monitoring and tracing systems (metrics, logs, tracing, alerting).
  • Strong production mindset with the ability to think in terms of resilience, failure modes, and operational safety.
  • Solid capacity planning skills and experience scaling platforms responsibly.
  • Solid knowledge of SRE best practices and industry standards.

Benefits & Perks

  • 100% remote work
  • Work-from-home allowance with monthly financial support
  • Access to technical training during work hours, including online courses, books, conferences, and events
  • Mentoring program for both mentors and mentees
  • Access to Zartis Wellbeing Hub Kara Connect with sessions from mental health professionals, nutritionists, physiotherapists, and fitness coaches
  • Participation in a multicultural work environment with online team-building events, webinars, parties, and contests
