Celonis logo

Staff Reliability Engineer

Celonis
Redwood City, California
Full Time
Posted October 21, 2025
$195k - $235k
Apply Now

Application opens on company website

Job Description

The role involves leading reliability engineering efforts for a cloud-based microservices platform, ensuring system performance, resilience, and automation through applying SRE principles, technical leadership, and collaboration with engineering teams.

Key Responsibilities

  • Ensure the health, performance, and resilience of the platform using SRE principles
  • Lead reliability efforts for microservices on Kubernetes, including observability, automation, and incident prevention
  • Develop and enforce SLOs, SLAs, and error budgets to drive reliability
  • Own high-priority application incident escalations, perform technical analysis, and restore services within SLOs
  • Automate manual processes to improve availability, latency, and operational efficiency
  • Collaborate with engineering teams to conduct post-incident reviews and implement systemic reliability improvements

Requirements

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related technical field or equivalent hands-on experience.
  • Minimum of 8 years of experience in software engineering or Site Reliability Engineering (SRE) roles.
  • Deep experience with cloud platforms AWS, GCP, or Azure.
  • Proficiency in Java, the Spring framework, and Python or a similar scripting language in a Linux environment.
  • Prior experience contributing to Site Reliability Engineering initiatives or similar operational roles.
  • Demonstrated ability to lead projects and influence engineering culture.
  • Knowledge of SRE principles, including SLI, SLO design, error budgets, and toil reduction strategies.
  • Excellent written and verbal communication skills in English.
  • Experience leading reliability efforts for microservices, including developing and enforcing SLOs, SLAs, and error budgets.
  • Experience owning high-priority application incident escalations, performing deep technical analysis, and restoration within defined SLOs.
  • Experience automating manual processes to eliminate toil and improve operational efficiency.
  • Experience collaborating with platform and application engineering teams to conduct post-incident reviews, extract insights, and implement systemic reliability improvements.

Benefits & Perks

Base salary range: 195,000 - 235,000 USD
Total compensation package including bonus, commission, equity, and benefits
Health, dental, and life insurance
401k retirement plan
Paid time off
Hybrid working options
Generous PTO
Company equity RSUs
Extensive parental leave
Dedicated volunteer days
Gym subsidies
Counseling and well-being programs
Internal mobility and mentorship opportunities
Community and inclusion programs

Ready to Apply?

Join Celonis and make an impact in renewable energy

Stay Updated on Sustainability Jobs

Get the latest renewable energy jobs and career tips delivered to your inbox.

More jobs at Celonis

Celonis logo

Global People Business Partner

Celonis
NEW
Raleigh
Full Time
14h
Celonis logo

Global People Business Partner

Celonis
NEW
New York
Full Time
14h
$145k-165k
Celonis logo

Senior Management Technology Consultant

Celonis
NEW
Munich
Full Time
14h

More jobs in Redwood City, California

SB Energy logo

Supervisor, Renewable Project Accounting

SB Energy
Redwood City
Full Time
Dec 24
$110k-140k
SB Energy logo

Supervisor, Renewable Project Accounting

SB Energy
Redwood City
Full Time
Dec 27
$110k-140k
SB Energy logo

Project Development Associate Manager

SB Energy
Redwood City
Full Time
Nov 7
$110k-150k