Job Description
The Senior Manager role at Aurora is a technical leadership position responsible for developing and executing incident response and resilience strategies across the company's autonomous vehicle systems, IT infrastructure, and operational centers. The role ensures system stability, security, and continuous improvement in a complex, safety-critical environment.
Key Responsibilities
- Define and execute the incident response technical roadmap and frameworks.
- Serve as the point of escalation and command for high-severity incidents, including cybersecurity breaches and system outages.
- Lead cross-functional collaboration with engineering, IT, safety, legal, and cybersecurity teams to influence system architecture and risk mitigation.
- Translate complex technical issues into clear business risks for executive communication during major incidents.
- Conduct post-incident reviews to identify root causes and systemic improvements, and develop KPIs and SLOs for incident response performance.
- Lead engagement with auditors to demonstrate process maturity and compliance with relevant frameworks.
- Develop and mature incident management processes, SOPs, and training programs across the organization.
- Manage and scale high-performance, geographically distributed incident response teams.
- Oversee the development and implementation of resilience and stability strategies for the technical ecosystem.
Requirements
- At least 10 years of experience in a technical operations domain, with a significant emphasis on Incident Management in large-scale environments such as cloud computing, enterprise software, robotics, or other safety-critical industries.
- At least 7 years of experience in direct people management with demonstrated excellence in leading, mentoring, and scaling high-performance, geographically distributed, 24x7 technical teams.
- Ability to rapidly comprehend the interconnectedness of complex system architectures, spanning on-vehicle compute, cloud infrastructure, enterprise IT systems, and the operational dynamics of a Network Operations Center (NOC) and Security Operations Center (SOC).
- Proven experience leading an enterprise through the development and execution of incident response and crisis management programs, including creating mature processes, Standard Operating Procedures (SOPs), and comprehensive training programs.
- Exceptional communication and presentation skills, with a proven ability to brief C-level executives and translate deep technical issues into concise, business-oriented terms.
- Proven ability to drive complex, cross-functional initiatives from conception to completion, demonstrably improving operational maturity and efficiency.
- Bachelor’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
- Experience in defining and executing the technical roadmap for incident response functions, including architecting new processes, developing data-driven playbooks, and integrating tooling and automation to enhance response maturity.
- Experience serving as the ultimate point of escalation and command for high-severity, complex incidents such as cybersecurity breaches, mission-critical production service outages both on-vehicle and in the cloud, and widespread IT infrastructure failures.
- Experience leading the Incident Governance Board, fostering collaboration with senior leaders in Engineering, IT, Safety, Legal, and Cybersecurity to influence architectural decisions, resiliency planning, and vulnerability management programs.
- Experience translating complex technical impacts into clear business risks for executive audiences during major incidents.
- Experience conducting rigorous, blameless post-incident reviews to identify root causes and systemic lessons learned, and developing and analyzing KPIs and SLOs related to incident response.
- Experience leading engagement with internal and external auditors, demonstrating process maturity and adherence to relevant compliance frameworks such as ISO 27001 and NIST.
- Expert-level understanding of incident response frameworks (e.g., ICS/NIMS) and hands-on experience managing enterprise-level incidents such as major cloud service disruptions, large-scale network failures, or critical security incidents.
- Experience developing and managing department-level budgets and resource allocation plans.
- Familiarity with modern observability, monitoring, and logging platforms such as Datadog, Splunk, or Prometheus.
Benefits & Perks
Salary range: 171,000 - 273,000 USD per year
Annual bonus
Equity compensation
Benefits (unspecified)