The role involves designing and implementing infrastructure as code at a global scale, leading IT operations, and automating cloud infrastructure to improve reliability, scalability, and security for Canonical's Ubuntu platform, with a focus on open-source technologies and DevOps practices.
Key Responsibilities
Define and implement a holistic vision for world-class internal cloud infrastructure
Develop and maintain technical design roadmaps and guidelines to improve reliability, resilience, and scalability
Collaborate with cloud-ops software development teams to shape roadmaps, requirements, and priorities
Advise IS management on technology choices, reliability, resilience, and business cases
Lead technical decisions to create scalable, self-service solutions
Work with security teams to establish best practices and mitigate threats
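Automate operations for reuse across the world's largest companies, accounting for the complexities of distributed systems
Collaborate with development teams to design service architecture, documentation, playbooks, policies, and operational procedures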
Analyze incidents to identify root causes and recommend structural improvements
Requirements
Exceptional academic track record from both high school and university
Undergraduate degree in a technical subject or a compelling narrative about your alternative chosen path
Confidence to respectfully speak up, exchange feedback, and share ideas without hesitation
Track record of going above-and-beyond expectations to achieve outstanding results
Extensive knowledge of cloud computing concepts, technologies, and operations
Practical knowledge of Linux networking, routing, firewalls, internet transit, and large-scale bandwidth networking
Experience dealing with significant production outages, incident response and postmortems
A passion for writing, sharing, and maintaining enterprise open-source software solutions
Able to communicate clearly and effectively in English by email, chat, video or voice call, and in person
Familiar with and passionate about open source, especially Ubuntu or Debian
Hands-on experience with automated administration of enterprise Linux servers at scale
Benefits & Perks
Annual compensation review
Performance-driven annual bonus or commission
Distributed work environment with twice-yearly team sprints in person
Personal learning and development budget of USD 2,000 per year
Recognition rewards
Annual holiday leave
Maternity and paternity leave
Employee Assistance Program
Opportunity to travel to new locations to meet colleagues
Priority Pass and travel upgrades for long-haul company events
Ready to Apply?
Join Canonical and make an impact in open-source cloud infrastructure