A role focused on developing a dynamic resource allocation system to optimize cloud infrastructure, improve efficiency, and streamline workflows for AI development at a self-driving technology company.
Key Responsibilities
Develop a system to provide users with GPU VMs for their development environment
Create a dynamic VM allocation mechanism integrated into Google Kubernetes Engine (GKE)
Integrate VM provisioning and lifecycle management into the in-house ML Scheduler
Requirements
Currently pursuing a Bachelor’s or Master’s degree in Computer Science or a related field, graduating before December 2026
Proficiency in machine learning concepts and applications
Familiarity with Google Kubernetes Engine (GKE) and cloud resource management
Ability to build a system that provides users with GPU VMs for their development environments
Ability to develop a dynamic VM allocation mechanism integrated into a shared Google Kubernetes Engine (GKE) resource pool
Ability to integrate VM provisioning and lifecycle management into the in-house ML Scheduler
Strong problem-solving skills and close attention to detail