Director, Engineering, AI Accelerator Cluster Systems
Google
Kirkland, Washington, United States
(on-site)
Posted
14 hours ago
Job Type
Full-Time
Description
Minimum qualifications:
- Master's degree in Computer Science, Computer Engineering, a related technical field, or equivalent practical experience.
- 15 years of professional engineering experience.
- 5 years of experience in a senior leadership role managing large-scale infrastructure or systems teams.
- Experience in engineering leadership managing both physical systems design (electrical/mechanical) and the software stack (operating systems, firmware, and drivers) for networking or computing products.
Preferred qualifications:
- Experience building and scaling cloud platforms or high-performance computing infrastructure in a fast-paced, dynamic environment.
- Exposure to, or experience with, physical data center engineering, infrastructure certification, and physical constraints (e.g., custom cooling schemes, power delivery, rack density).
- Strong executive communication skills, with the ability to simplify complex silicon-to-software co-design topics to influence senior leadership and company strategy.
- Technical expertise in bare metal provisioning, advanced accelerators (TPUs/GPUs), and high-performance cluster networking.
- Deep technical expertise in ML workloads, AI infrastructure scaling, and the unique performance requirements of foundation models.
About the job
As the Director, Engineering, AI Accelerator Cluster Systems, you will be responsible for driving the provisioning architecture, cluster operations experience, physical-to-software integration, and overall operator experience for Google Cloud's bare metal accelerator infrastructure. In this role, you will sit at the intersection of physical systems engineering and the underlying software stack, leading strategic efforts to deliver AI infrastructure on premises.
Google Cloud accelerates every organization's ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google's cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.
The US base salary range for this full-time position is $307,000-$427,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.
Responsibilities
- Build physical infrastructure and custom networking, while designing data centers focused on custom cooling, rack density, and CDU placement.
- Oversee the core bare-metal software stack, including drivers, OS, firmware, and accelerator release management.
- Engineer systems with direct GPU/TPU access, ensuring high compute density and low latency for training foundation models.
- Partner cross-functionally to optimize the end-to-end AI accelerator stack from large-scale networking to Kubernetes.
- Collaborate with product and GTM leaders to shape the multi-year bare-metal AI infrastructure strategy.
Requisition #: 121414718207206086
Job ID: 84325569
