Reduce GPU Allocation Time in GKE Autopilot Cluster

deepaksingh · August 27, 2024, 1:07pm

Hi all,
I’m currently experiencing a delay of approximately 2 to 3 minutes in GPU allocation for my GKE Autopilot cluster. Here are the details:

Region: asia-south1
Cluster Type: Autopilot
Kubernetes Version: 1.29.6-gke.1254000

Is there anything I can do to decrease the time it takes for GPUs to be allocated? Are there any specific configurations or optimizations that can help speed up this process in an Autopilot cluster?

Any guidance or recommendations would be greatly appreciated!

deepaksingh · August 27, 2024, 1:09pm

@knet your insights on this issue would be especially valuable. Any guidance or recommendations you could provide would be greatly appreciated!

knet · August 27, 2024, 6:47pm

I’m sorry, I don’t work on GKE Autopilot. Though this doesn’t sound too out of the ordinary.

If you’re looking for fast startup times, Cloud Run just launched a preview of GPU support! It’s signup-only at the moment, we’re steadily adding people. https://cloud.google.com/run/docs/configuring/services/gpu

Topic		Replies	Views
GKE Autopilot cluster and Wanted up a GPU ( Nvidia-l4 or Nvidia-tesla-t4 ) Serverless Applications	4	131	July 30, 2024
GPU Staging times Compute Infrastructure compute-engine	1	14	September 30, 2021
Reporting outage - GKE autopilot scheduling with NVIDIA GPUs broken Serverless Applications gke	12	47	July 31, 2023

Reduce GPU Allocation Time in GKE Autopilot Cluster

AI Suggested topics