Hi all,
I’m currently experiencing a delay of approximately 2 to 3 minutes in GPU allocation for my GKE Autopilot cluster. Here are the details:
- Region: asia-south1
- Cluster Type: Autopilot
- Kubernetes Version: 1.29.6-gke.1254000
Is there anything I can do to decrease the time it takes for GPUs to be allocated? Are there any specific configurations or optimizations that can help speed up this process in an Autopilot cluster?
Any guidance or recommendations would be greatly appreciated!

