Is anyone else seeing unexpected billing spikes for gemini-3.1-flash-image starting on March 27?
My project has a completely frozen codebase (using google-cloud-aiplatform==1.133.0, no deployments since March 19). However, starting on March 27, all of my standard image generation traffic silently migrated on the backend from the Standard SKU (7D83-2D44-B50E) to the Priority SKU (EC3C-2188-6892), effectively doubling my costs.
The catch: According to the official Vertex AI model documentation for Gemini 3.1 Flash Image, Priority PayGo is explicitly NOT supported for this model.
Because Priority isn’t supported, it’s impossible for my code to have requested it. It appears the Vertex AI global load balancers are currently defaulting unflagged requests for this preview model to a non-existent premium tier and passing the cost to users.
Has anyone else caught this in their billing report?