Since 2026-04-21 around 09:00 CEST, all of my Gemini-based tools have started failing with:
429 Resource exhausted. Please try again later.
This affects multiple tools/workflows, not just one script.
Originally I was using gemini-2.0-flash, but I also tested gemini-2.5-flash and I get the same error there as well.
Important details:
-
low request volume
-
requests are serialized on my side
-
no burst traffic
-
often even the first request after startup fails
-
retries with backoff do not recover
So this does not look like a simple client-side rate spike.
The error happens during the Gemini generation step.
Questions:
-
Is there a known incident today affecting Gemini API / Vertex AI?
-
Could this be a quota/billing/tier mismatch even at low usage?
-
How can I distinguish between actual quota exhaustion and backend/shared-capacity exhaustion?
-
Has anyone else seen this today with both
gemini-2.0-flashandgemini-2.5-flash?
Any help would be appreciated.