Receiving quota error when trying to use bison chat model in Vertex AI

Hi, I want to try out the new bison chat model. However, whenever I ask anything, I receive this error:

Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: chat-bison. Please submit a quota increase request.

Me too.

Submitting a prompt on https://console.cloud.google.com/vertex-ai/generative/language/create/text results in the following error:

Failed to submit prompt

Error message: “Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: text-bison. Please submit a quota increase request.”

Status: 429 Error code: 429

Tracking number: xxxxxxxxxxxxx

I’ve tried to follow the docs on Quotas and limits but there seem to be no quotas for Generative AI.

I’m on the GCP Free Trial if that is relevant. Unfortunately, this means I can’t contact (paid) support.

Here are the quotas: https://cloud.google.com/vertex-ai/docs/quotas. When requesting a quota increase, filter for the metric ‘aiplatform.googleapis.com/online_prediction_requests_per_base_model’. You should see quotas by region and model with default values.
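
If it helps, here is a minimal sketch for pulling the metric name and base model out of the error text, so you know exactly what to type into the quota filter. The pattern is only based on the two error messages quoted in this thread, and `parse_quota_error` is a made-up helper name, not part of any Google library. Other quota errors may be worded differently.

```python
import re

# Assumption: the error text follows the wording quoted in this thread.
QUOTA_ERROR = re.compile(
    r"Quota exceeded for (?P<metric>\S+) with base model: (?P<base_model>[\w@-]+)"
)


def parse_quota_error(message):
    """Return (metric, base_model) from a quota error message, or None if it does not match."""
    match = QUOTA_ERROR.search(message)
    if match is None:
        return None
    return match.group("metric"), match.group("base_model")


if __name__ == "__main__":
    error = (
        "Quota exceeded for aiplatform.googleapis.com/"
        "online_prediction_requests_per_base_model with base model: chat-bison. "
        "Please submit a quota increase request."
    )
    print(parse_quota_error(error))
```

The first value is the metric to filter on in the Quotas page; the second is the base model whose row you are looking for.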

Yes, but all quotas are set to 0 and I can’t request a quota increase either. It says that I need to contact sales.

could you share a screenshot?

Hey! I am seeing the same issue :pensive_face:

Here’s a screenshot

Here is what my quotas look like in my paid account

Since this is a preview service and Google is currently not charging for it, it may be that Google has disabled this service for free-tier accounts.

thanks for sharing! looking into it. should have a response tomorrow.

Hi Kolban, I’m using an enterprise account (no free trial). However, I just tried with my personal account, and that one has quotas and I can use the models. Not sure why. Thanks for checking it out!

Dashy,

Maybe the account that shows 0 quota doesn’t have IAM permissions to use Vertex AI but the other account does?

Hi, I was able to enable all the Vertex AI APIs with no issues. Can you help me with the steps to check whether Vertex AI is enabled at the IAM level?

@dashy can you try again? I believe we increased to 30 queries per minute.

@oiwejdsd can you try again as well and confirm that you can now access 30 queries per minute? Make sure you’re using the us-central1 region.
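
For anyone scripting against the models once the quota shows up, a rough sketch of a client-side throttle that stays under 30 requests per minute is below. This is plain Python with no Vertex AI calls; `MinuteThrottle` is a hypothetical helper, and the limit of 30 is just the figure mentioned in this thread, so adjust it to whatever your Quotas page shows.

```python
import time
from collections import deque


class MinuteThrottle:
    """Block until a request can be sent without exceeding max_calls per window."""

    def __init__(self, max_calls=30, window_seconds=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.window_seconds = window_seconds
        self._clock = clock
        self._sleep = sleep
        self._sent = deque()  # timestamps of recent requests, oldest first

    def wait(self):
        now = self._clock()
        # Forget requests that have already left the sliding window.
        while self._sent and now - self._sent[0] >= self.window_seconds:
            self._sent.popleft()
        if len(self._sent) >= self.max_calls:
            # Sleep until the oldest recorded request leaves the window.
            self._sleep(self.window_seconds - (now - self._sent[0]))
            now = self._clock()
            self._sent.popleft()
        self._sent.append(now)
```

Call `throttle.wait()` right before each request. The `clock` and `sleep` parameters exist only so the class can be exercised without real waiting.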

@j_molina can you try again and let us know if you can now see 30 queries per minute for us-central1 for the bison models? Your screenshot is showing AutoML services, which are different.

It’s still at 0 for the enterprise account. For the individual account it’s 30.

@mchrestkha1 Thanks for your replies to this thread. My quota screen looks like @dashy’s and is still at zero.

I’m having the same issue; could you increase my limit?

I have the same problem. I wasn’t even able to try it.
ERROR. Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: chat-bison
I don’t even have the Vertex service on the Quotas page.

Correction. Vertex has appeared. Now just the quota is zero.

I’m having a similar issue. I created a free account to evaluate fine-tuning a language model with Generative AI Studio. I have a valid JSONL dataset file and I’m set to use us-central1, but no matter what I do, every attempt fails in the pipeline at the ‘large-language-model-tuning’ step with the error AiPlatformException: code=RESOURCE_EXHAUSTED message=The following quota metrics exceed quota limits. Oddly enough, further down in the large error stack, the path in the Execution name always mentions …/locations/europe-west4, even though all my settings are set to us-central1 and the Input Parameters also show the location as us-central1.

I guess my question is: is there some kind of error going on, or is Generative AI Studio’s Tuning feature just not something you are permitted to evaluate in Free mode?
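
In case anyone else wants to check for the same region mismatch in their own error stack, a small sketch that extracts the location segment from a resource path and compares it with the region you expected is below. The helper names and the example path are hypothetical (the real path in the error above is truncated), and this only inspects strings; it does not call any Google API.

```python
import re


def location_of(resource_name):
    """Return the segment after 'locations/' in a resource path, or None if absent."""
    match = re.search(r"/locations/([a-z0-9-]+)", resource_name)
    return match.group(1) if match else None


def location_mismatch(resource_name, expected="us-central1"):
    """Return the unexpected location found in the path, or None if it matches or is absent."""
    found = location_of(resource_name)
    if found is None or found == expected:
        return None
    return found


if __name__ == "__main__":
    # Hypothetical path for illustration only.
    path = "projects/my-project/locations/europe-west4/pipelineJobs/example"
    print(location_mismatch(path))
```

If this reports a region other than the one you configured, the quota being exhausted may belong to that other region, so that is the row worth looking at on the Quotas page.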