I’ve been trying to use aider in VS Code with no luck setting it up. I have plenty of credits in my account. I have tried setting different regions as well, with no luck. I read at https://console.cloud.google.com/vertex-ai/publishers/anthropic/model-garden/claude-3-5-sonnet that us-east5 and europe-west1 are the only regions where Claude 3.5 Sonnet is available, but neither seems to work. Running gcloud init shows a “zone” setting as well for some reason:
gcloud init
Welcome! This command will take you through the configuration of gcloud.
Settings from your current configuration [default] are:
accessibility:
  screen_reader: 'False'
compute:
  region: us-east5
  zone: us-east5-a
It looks like you’re encountering a quota limit issue with the Claude 3.5 model in Vertex AI. The error message indicates that you have exceeded the quota for online prediction requests for this model. Quotas are in place to ensure fair usage and to prevent any single user from overloading the system.
To confirm whether your project has actually exceeded the quota, open the Google Cloud Console, click “IAM & Admin” in the left-hand navigation panel, and select “Quotas & System Limits.” You can then filter by the service whose quota may have been exceeded.
If you want to increase any of your quotas on Vertex AI, you can use the Google Cloud Console to request a quota increase. You may follow the steps in this documentation. Keep in mind that these requests are subject to review and approval and may take some time to process. Additionally, quota increase requests are typically evaluated based on the validity of the business case provided.
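While a quota increase request is pending, a client that only occasionally hits the limit can retry with exponential backoff. The sketch below is a minimal illustration, not part of aider or any Google library: `QuotaExceededError` and `call_with_backoff` are hypothetical names, and `request` stands in for whatever function sends the prediction call. It assumes the quota error surfaces as an exception (Vertex AI reports quota exhaustion as HTTP 429 / RESOURCE_EXHAUSTED). Retrying only helps when the quota is nonzero; if the project's quota for the model is 0, an increase is still required.

```python
import random
import time


class QuotaExceededError(Exception):
    """Hypothetical stand-in for an HTTP 429 / RESOURCE_EXHAUSTED error."""


def call_with_backoff(request, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call request(); on QuotaExceededError, wait and retry with exponential backoff.

    `request` is any zero-argument callable (an assumption for illustration).
    `sleep` is injectable so the waiting behavior can be tested without delays.
    """
    for attempt in range(max_attempts):
        try:
            return request()
        except QuotaExceededError:
            if attempt == max_attempts - 1:
                raise  # out of attempts; let the caller see the quota error
            # The delay doubles on each attempt; random jitter spreads out
            # retries from clients that failed at the same moment.
            delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
            sleep(delay)
```

With the defaults, the waits are roughly 1 to 2 seconds, then 2 to 3, then 4 to 5, and so on, before the last error is re-raised after the fifth failed attempt.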
You can read through this documentation for more information regarding Vertex AI quotas and limits.