I’ve been working on calculating the costs associated with running a vector search system, and I’d love to get your thoughts and suggestions on more cost-effective alternatives.
Reference: https://cloud.google.com/vertex-ai/pricing#vectorsearch
My Current Setup:
Number of Records: 10,000 - json records with description key
Embedding Dimensions: 768
Machine Type: e2-standard-16 (is e2-standard-2 sufficient for my current setup?)
Cost per Node Hour: $0.75
Cost Breakdown:
Data Size Calculation:
Data Size = 10,000 records × 768 dimensions × 4 bytes = 30,720,000 bytes ≈ 0.0286 GiB
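As a sanity check, the arithmetic above can be reproduced in a few lines of Python (a pure back-of-the-envelope calculation, no GCP APIs involved):

```python
# Back-of-the-envelope index size for a vector search index.
# Each embedding dimension is stored as a 4-byte float32.
NUM_RECORDS = 10_000
DIMENSIONS = 768
BYTES_PER_FLOAT32 = 4

size_bytes = NUM_RECORDS * DIMENSIONS * BYTES_PER_FLOAT32
size_gib = size_bytes / 2**30  # 1 GiB = 1024^3 bytes

print(f"{size_bytes:,} bytes = {size_gib:.4f} GiB")  # 30,720,000 bytes = 0.0286 GiB
```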
Given that this cost seems quite high, especially for smaller-scale projects, I’m looking for recommendations on how to reduce these expenses. Are there any alternative setups, different machine types, or other optimizations that could help bring down the overall cost?
Any suggestions or experiences you can share would be greatly appreciated!
Let’s explore some strategies to optimize costs for Google Cloud’s vector search solutions.
Data Size Calculation - As you mention, your data size is ~0.0286 GiB, which would be categorized as SHARD_SIZE_SMALL. The machine types you can use to deploy your index (using public endpoints or VPC endpoints) depend on the shard size of the index. Based on the machine type support table in the documentation, you may use e2-standard-2.
@McMaco, thank you for the info above. The insight regarding machine selection based on shard size is really helpful, but my remaining concern is the serving cost.
I checked the pricing calculator, but there is no information on serving costs. The documentation describes serving as billed 24/7 (roughly 730 node hours per month), which seems quite expensive given that there is no pay-as-you-go model. Given this, the costs appear high, especially for small-scale projects.
1) Does the Vertex AI Matching Engine service offer any pay-as-you-go serving cost options?
2) For the above setup, what would be the total cost including serving?
3) Are there any alternative vector DB setups within GCP that could help reduce the overall cost?
1. The estimated monthly serving cost shown in the console is directly related to the number of nodes used. To learn more about configuration parameters that affect cost, see Configuration parameters which affect recall and latency.
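To make the relationship concrete, here is a minimal sketch of the monthly estimate, assuming the $0.75 node-hour price from the original post and ~730 hours in an average month (these inputs are illustrative; check the pricing page for your region and machine type):

```python
# Estimated monthly serving cost: deployed index nodes run 24/7,
# i.e. roughly 730 node hours per month (24 * 365 / 12).
COST_PER_NODE_HOUR = 0.75   # from the original post (e2-standard-16)
HOURS_PER_MONTH = 730
num_nodes = 1               # minimum deployment; scales with traffic

monthly_cost = COST_PER_NODE_HOUR * HOURS_PER_MONTH * num_nodes
print(f"${monthly_cost:,.2f} / month")  # $547.50 / month for one node
```

A cheaper machine type (e.g. e2-standard-2) has a lower node-hour price, which is why the shard-size/machine-type match matters so much for small datasets.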
2. I used a pricing calculator given the data you’d mention, you can double check it for reference.
3. Reducing the overall cost by using alternative GCP products is a great idea, but keep in mind that every product has its own trade-offs for a given use case. AlloyDB and BigQuery both offer vector search implementations.
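For example, AlloyDB supports the pgvector extension, so a similarity query there looks like standard Postgres. The sketch below just builds such a query; the table and column names (`items`, `embedding`) are hypothetical, and actually running it requires your own AlloyDB instance and credentials:

```python
# Sketch of a pgvector similarity query as you might run it on AlloyDB.
# Table/column names are hypothetical. "<->" is pgvector's L2-distance
# operator (use "<=>" for cosine distance instead).
def build_knn_query(table: str, column: str, top_k: int) -> str:
    return (
        f"SELECT id, description, {column} <-> %s::vector AS distance "
        f"FROM {table} ORDER BY distance LIMIT {top_k}"
    )

query = build_knn_query("items", "embedding", top_k=5)
print(query)
# Execute with a Postgres driver (e.g. psycopg2) against AlloyDB,
# passing the query embedding (a 768-float list) as the parameter.
```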
If you’d like to talk to a support representative about a Cloud Billing question or a billing-related issue, visit our Cloud Billing Support page for contact options.