FinOps for AI/ML: Advanced Cost Management in Google Cloud
Overview and Purpose
Exporting Cloud Billing data to BigQuery is the foundation for advanced cost management (FinOps), especially for complex AI/ML workloads such as Vertex AI and Gemini.
Key Benefits
- Granular Cost Analysis: Drill down to resource-level usage (e.g., specific GPU hours, API calls).
- Accurate Attribution: Segment costs by project, team, environment, or custom labels.
- Custom Reporting: Use BigQuery SQL to build detailed reports, dashboards, and anomaly detection (an example query follows this list).
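As a quick illustration, the sketch below rolls up monthly net cost per project and service from the detailed export. The project, dataset, and table names are placeholders; your detailed export table is suffixed with your billing account ID.

```sql
-- Monthly net cost per project and service from the detailed billing export.
-- `finops-project.billing_export.gcp_billing_export_resource_v1_XXXXXX` is a
-- placeholder; substitute your own dataset and exported table name.
SELECT
  invoice.month       AS invoice_month,
  project.id          AS project_id,
  service.description AS service,
  ROUND(SUM(cost), 2) AS gross_cost,
  -- Net cost = cost plus (negative) credits such as discounts.
  ROUND(SUM(cost + IFNULL((SELECT SUM(c.amount) FROM UNNEST(credits) c), 0)), 2) AS net_cost
FROM `finops-project.billing_export.gcp_billing_export_resource_v1_XXXXXX`
GROUP BY invoice_month, project_id, service
ORDER BY invoice_month, net_cost DESC;
```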
Step-by-Step Setup Guide
1. Setup Destination
- Project: Use a separate project for FinOps (best practice).
- Dataset: Create a new BigQuery dataset.
- Crucial: Choose a multi-region location (US or EU) to ensure you get retroactive data (backfill). Regional datasets only collect new data. (A DDL sketch follows this list.)
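If you prefer to create the dataset with SQL DDL rather than the Console, a minimal sketch (assuming a dedicated FinOps project named finops-project and a dataset named billing_export) looks like this; the location option is what makes backfill possible:

```sql
-- Create the destination dataset in the US multi-region so the billing export
-- can backfill historical data. Project and dataset names are placeholders.
CREATE SCHEMA IF NOT EXISTS `finops-project.billing_export`
OPTIONS (
  location = 'US',
  description = 'Destination dataset for the Cloud Billing detailed export'
);
```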
2. Enable Export
- Navigate to Billing > Billing export in the GCP Console.
- Under the BigQuery export tab, enable:
  - Detailed usage cost data: MANDATORY for resource-level AI tracking.
  - Pricing data: Recommended for cost-vs-list analysis.
- Select your project/dataset and Save.

Note: Data typically starts flowing into BigQuery within a few hours. A quick freshness check is shown below.
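A simple way to confirm the export is flowing, and to see how far the data lags, is to check the latest export and usage timestamps (the table name is again a placeholder):

```sql
-- How fresh is the exported data? Billing data lags real usage by hours.
SELECT
  MAX(export_time)    AS latest_export,
  MAX(usage_end_time) AS latest_usage_covered
FROM `finops-project.billing_export.gcp_billing_export_resource_v1_XXXXXX`;
```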
Tracking AI Services (Vertex AI & LLMs)
AI workloads are tracked using the service.description field and labels; labels are your primary mechanism for granular attribution (an example query follows the table below).
| Scenario | Field/Label to Focus On | Purpose |
|---|---|---|
| General AI Usage | service.description | Filter for all costs related to “Vertex AI,” “Cloud Storage,” etc. |
| Job/Model Tracking | labels.key & labels.value | Apply custom labels (e.g., model_name: v2-rec) to training jobs. |
| Vertex AI Pipelines | labels.vertex-ai-pipelines-run-billing-id | Automatically propagates to all sub-resources (VMs, storage) in a run. |
| Generative AI (LLMs) | sku.description | Track Gemini costs via token usage SKUs. Combine with project filters. |
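Putting these fields together, the sketch below attributes Vertex AI spend to a custom model_name label and breaks it down by SKU. The label key, the optional LIKE pattern, and the table name are illustrative assumptions, not fixed names from the export schema:

```sql
-- Attribute Vertex AI spend to models via a custom label and break it down
-- by SKU (Gemini token usage appears as SKUs). `model_name` and the table
-- name are placeholders.
SELECT
  (SELECT l.value FROM UNNEST(labels) l WHERE l.key = 'model_name') AS model_name,
  sku.description                                                   AS sku,
  ROUND(SUM(cost), 2)                                               AS cost
FROM `finops-project.billing_export.gcp_billing_export_resource_v1_XXXXXX`
WHERE service.description = 'Vertex AI'
  -- Optionally isolate token SKUs, e.g.: AND sku.description LIKE '%Gemini%'
GROUP BY model_name, sku
ORDER BY cost DESC;
```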
Critical Limitations & Solutions
To ensure accurate reporting, keep the following considerations in mind (a sketch of the schema-change mitigation follows the table):
| Area | Issue to Watch Out For | Recommended Solution |
|---|---|---|
| Data Granularity | Standard export is insufficient for AI tracking. | Always enable “Detailed usage cost data.” |
| Retroactive Data | Regional datasets do not receive backfilled data. | Use a multi-region (US/EU) location for the dataset. |
| Data Lag | Billing data is not real-time (few hours delay). | Use Cloud Monitoring/Budget Alerts for real-time warnings. |
| Schema Changes | Raw table schema changes can break SQL queries. | Create BigQuery Views to shield reports from schema changes. |
| GKE Costs | GKE resource breakdowns aren’t included by default. | Manually enable GKE cost allocation in the GCP Console. |
| Shared Costs | Hard to attribute shared VPC or BigQuery costs. | Define internal allocation logic based on proportional usage. |
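For the schema-change mitigation above, one common pattern is a thin BigQuery view that pins exactly the columns your reports use, so downstream SQL only ever references the view; all names here are placeholders:

```sql
-- Thin view over the raw export; if the raw schema changes, only this view
-- needs updating, not every downstream report. Names are placeholders.
CREATE OR REPLACE VIEW `finops-project.billing_export.v_cost_facts` AS
SELECT
  export_time,
  invoice.month       AS invoice_month,
  project.id          AS project_id,
  service.description AS service,
  sku.description     AS sku,
  labels,
  cost,
  currency
FROM `finops-project.billing_export.gcp_billing_export_resource_v1_XXXXXX`;
```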
Official References
The information provided is based on official Google Cloud documentation and best practices guides.
- Cloud Billing Data Export Setup:
  - Set up Cloud Billing data export to BigQuery (Google Cloud documentation)
- Data Structure and Limitations:
  - Export Cloud Billing data to BigQuery (Google Cloud documentation; covers limitations, supported regions, and backfill notes)
  - Understand the Cloud Billing data tables in BigQuery (Google Cloud documentation; covers schema details and the use of BigQuery views to absorb schema changes)
- AI Cost Attribution (Labels and SKUs):
  - Understand pipeline run costs (Google Cloud documentation; confirms the use of the vertex-ai-pipelines-run-billing-id label for cost tracking)
  - Vertex AI Pricing (Google Cloud documentation; shows costs are broken down by specific SKUs, which is the mechanism used to track token usage via sku.description in the billing export)