Vertex AI announced Infrastructure-as-Code (IaC) for open model deployments on Vertex AI Model Garden.
You can now leverage the google_vertex_ai_endpoint_with_model_garden_deployment Terraform resource to automate the entire deployment process.
Key takeways:
- Manage the complete Vertex AI deployment lifecycle using a unique main.tf file.
- Declaratively configure dedicated resources, handle multiple model deployments, and set up PSC endpoints.
- Support for deploying models directly from the Vertex AI Model Garden catalog or the Hugging Face hub.
Please refer to the official documentation and the example notebook for implementation details (links available in the original post).
Notebook
Vertex AI Model Garden doc
Terraform doc
Happy building!
1 Like
I need to deploy a custom docker container for a custom Pytorch inference engine for geospatial. The model uploads but cannot inference on the existing docker containers.
Deploying Vertex AI Model Garden models with Terraform can really streamline infrastructure management, especially if you’re aiming for consistency and repeatability across environments. Using Terraform to automate model deployment helps reduce manual steps and makes your AI workflows more scalable and maintainable. If you run into permission or configuration issues, double‑checking your service account roles and Terraform provider settings usually helps. It’s exciting to see more folks combining IaC with AI tools, it’s a big step toward more robust MLOps!