Hi,
I am trying to deploy a simple model on an endpoint in order to start making predictions.
I followed these steps:
-
Create the model, create a docker image, push it into Artifact Registry
-
Upload the model to Vertex AI:
! gcloud ai models upload --container-image-uri=<REGION>-docker.pkg.dev/<PROJECT>/<REPOS>/<MODEL>:latest --region=<REGION> --display-name=my-model
- Create an endpoint on Vertex AI:
! gcloud ai endpoints create --display-name=my-model-endpoint --region=<REGION>
- Deploy the model to the endpoint:
! gcloud ai endpoints deploy-model <ENDPOINT_ID> --model <MODEL_ID> -display-name my-model --traffic-split=0=100 --region <REGION> --machine-type=n1-standard-8 --enable-access-logging
Here’s the message:
Model server terminated: model server container terminated: exit_code: 0 reason: “Completed” started_at { seconds: 1684309447 } finished_at { seconds: 1684309449 } . Model server logs can be found at xxxxx
When i check the logs:
As you can see i have no errors.
What i tried:
-
upsized the machine-type
-
deployed with a python script
-
deployed manually using the portal
-
changed my model to a simple linear regression to test