I am using vertex AI online prediction with a custom container. To save on autoscaling time, I am using mutateDeployedModel API(https://cloud.google.com/vertex-ai/docs/reference/rest/v1/projects.locations.endpoints/mutateDeployedModel)
The strange issue that I am facing is, I am able to change minimum replicas a few times and then it stops working.
The API call was successful with replica_target got increased but actual replica count coun’t increase as shown in below screenshot:
- Is there any limit on the number of times, I can mutate a deployed model?
- Any other way of changing the nodes of the deployed model
Thanks in advance!
Regards,
Anil

