I followed the instructions and script at https://cloud.google.com/kubernetes-engine/docs/how-to/migrate-containerd and ran:
gcloud container clusters upgrade 'xxxx-production' --project 'project-id-xxxxx' --zone 'us-xxxxx' --image-type 'COS_CONTAINERD' --node-pool 'default-pool'
This resulted in the following error message:
All nodes in node pool [default-pool] of cluster [xxxxx-production] image will change from COS to COS_CONTAINERD. This operation is long-running and will block other operations on the
cluster (including delete) until it has run to completion.
Do you want to continue (Y/n)? Y
Upgrading xxxx-production... Updating default-pool, done with 0 out of 3 nodes (0.0%): 1 being processed...done.
ERROR: (gcloud.container.clusters.upgrade) Operation [<Operation
clusterConditions: [<StatusCondition
canonicalCode: CanonicalCodeValueValuesEnum(NOT_FOUND, 5)
message: 'Google Compute Engine: Managed instance gke-xxx-default-pool-4a9ae595-tuog not found.'>]
detail: 'Google Compute Engine: Managed instance gke-xxx-default-pool-4a9ae595-tuog not found.'
endTime: '2023-05-13T01:08:09.455974926Z'
error: <Status
code: 5
details: []
message: 'Google Compute Engine: Managed instance gke-xxx-default-pool-4a9ae595-tuog not found.'>
name: 'operation-1683938273861-...........'
nodepoolConditions: []
operationType: OperationTypeValueValuesEnum(UPGRADE_NODES, 4)
progress: <OperationProgress
metrics: [<Metric
intValue: 3
name: 'NODES_TOTAL'>, <Metric
intValue: 1
name: 'NODES_FAILED'>, <Metric
intValue: 0
name: 'NODES_COMPLETE'>, <Metric
intValue: 1
name: 'NODES_DONE'>, <Metric
intValue: 0
name: 'NODE_PDB_DELAY_SECONDS'>]
stages: []>
selfLink: 'https://container.googleapis.com/v1/projects/..........'
startTime: '2023-05-13T00:37:53.861775789Z'
status: StatusValueValuesEnum(DONE, 3)
statusMessage: 'Google Compute Engine: Managed instance gke-xxxx-default-pool-4a9ae595-tuog not found.'
targetLink: 'https://container.googleapis.com/v1/projects/....'
zone: 'us-xxx'>] finished with error: Google Compute Engine: Managed instance gke-xxx-default-pool-4a9ae595-tuog not found.
Any ideas on how to debug this? The same command worked flawlessly on both of our staging clusters.
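
For reference, here is a rough sketch of read-only checks that might help narrow this down. It is not a verified fix. It assumes `gcloud` and `kubectl` are already authenticated against this project and cluster. `OPERATION_ID` and `MIG` are placeholders (the operation name is truncated in the output above, and I have not looked up the managed instance group name), and the `run` and `inspect_failed_upgrade` helpers are hypothetical names used only for illustration. By default the script only prints the commands; set `DRY_RUN=0` to actually execute them.

```shell
#!/usr/bin/env bash
# Sketch: read-only inspection after a failed node pool upgrade.
# Every value below is a placeholder taken from (or standing in for) the
# redacted output above; override them through the environment.
PROJECT="${PROJECT:-project-id-xxxxx}"
ZONE="${ZONE:-us-xxxxx}"
CLUSTER="${CLUSTER:-xxxx-production}"
NODE_POOL="${NODE_POOL:-default-pool}"
OPERATION_ID="${OPERATION_ID:-OPERATION_NAME_PLACEHOLDER}"  # assumption: full operation name
MIG="${MIG:-MIG_NAME_PLACEHOLDER}"                          # assumption: managed instance group name

# Hypothetical helper: print the command unless DRY_RUN=0 is set.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Hypothetical helper: gather the state of the operation, pool, MIG and nodes.
inspect_failed_upgrade() {
  # 1. Full details of the failed upgrade operation.
  run gcloud container operations describe "$OPERATION_ID" \
    --project "$PROJECT" --zone "$ZONE"
  # 2. Node pool status and the instance group URLs backing it.
  run gcloud container node-pools describe "$NODE_POOL" \
    --cluster "$CLUSTER" --project "$PROJECT" --zone "$ZONE"
  # 3. Instances the managed instance group currently knows about, to see
  #    whether the instance named in the error is still listed.
  run gcloud compute instance-groups managed list-instances "$MIG" \
    --project "$PROJECT" --zone "$ZONE"
  # 4. Nodes as Kubernetes sees them, including OS image and runtime.
  run kubectl get nodes -o wide
}
```

Calling `inspect_failed_upgrade` prints the four commands in order. Comparing the instance list from step 3 with the node list from step 4 should show whether the instance named in the error still exists anywhere.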

