Hello Team,
I am trying to run a cloud batch job (container runnable) using batch_v1 python API using an instance template and I am getting the below error which seems Ito me it’s not able to find the persistent disk when the persistent disk can be seen under google compute/disks.
As per the volume method dis is a string and I am passing it as below:
resources = batch_v1.ComputeResource()
# The milliCPU count.
# cpuMilli defines the amount of CPU resources per task in milliCPU units.
# For example, 1000 corresponds to 1 vCPU per task.
resources.cpu_milli = 4000
resources.memory_mib = 16000
volume = batch_v1.Volume()
volume.device_name = "google-xxxxxxxx-gke-batch-dev-08ab6cad-7"
volume.mount_path = "/mnt/disks/batch"
volume.mount_options = ["rw", "async"]
Can some one help me debug this issue or point me if I am missing anything here.
Error Description:
Job gets non-retryable information Batch Error: code - CODE_VOLUME_INVALID_ARGUMENT, description - when mounting device, the job abc-orch-202-580b1037-5e67-47330 in project 999999999999 cannot find valid PD info from vmSchedulingInfo map[group0:vm_schedulings:{vm:{machine_type:“e2-standard-4” cpu_milli:4000 memory_mib:16384 boot_disk:{new_disk:{type:“pd-standard” size_gb:200 disk_interface:“SCSI” image:“projects/xxxxxxxxx-xxxx-xxxx/global/images/xxxx-golden-rhel-8-2023-08-11t07-12-33z”} device_name:“abc-gke-batch-dev-08ab6cad-7”} network:{network_interfaces:{network:“https://www.googleapis.com/compute/v1/projects/xxxxxxxxx/global/networks/vpc-xxxx” subnetwork:“https://www.googleapis.com/compute/v1/projects/xxxxxxxxxregions/us-central1/subnetworks/xxxxxxx” no_external_ip_address:true}} instance_template:“abc-gke-batch-dev-08ab6cad-7”} task_pack:1}], which should not happen.
Thanks!
G