Hi,
I have a Docker image that I can run with Batch. It reads the resources it needs from a bucket mounted as a gcs volume, as in the example here: https://cloud.google.com/batch/docs/create-run-job-storage#use-bucket
The issue is the very high latency when reading from the bucket and generating the intermediate files in it: the job takes more than 10 hours, whereas producing the same files locally takes ~2 hours on the same machine type. I see that one option is to use persistent disks to reduce latency, but I don't know how to attach and mount a new persistent disk while still being able to use the resources from the bucket. My intuition is to copy the needed resources from the bucket to the persistent disk, generate the intermediate files there, and finally copy the output back to a bucket. Would that be the right approach?
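
To make the idea concrete, here is a rough Python sketch of the copy-in, compute, copy-out pattern I have in mind. It is only an illustration under assumptions: it treats the bucket as an ordinary mounted directory, and the `input` and `output` subfolder names, the `stage_run_collect` helper, and the `generate` callback are hypothetical names I made up, not anything from Batch itself.

```python
import shutil
from pathlib import Path
from typing import Callable


def stage_run_collect(
    bucket_mount: Path,
    scratch: Path,
    generate: Callable[[Path, Path], None],
) -> Path:
    """Copy inputs to a fast local disk, do the work there, copy results back.

    bucket_mount: directory where the bucket is mounted (assumed layout:
                  an "input" subfolder to read, an "output" subfolder to write).
    scratch:      directory on the persistent disk used as working space.
    generate:     hypothetical callback that reads from the first path and
                  writes its files into the second path.
    """
    local_in = scratch / "input"
    local_out = scratch / "output"

    # 1. Read the resources from the bucket mount once, in bulk.
    shutil.copytree(bucket_mount / "input", local_in, dirs_exist_ok=True)
    local_out.mkdir(parents=True, exist_ok=True)

    # 2. Generate the intermediate files entirely on the local disk.
    generate(local_in, local_out)

    # 3. Copy the results back to the bucket mount.
    dest = bucket_mount / "output"
    shutil.copytree(local_out, dest, dirs_exist_ok=True)
    return dest
```

The point of the sketch is that the bucket is touched only twice (one bulk read, one bulk write), and all the small reads and writes in between happen on the persistent disk. It needs Python 3.8+ for `dirs_exist_ok`.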
Thanks in advance for any help!
Diego