Process a queue of long-running tasks with a cap on concurrent processing

Example: Clients upload videos for processing, and each video takes ~5 hours to process. I need to cap concurrent processing per client.

What are the best practices to implement this flow in the most serverless way possible?

Here are some ideas I have, but each comes with its own cons.

Option 1:
Cloud Tasks → Cloud Run (enforces the cap) → Cloud Run Job or Batch

  • Cons: Must track running jobs externally (e.g., a per-client counter) to enforce the limit; see the sketch after this list
  • Cons: Tasks rejected at the cap go into retry backoff, so they may sit idle even when no jobs are running and a slot is free
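
For the external tracking in Option 1, one approach is a transactional counter per client, e.g. in Firestore. Here's a minimal sketch of the Cloud Run handler side, assuming a Firestore collection `job_slots`, a pre-created Cloud Run Job, and names (`my-project`, `video-processor`, `handle_task`) that are all hypothetical:

```python
from google.cloud import firestore, run_v2

db = firestore.Client()
jobs = run_v2.JobsClient()

# Hypothetical fully qualified job name; adjust project/region/job.
JOB_NAME = "projects/my-project/locations/us-central1/jobs/video-processor"

@firestore.transactional
def try_acquire_slot(transaction, client_id: str, limit: int) -> bool:
    """Atomically increment the client's running-job counter if below the cap."""
    ref = db.collection("job_slots").document(client_id)
    snap = ref.get(transaction=transaction)  # reads must precede writes
    running = (snap.to_dict() or {}).get("running", 0) if snap.exists else 0
    if running >= limit:
        return False
    transaction.set(ref, {"running": running + 1}, merge=True)
    return True

def handle_task(client_id: str, limit: int = 3) -> bool:
    """Called per Cloud Tasks delivery; a False return should map to a
    non-2xx response so Cloud Tasks retries later."""
    if not try_acquire_slot(db.transaction(), client_id, limit):
        return False
    # Fire and forget; the job execution runs for hours independently.
    jobs.run_job(name=JOB_NAME)
    return True
```

The job itself has to decrement the counter when it exits, on success or failure (e.g., in a `finally` block), otherwise slots leak and the cap locks up.
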

Option 2:
GKE or Cloud Run workers pulling from Pub/Sub

  • Cons: GKE requires managing a Kubernetes cluster
  • Cons: Pub/Sub's ack deadline maxes out at 600 seconds, far below a ~5-hour job, so workers must keep extending the message lease and handle retry logic on error; see the sketch after this list
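
For the ack-deadline con in Option 2, a worker can use synchronous pull and keep renewing the lease with `modify_ack_deadline` while the video is processed. A rough sketch, where the project/subscription names and `process_video` are hypothetical, and the subscription's default ack deadline is assumed to be longer than the renewal interval:

```python
import threading

from google.cloud import pubsub_v1

subscriber = pubsub_v1.SubscriberClient()
# Hypothetical project and subscription names.
SUB_PATH = subscriber.subscription_path("my-project", "video-jobs-sub")

def process_video(data: bytes) -> None:
    ...  # the actual ~5-hour processing

def run_with_lease(ack_id: str, message) -> None:
    done = threading.Event()

    def extend_lease() -> None:
        # Ack deadlines cap at 600s, so renew well before expiry for the
        # whole multi-hour run. If the worker crashes, renewals stop and
        # Pub/Sub redelivers the message -- that is the retry path.
        while not done.wait(timeout=240):
            subscriber.modify_ack_deadline(
                subscription=SUB_PATH, ack_ids=[ack_id], ack_deadline_seconds=600
            )

    threading.Thread(target=extend_lease, daemon=True).start()
    try:
        process_video(message.data)
        subscriber.acknowledge(subscription=SUB_PATH, ack_ids=[ack_id])
    finally:
        done.set()  # stop renewing; an unacked message gets redelivered

def main() -> None:
    while True:
        # max_messages=1: each worker handles one video at a time. Note this
        # caps concurrency per worker, not per client; the per-client cap
        # still needs separate enforcement.
        resp = subscriber.pull(subscription=SUB_PATH, max_messages=1)
        for received in resp.received_messages:
            run_with_lease(received.ack_id, received.message)

if __name__ == "__main__":
    main()
```

The high-level streaming-pull client can do this renewal automatically if you raise `FlowControl.max_lease_duration`, but the manual version above makes the lease mechanics explicit.
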