Hello.
We use Cloud Run to spin up a bunch of HTTP servers, that all receive traffic from our user’s via Fastly. Now we want to implement rate-limiting based on IPs of our clients, in order to prevent resource starvation.
I was researching how people regularly do this on GCP, but couldn’t find any helpful resources. I’ve seen people recommending various different solutions: Load Balancer, Cloud Endpoints, Cloud Armor, Apigee etc.
What would be the easiest way to do this? We’d like to ideally avoid keeping the state ourselves (i.e. Redis).
P.S. For various reasons, we can’t do it in Fastly.