We use Datastream to mirror (Cloud SQL) Postgres tables in BigQuery. We are hitting an issue when trying to replicate **partitioned** Postgres tables into BQ: our Datastream pipelines never finish backfilling, and the data throughput is 10x or even 100x higher than the total size of the target tables. Is there someone who can help us debug this? We have experimented with our Cloud SQL Postgres setup in accordance with the docs and looked around online, but we haven’t found a solution. If we can’t resolve this, it will be a deal breaker for using Datastream.
Handling partitioned tables seems like table stakes for a CDC system, so I’m hoping there is a simple solution.