I use Datastream to transfer data from PostgreSQL to Cloud Storage. In the documentation it is written that there can be overlap between backfill and CDC, resulting in duplicate events. Event metadata should be used to remove duplicates. In article about events suggested to use uuid field to find duplicates.
I tried to find events with similar uuid, but turned out events from backfill have same uuid. How to find and remove duplicated events if there are any?