In my batch processing data pipeline I have transactions with booking date and accounting date, the transactions in the same time window have the same booking date and within 2 mins time window, booking date is just several minutes earlier than processing time in my data pipeline while accounting date could be earlier or later than booking date.
When querying these transactions accounting date is always in the SQL condition. So I think accounting date should be the partition key.
But when I think of writing part I'm not sure anymore. Is it better to write to more partitions(less hotspot?) or less partitions?
Is it better to use booking date or accounting date as partition key? and why?