I am working on a data engineering case where i have a table Table_Movie
partitionned by ingest date
. Now, from time to time, i receive some old data. And I need to perform operations based on business date
.
For example : Today, I received new data corresponding to the date 12/05/2020:
- I need to delete in
Table_Movie
the data corresponding to thebusiness date
12/05/2020 - Once the old data deleted, i need to insert the new data corresponding to 12/05/2020.
Would it improve performance to partition by ingest date
and business date
? :
==> eg. Table_Movie/ingestdate=20220812/business_date=20200512/
Note : we need to keep the partitionning with ingest date
for other reasons.
Please let me know if you have any suggestion or guidance