Loading a non-partitioned Hive managed table in ORC format takes 2 hours, whereas the same insert into a partitioned table takes more than 5 hours. Why does dynamic partitioning on a single column (date) make such a huge difference in performance? Any suggestions to improve the performance would be a great help.
It's about 500 million rows per day, and I am trying to load 3 months of data. Loading the 3 months into the non-partitioned table takes 2 hours, but the same load into the partitioned table takes 5 hours. Both the source and target tables are in ORC format.