I have a complex HQL script that joins a number of tables and also uses union, row_number and grouping sets.
I know that the ODS data involved in this task totals 40GB, but I am confused about how to estimate the number of mappers and reducers it will use, and how many CPU cores and how much memory (i.e. how many containers) it will cost.
Any help is appreciated.