I have learnt the basics of Apache Hadoop Hive. And know majority of commands. Now, how to exactly work on the data. I have huge amt of data available with me(got it from a person). But dont know what exactly to do.
The data(.xlsx) is weekly sales, quarterly sales of a huge company (cant name it). The data is column wise sales of different products in different branches of US.
What processing can be done on this? Should I filter the data before doing that?