On a Hadoop cluster, where should Hive be installed: on a DataNode or on the NameNode? What factors decide which node to install it on?
Thanks!
Where Hive is installed does not depend on whether the machine is the NameNode or a DataNode. Hive's configuration only needs to know where Hadoop is installed so that it can reach the JobTracker.
Once Hive knows where the JobTracker is running, every query you execute in Hive is converted into one or more MapReduce jobs, which Hive submits to Hadoop's JobTracker. The JobTracker then runs these jobs, and the output is displayed or stored.
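To make this concrete, here is a rough Python sketch of launching a query through the Hive command line from any machine that has Hadoop installed. It assumes the `hive` launcher is on the PATH and that Hive finds Hadoop through the `HADOOP_HOME` environment variable; the path `/opt/hadoop`, the table name `my_table`, and the helper names are made up for illustration only.

```python
import os
import shlex


def build_hive_invocation(query, hadoop_home, base_env=None):
    """Return (argv, env) for running one query through the Hive CLI.

    hadoop_home is the local Hadoop installation directory (an assumed
    path supplied by the caller); Hive uses it to find the Hadoop client
    configuration and, through that, the cluster.
    """
    env = dict(os.environ if base_env is None else base_env)
    # Point Hive at the local Hadoop installation.
    env["HADOOP_HOME"] = hadoop_home
    # "hive -e <query>" runs a single query string and exits.
    argv = ["hive", "-e", query]
    return argv, env


def describe(argv):
    """Render argv as a shell-style command line, for logging only."""
    return " ".join(shlex.quote(arg) for arg in argv)


# Hypothetical usage; nothing is executed here.
argv, env = build_hive_invocation(
    "SELECT COUNT(*) FROM my_table",  # my_table is a placeholder name
    hadoop_home="/opt/hadoop",        # placeholder path
    base_env={},
)
# To actually run it on a machine with Hive and Hadoop installed:
#   import subprocess
#   subprocess.run(argv, env={**os.environ, **env}, check=True)
```

Note that nothing in this sketch refers to the NameNode or a DataNode: the only thing the machine needs is a Hadoop installation that Hive can find.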