I need to duplicate Hive tables on another cluster while keeping the schema and the hierarchy of my tables. What is the safest and most proper way to get exact copies of the tables (and databases) from Cluster1 on Cluster2?
The approaches I have found so far generally boil down to:
- hive> EXPORT TABLE TABLE1 TO 'source_Path'; # on Cluster1
- hadoop distcp hdfs:source_Path hdfs:dest_Path
- hive> IMPORT TABLE TABLE1 FROM 'dest_Path'; # on Cluster2
- hive> MSCK REPAIR TABLE TABLE1;
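To repeat these per-table steps for many tables, the statements could be generated from a list of names rather than typed by hand. The following is only a minimal sketch: the helper names `gen_export_sql` and `gen_import_sql`, the `db.table` input format, and the staging directory `EXPORT_ROOT` are all assumptions made for illustration and are not part of Hive.

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not part of Hive): read "db.table" names on stdin
# and print one line of HiveQL per table.
# EXPORT_ROOT is an assumed HDFS staging directory; adjust it to your layout.
EXPORT_ROOT="${EXPORT_ROOT:-/tmp/hive_export}"

# Statements to run on the source cluster (Cluster1).
gen_export_sql() {
  local name db tbl
  while IFS= read -r name; do
    [ -z "$name" ] && continue        # skip blank lines
    db="${name%%.*}"                  # text before the first dot
    tbl="${name#*.}"                  # text after the first dot
    printf "USE %s; EXPORT TABLE %s TO '%s/%s/%s';\n" \
      "$db" "$tbl" "$EXPORT_ROOT" "$db" "$tbl"
  done
}

# Statements to run on the target cluster (Cluster2), after the exported
# directories have been copied there with distcp.
gen_import_sql() {
  local name db tbl
  while IFS= read -r name; do
    [ -z "$name" ] && continue
    db="${name%%.*}"
    tbl="${name#*.}"
    printf "CREATE DATABASE IF NOT EXISTS %s; USE %s; IMPORT TABLE %s FROM '%s/%s/%s';\n" \
      "$db" "$db" "$tbl" "$EXPORT_ROOT" "$db" "$tbl"
  done
}
```

Assuming a file `tables.txt` with one `db.table` name per line, something like `gen_export_sql < tables.txt > export.hql` followed by `hive -f export.hql` on Cluster1 would run the exports, and the same pattern with `gen_import_sql` would apply on Cluster2 once the copy has finished. Each table is still exported and imported individually, so this only automates the steps above and does not replace them with a single snapshot.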
But since I have to copy a large number of databases and tables, is there a fast and safe way to do it, such as duplicating the state or a snapshot of Datawarehouse1 into Datawarehouse2?
Thanks in advance.