I have two Hive tables named omniture3 and omniture4.
The size of each table is 50*20.
I am executing a JOIN query on the tables, but it is taking too long to execute!
The Hadoop cluster consists of a single node.
I know that Hive queries take a long time to execute on big tables, so I reduced the table size, but the query still does not finish (it has been running for more than half an hour and is still stuck).
The query is as follows:
SELECT omniture3.*, omniture4.* FROM omniture3 JOIN omniture4 on (omniture3.id = omniture4.id) LIMIT 10;
How long do such queries generally take for tables of this size, and if this duration is normal, how can I speed up the process?
Edit: I have a cluster with 2 nodes (1 datanode and 1 namenode), and each node is allocated 4 GB of RAM.