I have a multinode cluster in Hadoop consisting of two machines(one name node and two data nodes in each machine).
I am using:
hadoop fs -put dir1 hdfspath
In the above command: will the data be distributed in both machines or only in one machine?
What is the way to balance : is it by using the hadoop balancer tool or there can be an automatic way for this?