I am trying to upload 4K files (approx) of size 5G each on HDFS for processing. I am using the command way to do this:
Iterating over each file -
hadoop fs -copyFromLocal "LocalPath" "HDFSPath"
It is taking a lot of time. Is there a faster way to do this? Does block size matter here?
Thanks in advance.