To be specific, for example, given
hadoop jar hadoop-streaming.jar \
-input myInputDirs \
-output myOutputDir \
-mapper /bin/cat \
-reducer /usr/bin/wc
Where myInputDirs has a dated subfolder structure of
input_dir/yyyy/mm/dd/part-*
I want myOutputDir has the same dated subfolder structure:
output_dir/yyyy/mm/dd/part-*
Guess there should be an option to do this. Can "-partitioner" or any "-D" option make this?