gurus!
A long time i can't found answer on following question: how hadoop splitting big file during writing. Example: 1) Block size 64 Mb 2) File size 128 Mb (flat file, containing text).
When i writing file it's will be split at 2 part (file size / block size). But... Could occurrence following Block1 will be ended at ... word300 word301 wo and Block 2 will be start rd302 word303 ... Write case will be
Block1 will be ended at ... word300 word301 and Block 2 will be start word302** word303 ...
or can you link at the place where write about hadoop splitting algoritms.
Thank you in advance!