I heard we can use multiple mappers to read different parts of one bzip2 file in parallel in Hadoop, to increase performance. But I cannot find related samples after search. Appreciate if anyone could point me to related code snippet. Thanks.
BTW: is gzip has the same feature (multiple mapper process different parts of one gzip file in parallel).