What are some methods for finding X data ranges in Hadoop so that one can use these ranges as partitions in the reducer step?

syker

1 Answer

It sounds like you need something like TotalOrderPartitioner, which produces a total order across reducers by reading split points from an externally generated source. You might find this link useful: http://chasebradford.wordpress.com/2010/12/12/reusable-total-order-sorting-in-hadoop/
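
To make the idea concrete, here is a minimal plain-Java sketch of range partitioning from split points. It is not Hadoop's implementation and uses no Hadoop classes: the class name RangePartitionSketch and the methods splitPoints and partitionFor are hypothetical names chosen for illustration, and it assumes integer keys and a sample with distinct values. The approach is to sort a sample of the keys, pick evenly spaced split points, then binary search those split points to assign each key to a partition.

```java
import java.util.Arrays;

// Hypothetical illustration only; not part of Hadoop.
class RangePartitionSketch {

    // Derive numPartitions - 1 split points from a sample of keys by
    // taking evenly spaced elements of the sorted sample.
    static int[] splitPoints(int[] sample, int numPartitions) {
        int[] sorted = sample.clone();
        Arrays.sort(sorted);
        int[] splits = new int[numPartitions - 1];
        for (int i = 1; i < numPartitions; i++) {
            int index = (int) ((long) i * sorted.length / numPartitions);
            splits[i - 1] = sorted[index];
        }
        return splits;
    }

    // Map a key to a partition: keys below splits[0] go to partition 0,
    // keys in [splits[k-1], splits[k]) go to partition k, and keys at or
    // above the last split point go to the last partition.
    // Assumes the split points are sorted and distinct.
    static int partitionFor(int key, int[] splits) {
        int pos = Arrays.binarySearch(splits, key);
        // A non-negative result means the key equals a split point;
        // a negative result encodes -(insertionPoint) - 1.
        return pos >= 0 ? pos + 1 : -(pos + 1);
    }

    public static void main(String[] args) {
        int[] sample = {5, 1, 9, 3, 7, 2, 8, 4, 6, 10};
        int[] splits = splitPoints(sample, 3);
        System.out.println("split points: " + Arrays.toString(splits));
        for (int key : new int[] {1, 4, 6, 7, 10}) {
            System.out.println(key + " -> partition " + partitionFor(key, splits));
        }
    }
}
```

Because every key in partition k is smaller than every key in partition k + 1, concatenating the sorted outputs of the partitions in order gives a totally ordered result. In a real job the sample would come from the input data and the split points would be written to a file for the partitioner to read.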

I'm not sure this is exactly what you need. Apologies if I have got it wrong.

Tariq