How can map-reduce job generate metrics about how many keys it has processed and give data like the following?
% of keys that belonged to this particular value.
How can map-reduce job generate metrics about how many keys it has processed and give data like the following?
% of keys that belonged to this particular value.
Use a custom counter in reduce method and increment it every-time the reduce method is called. Please note that the reduce method is called only once for a given key. Once the job is completed successfully, this custom counter will appear in the log.