Below is the code for a Hadoop Reducer, I am not able to understand why the comparison(placed between slashes) always failing, here we are comparing two Text type values. This code is for a Reducer doing Inverted Indexing.
public static class IntSumReducer
extends Reducer<TextPair, Text, Text, Text>{
private Text indexedData = new Text();
public void reduce(TextPair key, Iterable<Text> values, Context context)
throws IOException, InterruptedException {
Iterator<Text> itr = values.iterator();
Text oldValue = itr.next() ;
String old = oldValue.toString();
//String next;
int freq = 1;
Text nextValue = null;
StringBuilder stringBuilder = new StringBuilder();
if(itr.hasNext()==false) {
stringBuilder.append(old + 1);
}
while(itr.hasNext()) {
nextValue = itr.next();
int compareValue = oldValue.compareTo(nextValue);
while(compareValue == 0) {
freq++;
if(itr.hasNext()) {
nextValue = itr.next();
////////////////////////////
// following comparison always returning zero
// Although values are changing
compareValue = oldValue.compareTo(nextValue);
///////////////////////////
System.out.println(compareValue);
} else {
freq++;
System.out.println("Break due to data loss..");
break;
}
}//end while
System.out.println("Value Changed..");
old = old + freq;
stringBuilder.append(old);
stringBuilder.append(" | ");
oldValue = nextValue;
old = nextValue.toString();
freq = 1;
}//endwhile
//System.out.println("KEY :: " + key.toString());
context.write(key.getFirst(),new Text(stringBuilder.toString()));
}
}
Any help is appreciated as I am completely new to this area.