I've got a Hadoop SequenceFile where the key is IntWritable and the value is some arbitrary Java class implementing Writable, and with an interesting toString() method. I would love to make a two column Hive table where the first column is the key as an int and the second column is the value as a string or varchar.
I would love to do this in the most tasteful and easiest way possible: I shouldn't have to write 200 lines of code to say "just decode this and then call toString()".
My current solution is just to do an extra MapReduce job to put the thing in the format I want before inputting it into Hive, but I find this is offensive for obvious reasons.
Thanks!