I am a newbie to Hadoop and Mahout. I wanted to know how to convert a simple text file containing a set of vectors to sequence file. I have tried the MR framework and changed outputFormat to SequenceFileOutputFormat, and I get following output
SEQorg.apache.hadoop.io.Textorg.apache.hadoop.io.Text��.�U_v�;�Vs�'�sample0 1 2 3 4 5sample1 6 7 8 9 10sample211 12 13 14 15sample316 17 18 19 20
Those hazy characters are in binaries so can't be read but my issue is how to get sample0 1 2 3 4 , similarly others to SequenceFile format (binary format).
I believe it can be done by changing the output of mapper function, however I am unable to figure it out.
-Thanks for your time.