I'm new to Spark and try to read a sequence file and use it in a classification problem. Here is how I read the sequence file
val tfidf = sc.sequenceFile("/user/hadoop/strainingtesting/tfidf-vectors", classOf[Text], classOf[VectorWritable])
I don't know how to split each line of the sequence file by tab? i.e. how to get the Text value?
How can I use it for NAiveBayes classifier in Mllib?