0

The task is to save dataframe as sequenceFile to hdfs.

Following this advice Can I create sequence file using spark dataframes? I tried the following approach:

val data = sourceDf.repartition(1).map{ (NullWritable.get(), _) }
//sourceDf is a source DataFrame 

val seq = new SequenceFileRDDFunctions(data) 
seq.saveAsSequenceFile(s"hdfsPath", None)

But constructor of SequenceFileRDDFunctions requires second argument

What am I doing wrong?

Jelly
  • 972
  • 1
  • 17
  • 40

0 Answers0