Questions tagged [sequencefile]

A SequenceFile is a Hadoop binary file containing key/value pairs.

A SequenceFile is a file format used by Hadoop for the efficient storage and retrieval of key/value pairs. It is also possible to use compression techniques for more efficient storage.

For more information view the API documentation or the Wiki page.

157 questions
0
votes
1 answer

sequence files created by HBASE export utility aren't readable

I tried the HBase export tool to transfer a table to HDFS. I tried to hadoop dfs -text the file to see a collection of contents. However, I got a fatal error: java.lang.RuntimeException: java.io.IOException: WritableName can't load class:…
user2325080
  • 47
  • 2
  • 7
0
votes
1 answer

how to limit size of Hadoop Sequence file?

I am writing Hadoop seq file using txt as input. I know how to write Sequence file from text file. But i want to limit the output sequence file to some specific size say, 256MB. Is there any inbuilt method to do this?
Pradeep Bhadani
  • 4,435
  • 6
  • 29
  • 48
0
votes
2 answers

use SequenceFile class write file

I use the follow codes to write some data into a SequenceFile Format file . when the program run a while , I interrupte the program via red button on eclipse console . however , when I check the data file on hdfs , the sequence file's size is zero.…
user2193944
  • 11
  • 1
  • 1
0
votes
1 answer

Write binary data from HDFS files to SequenceFile

I a lot of files in HDFS and want to copy them into sequence files by MR job. The key type of the seq file is TEXT (I use SHA1), and the value type is BytesWritable(the file content). I find some example code reads all the file content into a byte…
avhacker
  • 667
  • 1
  • 9
  • 20
0
votes
1 answer

Read Hadoop SequenceFile: weird hex number stream

I am trying to convert a piece of Hadoop SequenceFile into plain text with the following code: Configuration config = new Configuration(); Path path = new Path( inputPath ); SequenceFile.Reader reader = new…
Yuhao
  • 1,570
  • 1
  • 21
  • 32
0
votes
1 answer

Is there an efficient way to find the sequence file for a given key?

My map-reduce job writes the final results to sequence files. Is there any efficient way to find the sequence file for a given key?
flyingfromchina
  • 9,571
  • 12
  • 35
  • 38
0
votes
0 answers

How I can create sequence file in hadoop with API?

I want create sequence file form array list of files address with haddop's API in one machine. Then give this output to sparse-vector and then clustering. For do sparse and clustering, I get code from here. ReutersToSparseVectors and…
1 2 3
10
11