Questions tagged [bzip2]

bzip2 is a Unix command used for compression and decompression of files. The main advantage of bzip2 is that it has a high compression ratio with reasonable speed.

bzip2 is one of the most widely used free compression programs for the terminal.

It typically compresses files to within 10% to 15% of the best available techniques (the PPM family of statistical compressors), whilst being around twice as fast at compression and six times faster at decompression than those techniques.

The current version is 1.0.6, released 20 Sept 2010.
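
As a rough illustration of the compress/decompress round trip described above, here is a minimal sketch using Python's standard `bz2` module, which wraps the same library as the command-line tool. The sample payload is made up for the example, and the actual ratio will depend on your data.

```python
import bz2

# Hypothetical sample payload; highly repetitive text compresses well.
original = b"bzip2 compresses repetitive data well. " * 1000

# compresslevel ranges from 1 to 9; 9 is the default and uses the largest block size.
compressed = bz2.compress(original, compresslevel=9)
restored = bz2.decompress(compressed)

print(len(original), "->", len(compressed), "bytes")

# Every bzip2 stream starts with the magic bytes b"BZh", followed by the level digit.
print(compressed[:4])

# The round trip must reproduce the input exactly.
assert restored == original
```

Lower `compresslevel` values use smaller blocks and less memory, which may be worth trying when decompression speed or memory matters more than size.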

327 questions
1
vote
1 answer

How to read a .gz or .bzip2 file in Java

I have .gz and .bzip2 files and I need them to be extracted and displayed. I looked in a couple of places, and they mention using the zip4j utility. Can I use this to extract and display the files? Please let me know. I referred to the post Uncompress BZIP2…
Akshitha
  • 89
  • 1
  • 2
  • 7
1
vote
1 answer

How can I control file compression parameters in Java to decompress objects faster?

I am writing to a file some large Java object I created, and later reading it back. I am using compression since the object is pretty large and I have around 600 different instances of it (each one in a separate file). I am currently using bzip2…
OferBr
  • 297
  • 2
  • 13
1
vote
1 answer

How to Bunzip a file stored

I have to archive the HDFS files frequently. The files have to be compressed in the Bunzip format using Java code. Now, what I did is the following: move the input files to a local location (hdfs.moveToLocalFile), compress them using the bzip2 command, move…
Anju
  • 631
  • 2
  • 9
  • 25
1
vote
2 answers

Write file data as Bzip2 to output of servlet response

I'm trying to get Tomcat to write out the servlet contents as a bzip2 file (Silly requirement perhaps but it's apparently necessary for some integration work). I'm using the Spring framework so this is in an AbstractController. I'm using the bzip2…
NeilInglis
  • 3,431
  • 4
  • 30
  • 31
1
vote
1 answer

How to convert gzip to bzip2 via HDFS / Hadoop

I have a ton of data files coming in from a client, all gzipped. I want them in .bzip2 as that is splittable and preferable for the intense analysis I have ahead. Full disclosure: I use Hive and generally have yet to do more than very basic hadoop…
Todd Curry
  • 1,045
  • 1
  • 10
  • 23
1
vote
0 answers

Hive Reading external table from compressed bz2 file

This is my scenario. I have a bz2 file in Amazon S3. Within the bz2 file, there are files with .dat, .met, and .sta extensions. I am only interested in files with the .dat extension. You can download this sample file to take a look at the bz2 file. create external…
prog_guy
  • 796
  • 3
  • 7
  • 24
1
vote
1 answer

How to estimate progress of decompress of bzip2 file using C function?

I could use gzoffset function in zlib to estimate the remaining uncompress file size. Is there a similar function in bzip2 library? If not, is there any trick that I can use?
Sangcheol Choi
  • 841
  • 1
  • 12
  • 19
1
vote
1 answer

How to zip, unzip files in Objective-C (arm64 support)

I am looking for a way to zip and unzip files on iOS with arm64 support. Is there already a ready-to-use way of zipping and unzipping files with arm64 support? (Best if there is also a way to integrate 7zip, bzip2, gzip, rar) I am using SSZipArchive…
David Gölzhäuser
  • 3,525
  • 8
  • 50
  • 98
1
vote
1 answer

Python bz2 in C#

I got several strings like: BZh91AY&SYA\xaf\x82\r\x00\x00\x01\x01\x80\x02\... After googling a while, I found that those seem to be Python bz2-encoded strings, since "BZh91AY" appears to be a bz2 standard header. Now I need to decode such strings.…
HiperiX
  • 395
  • 5
  • 19
1
vote
1 answer

How to efficiently parse large bz2 xml file in C

What I want to do: download OSM (OpenStreetMap) data at regular intervals (or update it using diffs); parse that data, which is bzip2-compressed XML, and store the parts relevant to me in my database, as memory+cpu efficient as possible (runtime…
griffin
  • 1,261
  • 8
  • 24
1
vote
1 answer

Problems with ROOT and libbz2

I'm trying to use the libbz2 library with the ROOT framework but I haven't had any success. I wrote a test app without ROOT using libbz2 and it works fine. Here is my code: #ifndef BZlib_H #define BZlib_H #include #include class…
mortalland
  • 57
  • 1
  • 6
1
vote
1 answer

Optimal compression of csv files with repeated rows

I wanted to know if there was some optimal way to compress a csv file which has millions of rows that are repeated/duplicated sequentially. Each row has 4-5 elements. There are only a few hundred unique rows, but because each of them appears so…
xbsd
  • 2,438
  • 4
  • 25
  • 35
1
vote
1 answer

Protobuf-net disable seeking

I am using protobuf-net in a network client/server application. Each packet of data is zipped with a BZip2Input/Output stream to compress my messages. This has worked fine for several weeks, but I have recently modified the server's objects, and am…
Superman
  • 3,686
  • 6
  • 34
  • 46
1
vote
1 answer

ruby 2.0 bz2 gem installation in MacPorts MacOS

Could anyone provide guidance on how to install bzip2-ruby for ruby-2.0 in a MacPorts environment? When I installed it, it complained about version.h not being found. I symlinked version.h to ruby-2.0.0/version.h and then I got these: ~ >$…
TX T
  • 817
  • 2
  • 16
  • 25
1
vote
1 answer

Python bz2 - text vs. interactive console (data stream)

I was using bz2 earlier to try to decompress an input. The input that I wanted to decode was already in compressed format, so I decided to input the format into the interactive Python console: >>> import bz2 >>> bz2.decompress(input) This worked…
Zhouster
  • 746
  • 3
  • 13
  • 23