Questions tagged [bzip2]

bzip2 is a Unix command used for compression and decompression of files. The main advantage of bzip2 is that it has a high compression ratio with reasonable speed.

bzip2 is one of the most widely used free compression programs for the terminal.

It typically compresses files to within 10% to 15% of the best available techniques, whilst being around twice as fast at compression and six times faster at decompression.
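As a rough illustration of what compression and decompression look like in practice, here is a minimal sketch using Python's standard-library `bz2` module, which wraps the same algorithm. The helper name `roundtrip` and the sample data are assumptions made for this example, and the ratio you see depends heavily on the input.

```python
import bz2


def roundtrip(data, level=9):
    """Compress data with bzip2, then decompress it again.

    `roundtrip` is a hypothetical helper for this sketch; `level` maps to
    bz2's compresslevel (1 = fastest, 9 = smallest output).
    """
    compressed = bz2.compress(data, compresslevel=level)
    restored = bz2.decompress(compressed)
    return compressed, restored


# Highly repetitive sample input, chosen only so the size reduction is obvious.
sample = b"bzip2 compresses repetitive data well. " * 1000
compressed, restored = roundtrip(sample)

print("original bytes:  ", len(sample))
print("compressed bytes:", len(compressed))
print("lossless:        ", restored == sample)
```

Real-world files rarely shrink as dramatically as this artificial input, so treat the printed sizes as a demonstration rather than a benchmark.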

The current version is 1.0.6, released 20 Sept 2010.

327 questions
0
votes
0 answers

Merging small .bz2 compressed files on HDFS increases the total size

I have 300K small .bz2 files on HDFS. I am writing a Pig job to merge all the files and produce 500 output .bz2 files. The total size of the small files was 300GB. After merging, the total size of the 500 merged files is around 500GB. This is the…
0
votes
1 answer

Installing libbzip2 on Windows

I am trying to install libbzip2 on Windows 7. (Actually I want to install the wp2txt package, but it seems to need libbzip2.) I installed "Bzip2 for Windows" as suggested by Alex, but I get the following errors. C:\Users\John>gem install bzip2-ruby --…
Handsome Nerd
  • 17,114
  • 22
  • 95
  • 173
0
votes
2 answers

Extremely slow R code and hanging

Calling read.table() function (on a CSV file), as follows: download.file(url, destfile = file, mode = "w") conn <- gzcon(bzfile(file, open = "r")) try(fileData <- read.table(conn, sep = ",", row.names = NULL), silent = FALSE) produces the…
Aleksandr Blekh
  • 2,462
  • 4
  • 32
  • 64
0
votes
1 answer

Unzip and save bz2 file in Objective-c

I need to decompress a gzip file that contains other files and directories, and save these files separately. Is it possible to do this in iOS? My compressed file has the following structure file.tar.bz2    - db_update.sql    - Tmp /      -…
Ramon Vicente
  • 761
  • 8
  • 12
0
votes
1 answer

bzip2 - Bzipping all files inside folders (Windows)

I have a bzipping tool on my computer, but it only bzips files that are inside the "compress" directory. How would I make it so files inside all directories inside the compress directory are zipped? Example compress/image.png goes to…
JakeGriffin
  • 223
  • 1
  • 6
  • 17
0
votes
1 answer

File Validation before loading into Hadoop MR

I have a folder of input bz2 files, and some of them might be corrupted. I want to remove all the corrupted/invalid bz2 files before running my MR job. What's a good way of doing it?
Thomas Kao
  • 139
  • 2
  • 7
0
votes
1 answer

Failing to compile/install bzip2-ruby on Xubuntu 13.04 (x64)

I'm attempting to install the bzip2-ruby gem on my Xubuntu 13.04 x64 system. I'm using RVM (Using ruby 2.0.0), and installing via the following command: gem install bzip2-ruby It fails to install, and this is the output I receive (taken from the…
MisutoWolf
  • 1,133
  • 1
  • 18
  • 33
0
votes
1 answer

Python append to tarfile in parallel

import tarfile from cStringIO import StringIO from io import BytesIO as BIO unique_keys = ['1:bigstringhere...:5'] * 5000 file_out = BytesIO() tar = tarfile.open(mode='w:bz2', fileobj=file_out) for k in unique_keys: id, mydata, s_index=…
Giorgos Komnino
  • 433
  • 1
  • 7
  • 20
0
votes
1 answer

How to use SharpCompress' BZip2Stream to compress a string?

I am trying to compress a string (str) using SharpCompress' BZip2Stream but am unable to achieve it. The following is the code I have so far: public static string Compress(string str) { var data = Encoding.UTF8.GetBytes(str); using (MemoryStream…
user677607
0
votes
3 answers

Uncompressing bzip2 in an iOS app

I'm working on an iOS app that needs to download ~50 MB of data from time to time. bzip2 gives me the best compression rate (reduces the size to 8 MB). The problem is: how to decompress the data in the app? I've done some research - the only two…
Bartosz Marcinkowski
  • 6,651
  • 4
  • 39
  • 69
0
votes
0 answers

downloading and unzipping - simplifying client code

I seem to need your help again :/ given the following code: /** * downloads a url (file) to a desired file name */ def downloadFile(url: URL, filename: String) { commonOp(url2InputStream(url), filename) } /** * common method for…
Stefan Kunze
  • 741
  • 6
  • 15
0
votes
0 answers

Using SevenZipSharp to extract a bzip2 file adds a needless tar extension to the file, and can't get the right archive information?

Using the following code to extract a bzip2 file to a specified directory: string[] archive_files = null; using (SevenZip.SevenZipExtractor extrator = new SevenZip.SevenZipExtractor(source_file)); { extrator.ExtractArchive(dest_directory); …
0
votes
2 answers

How to increase map tasks for MapReduce with bzip2 inputformat

I developed an MR job that works correctly with a text file, running multiple map tasks, but I also need to run the job on archives. My choice is a bzip2 archive. With such archives my job runs with only one map task. Does anyone know how I can increase map…
Tanya
  • 23
  • 1
  • 3
0
votes
3 answers

Way to delete the already decompressed portion of a file as it is being decompressed?

I'm wondering if there is a way to delete the already decompressed portion of a file as it is being decompressed. I've got an external backup of the compressed file so I'm not worried about losing data. The file is a bz2. I'm looking to do this…
keyneom
  • 815
  • 10
  • 12
0
votes
1 answer

Comparing bz2 files in unix

I manage a number of databases on unix servers, and do daily backups of these databases using mysqldump. Since (some of) these databases are very large (20+Gb), I usually zip the backup .sql files using bzip2, to get compressed bz2 files. As part of…
amaidment
  • 6,942
  • 5
  • 52
  • 88