Every system admin knows that rsync is the way to go if you need to back up large files, e.g. a database file: because it uses a delta-transfer algorithm, it only sends the blocks that differ between the original file and the backup copy, which avoids a lot of overhead.
Yet for certain file formats, such as gzip and bzip2, modifying the file causes essentially all of the blocks to be copied again: changing even a couple of bytes of the input has a kind of butterfly effect on the compressed output, so most of what follows the change looks different.
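To make the effect concrete, here is a rough Python sketch, not a model of rsync itself. It assumes a made-up fixed-width "database dump" (`make_table`), a hypothetical block size of 512 bytes, and a simplified matcher (`literal_bytes`) that looks for copies of the old file's blocks at every byte offset of the new file. All of these names and numbers are mine, chosen only for illustration.

```python
import gzip
import random

BLOCK = 512  # hypothetical block size, chosen only for this illustration


def make_table(rows=2048, seed=0):
    """Build a reproducible fake dump: 26-byte rows of 'index,hex payload'."""
    rng = random.Random(seed)
    lines = [f"{i:08d},{rng.getrandbits(64):016x}\n" for i in range(rows)]
    return "".join(lines).encode("ascii")


def edit(data, offset=100, patch=b"XX"):
    """Overwrite a couple of bytes in place, keeping the length unchanged."""
    return data[:offset] + patch + data[offset + len(patch):]


def literal_bytes(old, new, block=BLOCK):
    """Count bytes of `new` not covered by any fixed-size block of `old`.

    Simplified stand-in for delta transfer: `old` is cut into blocks, and
    `new` is scanned at every byte offset for an exact copy of one of them.
    """
    known = {old[i:i + block] for i in range(0, len(old), block)}
    pos = literal = 0
    while pos < len(new):
        if new[pos:pos + block] in known:
            pos += block   # the receiver already has this block
        else:
            literal += 1   # this byte would have to be sent
            pos += 1
    return literal


def compare():
    """Return (bytes to resend, file size) for the raw and gzipped files."""
    old_raw = make_table()
    new_raw = edit(old_raw)
    # mtime=0 keeps the gzip header identical between the two runs.
    old_gz = gzip.compress(old_raw, mtime=0)
    new_gz = gzip.compress(new_raw, mtime=0)
    return {
        "raw": (literal_bytes(old_raw, new_raw), len(new_raw)),
        "gzip": (literal_bytes(old_gz, new_gz), len(new_gz)),
    }


if __name__ == "__main__":
    for name, (literal, total) in compare().items():
        print(f"{name}: {literal} of {total} bytes would be resent")
```

For the uncompressed file only the one block containing the edit has to be resent, while for the gzipped file the unmatched part should be a far larger share of the file. The exact gzip figure depends on the zlib build, so treat it as indicative only.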
So, which compressed file formats are the most rsync-friendly? Conversely, are there any formats that must be avoided when time is precious and there is 10K of data that must be backed up?