I've run a cvs2git migration on a CVS repository that's over 2 GB. I wrote a script that traverses the new git repository and the CVS module to verify that the objects are the same. The text files migrate just fine and have the same sha1sum; however, ALL of the binary files have different sha1sums, even though they are all flagged as binary in CVS (-kb). Every other topic I've read about cvs2git and binary files blames the issue on the files not being flagged as binary (-kb), but that's not the case here. What else could be the problem?
The commands I run to do the migration are below:
./Python-2.7.3/python ./cvs2svn-trunk/cvs2git \
--blobfile=/path/to/git-blob.dat \
--dumpfile=/path/to/git-dump.dat \
--username=cvs2git \
/cvsroot/database
cd /gitroot; mkdir database; cd database; git init
cat /path/to/git-{blob,dump}.dat | git fast-import
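The verification described above can be sketched roughly as follows. This is not the actual script, only a minimal illustration of the comparison: the function name compare_trees is hypothetical, and it assumes a plain CVS checkout of the module and a checked-out git working tree, both on local disk, with sha1sum available.

```shell
# Hypothetical helper: compare the SHA-1 of every file in a CVS checkout
# against the file at the same relative path in a git working tree.
# $1 = CVS checkout directory (assumed), $2 = git working tree (assumed).
compare_trees() {
    cvs_dir=$1
    git_dir=$2
    # List files relative to the CVS checkout, skipping CVS/ metadata dirs.
    (cd "$cvs_dir" && find . -type f ! -path '*/CVS/*' -print) | LC_ALL=C sort |
    while IFS= read -r f; do
        if [ ! -f "$git_dir/$f" ]; then
            echo "MISSING ${f#./}"
            continue
        fi
        # Hash from stdin so the output contains only the digest and "-".
        a=$(sha1sum < "$cvs_dir/$f" | cut -d' ' -f1)
        b=$(sha1sum < "$git_dir/$f" | cut -d' ' -f1)
        [ "$a" = "$b" ] || echo "MISMATCH ${f#./}"
    done
}
```

Calling compare_trees with the two directories prints one MISSING or MISMATCH line per differing path and nothing for files whose digests agree. In my case every -kb file would show up as a MISMATCH.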