I have a Google Compute Instance (VM) that has a 2TB disk and around 80GB used space. I wanted to archive this VM so that I don't get billed for the whole 2TB, and also so that it is ready to be recreated quickly if needed. Disk Snapshots seemed to be the best option since it is mentioned that I only get billed for the disk space used in that case. But when I try this, the snapshot size I get is around 600GB, almost 10 times the used space, but still less than the full 2TB.
I tried defragmenting the disk but that didn't help. I also tried using "zerofree" to write 0's to unused space, and that reduced the snapshot size to 20GB - 4x lower than the used space. However zerofree takes a lot of effort and time to run, but I'm guessing it is helping with the compression of the disk.
Is there a better way to improve disk compression efficiency in this case? Maybe any crucial step that I am missing while generating the disk snapshot?
NOTE: I also tried Machine Images but that seems to use disk snapshots under the hood, and they cost more for some reason.