I have a backup box using ZFS (on Ubuntu 20.04 LTS) where I use rsnapshot to back up a number of websites. These websites have a fair amount of duplicated data.
I have just enabled de-duplication on the ZFS volume, but I am at a loss as to how to de-duplicate the existing data. As far as I understand, dedup only applies to blocks written after it is enabled, so the existing data would have to be rewritten. Because rsnapshot uses hard links between snapshots, I don't know how to rewrite that data to force deduplication to kick in without messing up the snapshots.
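To illustrate the hard-link concern, here is a minimal sketch, not tied to ZFS or rsnapshot themselves. It assumes the naive way to rewrite a file is "copy to a temporary file, then rename over the original". The file names and the function `rewrite_breaks_hard_link` are made up for the example; they only mimic two rsnapshot snapshot directories sharing one file.

```python
import os
import shutil
import tempfile


def rewrite_breaks_hard_link():
    """Return (linked_before, linked_after) for a copy-and-rename rewrite."""
    with tempfile.TemporaryDirectory() as root:
        # Hypothetical stand-ins for the same file in two rsnapshot snapshots.
        original = os.path.join(root, "daily.0_index.html")
        linked = os.path.join(root, "daily.1_index.html")

        with open(original, "w") as fh:
            fh.write("same content in both snapshots\n")

        # rsnapshot-style sharing: both names point at one inode.
        os.link(original, linked)
        linked_before = os.path.samefile(original, linked)

        # Naive rewrite: copy the data to a new file, then rename it
        # over the original name. The new file has its own inode.
        temp_copy = original + ".tmp"
        shutil.copy2(original, temp_copy)
        os.replace(temp_copy, original)

        linked_after = os.path.samefile(original, linked)
        return linked_before, linked_after


if __name__ == "__main__":
    before, after = rewrite_breaks_hard_link()
    print(f"hard-linked before rewrite: {before}")
    print(f"hard-linked after rewrite:  {after}")
```

After the rewrite the two names no longer share an inode, so each snapshot would hold its own copy of the file. That is the kind of breakage I want to avoid.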
Can anyone advise on a suitable method to de-duplicate my existing data?