A few months ago I used XFS formatted zram devices strung together with GlusterFS to create a distributed / networked / replicated in-memory filesystem on a few bare metal servers (running RHEL 7.2).
I'm using this FS as a performant way to store, serve and replicate images and videos for my multi-server application server. And I was unable to find any other in-memory FS solutions, so I hacked together this one.
It's been working well for 4 months, but last night one of the servers crashed because of an XFS corruption-- and I ended up having to do an OS Reload. I don't know for sure this setup was to blame.... but the odds are.
Which leads me to...
1) Are there any best practices I should follow to make this setup more stable?
2) Is there anyway I can (or even should) setup a logging system so that I can monitor each zram+xfs node's health ongoing? And know what went wrong if anymore crashes happen.
Some performance tests:
/dev/loop0 = https://erlhelinfotech.wordpress.com/2013/02/20/ramdisk-service-for-systemd/
/dev/zram0 = my zram setup
/dev/sdb2 = a standard 7200rpm disk