Both MyRocks (MySql) and Cassandra uses LSM architecture to store their data. So I have populated around 5 million rows in MySql with MyRocks as storage engine and also in Cassandra. In Cassandra it takes only 1.7 GB of disk space while in MySql with MyRocks as storage engine, it takes 19 GB.
Am I missing something? Both use the same LSM mechanism. But why do they differ in data size?
Update:
I guess it has something to do with the text column. My Table Structure is (bigint,bigint,varchar,text).
- Rows populated: 300 000
- In MyRocks the data size 185MB
- In Cassandra - 13 MB.
But if I remove the text column then:
- MyRocks - 21.6 MB
- Cassandra - 11 MB
Any idea about this behaviour?