I just started learning microstream. After going through the examples published to microstream github repository, I wanted to test its performance with an application that deals with more data.
Application source code is available here.
Instructions to run the application and the problems I faced are available here
To summarize, below are my observations
- While loading a file with 2.8+ million records, processing takes 5 minutes
- While calculating statistics based on loaded data, application fails with an
OutOfMemoryError
Why is microstream trying to load all data (4 GB) into memory? Am I doing something wrong?