I have an application which is receiving data for thousands (say 50,000) subjects.
Each data tuple comprises a subjectId and text data.
I am looking for an embeddable Java database which will provide me the following functionality:
- Store the data quickly (thousands of tuples per second).
- Provide lookup of the textual data for a subjectId.
- (Provide efficient way to) delete data older than X days.
- Must be embedded in Java process
Ideally the Berkeley JE database seems to meet my requirements, except its a key-value DB and my data is inherently multivalued. I'm not sure if this will be a performance issue with duplicate data.
What other embeddable options exist for this simple tuple schema?