I'm planning to use the EM algorithm from the Java Weka library to assign each object a probability of belonging to each cluster, and then work with those probabilities.
Furthermore, the properties of those objects will be loaded from a database, so I would like to pass them to the clusterer directly from memory instead of dumping them to an ARFF file first, as in the examples I have found around the web (e.g. Serialization).
Firstly, I would like to know whether Weka is the right library for this purpose, or whether another one, such as Apache Commons Math, would be a better fit.
Secondly, is there any example that creates Instances entirely in memory, without reading from or writing to a file?
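To make the second question concrete, here is a rough sketch of the kind of usage I have in mind. It assumes Weka 3.7 or later (where `DenseInstance` and the `ArrayList<Attribute>` constructor of `Instances` are available) and purely numeric attributes. The class name `InMemoryEmSketch`, the helper names, the attribute names and the sample values are all made up for illustration; the rows would really come from the database query.

```java
import java.util.ArrayList;

import weka.clusterers.EM;
import weka.core.Attribute;
import weka.core.DenseInstance;
import weka.core.Instances;

public class InMemoryEmSketch {

    // Builds a Weka dataset from plain arrays, without touching any file.
    // Assumes every row has the same length and holds numeric values only.
    static Instances toInstances(double[][] rows) {
        ArrayList<Attribute> attributes = new ArrayList<>();
        for (int i = 0; i < rows[0].length; i++) {
            attributes.add(new Attribute("attr" + i)); // numeric attribute
        }
        Instances data = new Instances("in-memory", attributes, rows.length);
        for (double[] row : rows) {
            data.add(new DenseInstance(1.0, row.clone())); // weight 1.0
        }
        return data;
    }

    // Fits EM with a fixed number of clusters and returns, for each row,
    // the estimated probability of membership in each cluster.
    static double[][] clusterProbabilities(double[][] rows, int numClusters)
            throws Exception {
        Instances data = toInstances(rows);
        EM em = new EM();
        em.setNumClusters(numClusters);
        em.buildClusterer(data);

        double[][] probabilities = new double[data.numInstances()][];
        for (int i = 0; i < data.numInstances(); i++) {
            probabilities[i] = em.distributionForInstance(data.instance(i));
        }
        return probabilities;
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical sample data standing in for rows loaded from a database.
        double[][] rows = {
            {1.0, 1.1}, {0.9, 1.0}, {1.2, 0.8},
            {8.0, 8.2}, {7.9, 8.1}, {8.3, 7.8}
        };
        double[][] probabilities = clusterProbabilities(rows, 2);
        for (double[] p : probabilities) {
            System.out.println(java.util.Arrays.toString(p));
        }
    }
}
```

If `setNumClusters` is left at its default of -1, EM tries to choose the number of clusters by cross-validation, which is why the sketch fixes it explicitly.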
I would be grateful for any help.