I've been working on performing Latent Semantic Analysis using the SparseVectorsFromSequenceFiles
, RowIdJob
and RowSimilarityJob
Hadoop jobs provided by Mahout, which run Map/Reduce jobs. I've been trying to find an equivalent implementation for these functionality that runs in memory, either in a single thread, or preferably in multiple threads.
Is there such a thing?