I know how to develop a simple inverted index on a single machine. In short it is a standard hash table kept in-memory where: - key - a word - value - a List of word locations As an example, the code is here: http://rosettacode.org/wiki/Inverted_Index#Java
Question:
Now I'm trying to make it distributed among n nodes and in turn:
- Make this index horizontally scalable
- Apply automatic sharding to this index.
I'm interested especially in automatic sharding. Any ideas or links are welcome!
Thanks.