I am still researching on evaluating clusters formed using clustering (unsupervised learning)?
I tried googling but the measures I get are too theoretical. It will be great if people can share the mechanisms they are using to evaluate the clusters formed. Say I have a Java Cluster so that will contain Java EE, Java ME, RMI, JVM etc. ,another cluster say NoSQL and that will have something like Neo4j, OrientDB, CouchDB etc. This is perfect and my clustering Algorithm has given me most accurate clusters.
However after training and then testing I may get say MySQL, Oracle under NoSQL cluster so I just do a manual/visual interpretation and then re-train my Algorithm or tweak it so that I get better Clustering.
Now I want to automate this process of visualizing clusters manually and have a system that gives me the accuracy of clusters formed. I am looking out for something similar to Precision , Recall, NDCG, Map etc used in search. My clusters are varying in length and there can be n - different cluster formed so precision/recall would not be the right thing.