I am attempting to build a model that will attempt to identify the interest category / topic of supplied text. For example:
Shop for Bridal Wedding Sarees from our exhausting variety of beautiful and designer sarees. Get great deals, quality stitching and Free International delivery.
would resolve to a top level category like:
Fashion or Wedding Fashion
To acheive this, I have used Latent Dirichlet allocation (LDA) which is a topic model that generates topics based on word frequency from a set of documents.
So I got topics of document as below but don't find way to map them to human understandable format
topic #0 (0.500): 0.100*sare + 0.060*intern + 0.060*get + 0.060*deal + 0.060*exhaust + 0.060*design + 0.060*free + 0.060*qualiti + 0.060*shop + 0.060*great
topic #1 (0.500): 0.063*sare + 0.063*beauti + 0.063*deliveri + 0.063*stitch + 0.063*varieti + 0.063*wed + 0.062*bridal + 0.062*great + 0.062*shop + 0.062*qualiti
I have used this script to implement above things.
So the Question is How to map above identified topics to human readable category like Fashion?