I am using the tf - IDF to rank terms in a document. When terms are arranged in descending order of the tf - IDF, top 'n' terms are most relevant to that document. When we choose a document, top 'n' terms of that document has to be displayed. My question is how to decide the value of 'n'? For example: for a document terms arranged in descending order of the tf - IDF is as follows:
Document 1
- president
- Obama
- Barak
- speech
- inauguration
- come
- the
- look
- again
- took
Now when I want to show topics for document 1, I need only top 5 terms, since all others are not relevant or not topics for the document. How do I decide this breaking point of terms in a document? Thanks in advance