I am implementing a application that calculated the readability of Java files with the readability formula proposed by Posnett, Hindle and Devanbu (here).
The formula is: z = 8.87 - 0.033 * Volume + 0.40 * Lines - 1.5 * Entropy
They say that Entropy
is calculated from the counts of terms (tokens
or bytes) as well as the number of unique terms and bytes.
I made some research, but couldn't find a definition of a term in Java. The only thing that I found was this, that list some "useful Java terms", but I don't think that these are the only terms in Java.
So, what should I consider as Java terms? Can anyone give me an exaplanation?