I'm working on a natural language search engine for Strasbourg's CDS. (Astronomical Data Center of Strasbourg)
I was wondering how the Stanford Part-Of-Speech tagger was tagging acronyms, as acronym are sometimes tagged as NNP, and sometimes are tagged just as a NN.
I wasn't able to find exactly how the programm is deciding on whether or not an acronym like "CDS" or "NASA" is a NNP or a NN.
If someone could help me on the subject, I'll be really glad. :)
Have a good day.