Questions tagged [part-of-speech]

Linguistic category of words

In grammar, a part of speech (also a word class, a lexical class, or a lexical category) is a linguistic category of words (or more precisely lexical items), which is generally defined by the syntactic or morphological behaviour of the lexical item in question.

From http://en.wikipedia.org/wiki/Parts_of_speech

194 questions
0
votes
1 answer

Extracting the Strings from a Chunk

I am using NLTK POS tagging to extract information from a text; in this example I am looking for an IBAN. For some texts the code returns more than one chunk, but I don't mind that; I will pick out the correct one later with a regex. Now here is my…
user11491965
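
A minimal sketch of the idea in this question, assuming each chunk is available as a list of (word, tag) pairs (with NLTK, calling `.leaves()` on a chunk subtree of tagged tokens should give such a list). The names `chunk_to_string` and `find_ibans` are hypothetical, and the pattern only checks the rough shape of an IBAN written without spaces; it does not validate the checksum.

```python
import re

# Simplified IBAN shape (an assumption for illustration):
# 2-letter country code, 2 check digits, then 11 to 30 alphanumerics.
IBAN_PATTERN = re.compile(r"\b[A-Z]{2}\d{2}[A-Z0-9]{11,30}\b")


def chunk_to_string(leaves):
    """Join the words of one chunk given as (word, tag) pairs."""
    return " ".join(word for word, _tag in leaves)


def find_ibans(chunks):
    """Return every IBAN-shaped token found in any of the chunks."""
    found = []
    for leaves in chunks:
        found.extend(IBAN_PATTERN.findall(chunk_to_string(leaves)))
    return found


chunks = [
    [("IBAN", "NNP"), ("DE89370400440532013000", "NNP")],
    [("Hello", "NNP"), ("World", "NNP")],
]
ibans = find_ibans(chunks)
```

Chunks that contain no IBAN-shaped token simply contribute nothing, so getting more than one chunk back from the chunker is harmless here.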
0
votes
1 answer

How to get a list of parts of speech and words from Princeton English WordNet?

I'd like a complete list of parts of speech (e.g., adj., adv., and v.) for English words. All I need is a TSV table with two columns: the word in the first column and its POS in the second. I know that WordNet should contain such information.…
user1424739
0
votes
1 answer

How can I get a pos tagged file as output when I give a text file as input?

This is the code I am trying, but it generates an error. import nltk from nltk.corpus import stopwords from nltk.tokenize import word_tokenize, sent_tokenize stop_words = set(stopwords.words('english')) file_content =…
0
votes
1 answer

How to do selective preposition tagging in NLTK

NLTK preposition tagging covers many words, such as in, at, of, as, by, during, etc. I want to tag only selected words (in, at, per) as 'IN'. The rest of the prepositions should be tagged differently, say 'ZZ', meaning these can be deleted…
nkrishna
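
One way to get this effect, sketched here as a post-processing step rather than a change to the tagger itself: take the (word, tag) pairs the tagger produced and rewrite the 'IN' tag for every word outside a small allow-list. The function name `retag_prepositions` and its parameters are hypothetical; only tokens that were already tagged 'IN' are touched.

```python
# Words that should keep the 'IN' tag (taken from the question).
KEEP_AS_IN = {"in", "at", "per"}


def retag_prepositions(tagged, keep=KEEP_AS_IN, other_tag="ZZ"):
    """Replace 'IN' with other_tag for every word not in the keep set."""
    result = []
    for word, tag in tagged:
        if tag == "IN" and word.lower() not in keep:
            result.append((word, other_tag))
        else:
            result.append((word, tag))
    return result


tagged = [
    ("Sales", "NNS"), ("rose", "VBD"), ("in", "IN"), ("May", "NNP"),
    ("by", "IN"), ("5", "CD"), ("percent", "NN"), ("per", "IN"), ("share", "NN"),
]
retagged = retag_prepositions(tagged)
```

In this example only "by" changes to 'ZZ'; "in" and "per" keep 'IN', and all other tags pass through unchanged.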
0
votes
0 answers

Open POS-tagged .txt file with tags in Python

I'm trying to POS-tag a bunch of interviews in German. I've been getting great results with the "ClassifierBasedGermanTagger" using the Tiger-Corpus. I saved the tagged results as .txt files, but due to some issues regarding the quality of the…
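
The excerpt does not say how the tags were written to the .txt files. As a sketch under the assumption that every token was saved as `word/TAG` separated by whitespace, the pairs could be read back like this (`parse_tagged_text` is a hypothetical helper; NLTK's `nltk.tag.str2tuple` offers a similar conversion for a single token):

```python
def parse_tagged_text(text, sep="/"):
    """Turn 'word/TAG word/TAG ...' into a list of (word, tag) pairs.

    Splits on the LAST separator so words that contain the separator
    themselves still parse. Tokens without a separator get tag None.
    """
    pairs = []
    for token in text.split():
        word, found, tag = token.rpartition(sep)
        if found:
            pairs.append((word, tag))
        else:
            pairs.append((token, None))
    return pairs


sample = "Ich/PPER gehe/VVFIN nach/APPR Hause/NN ./$."
pairs = parse_tagged_text(sample)
```

For a real file, the same function can be applied to the result of `open(path, encoding="utf-8").read()`; a different separator would only need a different `sep` argument.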
0
votes
1 answer

How to remove an entire line if it does not have a pos tag like CD?

I am reading a news article and POS-tagging it with NLTK. I want to remove those lines that do not have a POS tag like CD (numbers). import io import nltk from nltk.corpus import stopwords from nltk.tokenize import word_tokenize from nltk import…
nkrishna
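
The filtering step described here can be sketched independently of the tagger: tag each line, then keep the line only if the wanted tag occurs in it. The function `keep_lines_with_tag` is hypothetical, and `toy_tagger` is a stand-in used only so the example runs without NLTK; it marks number-like tokens as CD and everything else as NN.

```python
import re


def keep_lines_with_tag(lines, tagger, wanted_tag="CD"):
    """Keep only the lines in which the tagger produces wanted_tag."""
    kept = []
    for line in lines:
        tags = {tag for _word, tag in tagger(line)}
        if wanted_tag in tags:
            kept.append(line)
    return kept


def toy_tagger(line):
    """Stand-in tagger (an assumption): number-like tokens are CD, the rest NN."""
    return [
        (tok, "CD" if re.fullmatch(r"\d[\d.,]*", tok) else "NN")
        for tok in line.split()
    ]


lines = [
    "Profit rose 12 percent",
    "No figures here",
    "Revenue was 3,400 units",
]
kept = keep_lines_with_tag(lines, toy_tagger)
```

With NLTK installed and its models downloaded, the tagger argument could instead be something like `lambda line: nltk.pos_tag(nltk.word_tokenize(line))`.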
0
votes
0 answers

treetagger module returns empty list

I made a sentiment analysis program with treetagger. It worked fine two weeks ago, but now it doesn't work properly. After that I used treetagger in a very simple program which returns the tagging of "hello world". Again, it doesn't work properly. I…
nbsas
0
votes
1 answer

Python Pandas: NLTK Part of Speech Tagging for Entire Column in Dataframe

I have the following sample data frame shown below. It has been tokenized already. No category problem_definition_stopwords 175 2521 ['coffee', 'maker', 'brewing', 'properly', '2', '420', '420', '420'] 211 1438 ['galley', 'work',…
PineNuts0
0
votes
1 answer

Invalid parameter file in Treetaggerwrapper python

I have installed TreeTaggerwrapper for python through PyPI. I have placed the treetaggerwrapper.py and treetaggerpoll.py in the Treetagger directory. I have also placed the english.par file in the 'lib' sub directory. When I initiate the tagger…
shan
0
votes
1 answer

Rule based named entity recognizer without parts of speech label or any other information

I'm working on a project where I am trying to build a named entity recognizer from texts. So basically I want to build and experiment with the NER in 3 different ways. First, I want to build it using only segmented sentences -> tokenized words. To…
0
votes
1 answer

Querying part-of-speech tags with Lucene 7 OpenNLP

For fun and learning I am trying to build a part-of-speech (POS) tagger with OpenNLP and Lucene 7.4. The goal would be that, once indexed, I can actually search for a sequence of POS tags and find all sentences that match that sequence. I already get the…
0
votes
1 answer

Stanford CoreNLP POS tagging in French

I am looking for a way to use POS tagging for French sentences with Python. I saw that we could use Stanford CoreNLP, but after several searches on Google I did not find real examples that could satisfy me. It would be great to have a piece of…
ElData
0
votes
1 answer

POS tagger for Latin in Python

Any idea if there are any more POS taggers for Latin apart from CLTK available for Python or any other language? I have tried the CLTK POS taggers, but they are not giving me very accurate results for my corpus.
gannina
0
votes
2 answers

Keras: add features after embedding

I want to add part-of-speech features to my word vectors after the embedding in Keras. I would like to add them as one-hot vectors and concatenate them after the embedding. But the part of speech of a word is dynamic, so I can't use another embedding layer for part of…
Zedom
0
votes
1 answer

Stanford NLP, Error while loading a tagger model, while reading models from path

I am using Stanford NLP 3.8.0 for a project at work. I have read a lot of questions about my problem, on Stack Overflow and on other sites, but I still haven't found a solution, and there was no situation like mine in any of the places where I was…