Questions tagged [part-of-speech]

Linguistic category of words

In grammar, a part of speech (also a word class, a lexical class, or a lexical category) is a linguistic category of words (or more precisely lexical items), which is generally defined by the syntactic or morphological behaviour of the lexical item in question.

From http://en.wikipedia.org/wiki/Parts_of_speech

194 questions
5
votes
0 answers

Case insensitive POS (Part of Speech) Tagger for SyntaxNet

I have tried Parsey McParseface, the pre-trained POS tagger that comes with SyntaxNet, and it does a good job of tagging sentences that have proper capitalization. I would like to tag sentences that are all lower case, like: i grew up in toronto…
5
votes
2 answers

Python: map NLTK Stanford POS tags to WordNet POS tags

I'm reading a list of sentences and tagging each word with NLTK's Stanford POS tagger. I get outputs like so: wordnet_sense = [] for o in output: a = st.tag(o) wordnet_sense.append(a) outputs: [[(u'feel', u'VB'), (u'great', u'JJ')],…
user47467
  • 1,045
  • 2
  • 17
  • 34
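For the tag-mapping question above, one possible approach is a prefix-based lookup from Penn Treebank tags (the tagset the excerpt's output appears to use) to WordNet's single-letter POS codes. This is a minimal sketch, not the accepted answer: the function name `penn_to_wordnet` and the `default` parameter are hypothetical, and the codes are written as plain strings so the snippet runs without NLTK installed.

```python
# WordNet uses single-letter POS codes; in NLTK they are exposed as
# wordnet.NOUN ('n'), wordnet.VERB ('v'), wordnet.ADJ ('a') and wordnet.ADV ('r').
NOUN, VERB, ADJ, ADV = "n", "v", "a", "r"


def penn_to_wordnet(tag, default=None):
    """Map a Penn Treebank tag to a WordNet POS code (hypothetical helper)."""
    if tag.startswith("NN"):   # NN, NNS, NNP, NNPS
        return NOUN
    if tag.startswith("VB"):   # VB, VBD, VBG, VBN, VBP, VBZ
        return VERB
    if tag.startswith("JJ"):   # JJ, JJR, JJS
        return ADJ
    if tag.startswith("RB"):   # RB, RBR, RBS
        return ADV
    return default             # no WordNet counterpart (e.g. DT, IN)


# Sample data shaped like the output shown in the excerpt.
tagged = [[("feel", "VB"), ("great", "JJ")]]
wordnet_sense = [[(word, penn_to_wordnet(tag)) for word, tag in sent] for sent in tagged]
print(wordnet_sense)
```

Tags without a WordNet counterpart fall through to `default`, so the caller decides whether to skip those words or fall back to a noun lookup.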
5
votes
2 answers

Transformation-Based Part-of-Speech Tagging (Brill Tagging)

What are the weaknesses and strengths of the Brill Tagger? Can you suggest some possible improvements for the tagger?
user239135
  • 141
  • 1
  • 3
5
votes
2 answers

Find adjectives related to noun input

I want to try and determine the characteristics of a user's personality based on the words they input into a search box. Here's an example: Search term: "computers" Personality/descriptors detected: analytical, logical, systematic, methodical I…
Jon
  • 3,154
  • 13
  • 53
  • 96
4
votes
1 answer

Part-of-speech tagger in PHP?

I am looking for a simple part-of-speech library or code that I can download. My criteria are that it must be simple to use and, if possible, free. Do you know of such a library?
Benjamin Crouzier
  • 40,265
  • 44
  • 171
  • 236
4
votes
1 answer

Identify Location Within the Sentence where the Missing Word Belongs

I have the code below: import nltk exampleArray = ['The dog barking'] def processLanguage(): for item in exampleArray: tokenized = nltk.word_tokenize(item) tagged = nltk.pos_tag(tokenized) …
alyssaeliyah
  • 2,214
  • 6
  • 33
  • 80
4
votes
2 answers

How to combine both word embeddings and pos embedding together to build the classifier

You know POS tags are like 'NP', 'VERB'. How can I combine these features with word2vec? Just like the following vectors? keyword V1 V2 V3 V4 V5 V6 corruption 0.07397 0.290874 -0.170812 0.085428 …
Wei Chen
  • 51
  • 1
  • 4
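For the embedding question above, one common option is to append a one-hot POS vector to each word vector before passing it to the classifier. The sketch below assumes that approach; it is not a learned POS embedding. The tag inventory `POS_TAGS` and the helpers `one_hot` and `combine` are hypothetical, and the word vector reuses the first four values shown in the excerpt.

```python
# Hypothetical coarse tag inventory; a real project would list every tag it uses.
POS_TAGS = ["NOUN", "VERB", "ADJ", "ADV"]


def one_hot(tag, tags=POS_TAGS):
    """Return a vector with 1.0 at the tag's position and 0.0 elsewhere."""
    return [1.0 if tag == t else 0.0 for t in tags]


def combine(word_vec, tag):
    """Concatenate a word vector with the one-hot encoding of its POS tag."""
    return list(word_vec) + one_hot(tag)


# First four values of the "corruption" row from the excerpt.
word_vec = [0.07397, 0.290874, -0.170812, 0.085428]
features = combine(word_vec, "NOUN")
print(len(features), features)
```

A tag outside the inventory yields an all-zero POS segment, which keeps the feature length fixed.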
4
votes
2 answers

Where can I get training data for a part-of-speech tagger?

I want to implement a part-of-speech tagger, but I don't know where I can get a lot of training data. Thanks!
tianzhi0549
  • 479
  • 2
  • 5
  • 12
4
votes
1 answer

How to get Coarse-grained Part of Speech Tags?

I have a data set which is annotated by the Collins parser. Right now, I am keeping the POS of each word in the data set as a feature. The problem is that I don't need fine-grained POS. So, I have combined some of the tags. For example, I assume all…
user1419243
  • 1,655
  • 3
  • 19
  • 33
4
votes
2 answers

How to extract line numbers that match a regular expression in a text file

I'm doing a project on statistical machine translation in which I need to extract line numbers from a POS-tagged text file that match a regular expression (any non-separated phrasal verb with the particle 'out'), and write the line numbers to a file…
user2468610
  • 55
  • 1
  • 1
  • 5
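For the line-number question above, a rough sketch using only the standard library follows. It assumes one sentence per line and tokens written as `word_TAG` with Penn Treebank tags; the excerpt does not show the actual file format, so the pattern, the sample lines and the helper name `matching_line_numbers` are all hypothetical.

```python
import re

# Assumed token layout: word_TAG. The pattern looks for a verb token
# (VB, VBD, VBG, VBN, VBP, VBZ) immediately followed by the particle "out".
PATTERN = re.compile(r"\b\w+_VB[DGNPZ]?\s+out_RP\b")


def matching_line_numbers(lines):
    """Return the 1-based numbers of the lines that contain a match."""
    return [n for n, line in enumerate(lines, start=1) if PATTERN.search(line)]


# Made-up sample lines for illustration only.
sample = [
    "He_PRP found_VBD out_RP the_DT truth_NN",
    "She_PRP went_VBD home_NN",
    "They_PRP threw_VBD it_PRP out_RP",
]
print(matching_line_numbers(sample))
```

Because an open text file iterates line by line, the file object can be passed to the function directly, and the resulting numbers can then be written out one per line.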
3
votes
2 answers

Part-Of-Speech tagging and Named Entity Recognition for C/C++/Obj-C

Need some help! I'm trying to write some code in Objective-C that requires part-of-speech tagging, and ideally also named entity recognition. I don't have much interest in "rolling my own", so I'm looking for a decent library to use for this…
DanM
  • 7,037
  • 11
  • 51
  • 86
3
votes
2 answers

Spacy get pos & tag for specific word

I came across a situation where I have to get the pos_ & tag_ from spaCy Doc objects. For example, text = "Australian striker John hits century" doc = nlp(text) for nc in doc.noun_chunks: print(nc) #Australian striker John doc[1].tag_ # gives…
Wickkiey
  • 4,446
  • 2
  • 39
  • 46
3
votes
1 answer

spaCy NLP word.pos returns digits instead of POS tags

I am using the spaCy library for POS tagging, but when I run this code, it returns numbers in place of the POS tags: import spacy from spacy.lang.fr.examples import sentences nlp = spacy.load('en') mystring = " I am missing my lovely family a…
Timat
  • 43
  • 4
3
votes
2 answers

How do I loop over several files, keeping the base name for further processing?

I have multiple text files that need to be tokenised, POS-tagged and NER-tagged. I am using the C&C taggers and have run their tutorial, but I am wondering if there is a way to tag multiple files rather than one by one. At the moment I am tokenising the…
3
votes
2 answers

Why is NLTK's PoS tagger tagging for each letter in a word instead of tagging for each word?

Say I have this sentence: I am a boy. I want to find out the Part of Speech of each word in the sentence. This is my code: import nltk sentence = 'I am a good boy' for word in sentence: print(word) print(nltk.pos_tag(word)) But this…
Kristada673
  • 3,512
  • 6
  • 39
  • 93
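The behaviour described in the question above follows from how Python iterates over strings: looping over a `str` yields single characters, so each "word" handed to the tagger is one letter. The sketch below shows the difference using only the standard library; the NLTK call appears as a comment so the snippet runs without NLTK installed.

```python
sentence = "I am a good boy"

# Iterating over a str yields single characters, not words.
chars = [item for item in sentence]

# Splitting first yields word tokens; nltk.word_tokenize(sentence) is the
# NLTK counterpart and also separates punctuation.
tokens = sentence.split()

print(chars[:4])
print(tokens)

# With NLTK installed, the whole token list is tagged in one call:
# tagged = nltk.pos_tag(tokens)
```

`nltk.pos_tag` expects a list of tokens, so passing a single string makes it treat every character as a separate token.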