Questions tagged [part-of-speech]

Linguistic category of words

In grammar, a part of speech (also a word class, a lexical class, or a lexical category) is a linguistic category of words (or more precisely lexical items), which is generally defined by the syntactic or morphological behaviour of the lexical item in question.

From http://en.wikipedia.org/wiki/Parts_of_speech

194 questions
0
votes
0 answers

Problems setting up and using TreeTagger in R

I am trying to use the TreeTagger function in R for POS analysis (on a Mac laptop). However, I keep getting an error message when running the following code: tagged.text <- treetag(reviews, treetagger = "kRp.env", rm.sgml = TRUE, lang…
Ceri
  • 27
  • 5
0
votes
0 answers

How can I get rid of new lines between items in a list?

I'm having trouble carrying out named entity recognition using the natural language toolkit. Before I can carry out NER I have to carry out sentence segmentation, tokenisation and POS tagging. I've done so with the following code: def…
0
votes
3 answers

Tagging a .txt file from Inaugural Address Corpus

I'm having a hard time trying to figure this out. New to coding. I'm trying to read a .txt file, tokenize it, pos tag the words in it. Here's what I've got so far: import nltk from nltk import word_tokenize import re file =…
ArchivistG
  • 168
  • 1
  • 13
0
votes
1 answer

Where can I find a list of english part of speech constraints?

I'm looking for a list of English part of speech sequencing rules (e.g. "a determiner cannot be followed by a verb"). Thought it would be easy but I couldn't find an actual list of more than several examples. Any ideas? Thanks.
user1049052
  • 123
  • 1
  • 4
  • 10
0
votes
0 answers

Given two words and their corresponding pieces-of-speech, how to return them in the order that makes most grammatical sense?

I have an app that creates two randomly generated words. My goal is to return them in the order that makes the most sense grammatically. Getting their corresponding pieces-of-speech to each word should not be difficult, but my main question is to…
gongo
  • 1
0
votes
1 answer

How to write UD Pipe tagger output to file?

I have been using UD Pipe to train and tag data in the Hindi Language. I run the tagger using udpipe --tag model.output hi-ud-test.conllu which works perfectly fine and displays the output in command line. How do I write this output in a file?
singhuist
  • 302
  • 1
  • 6
  • 17
0
votes
0 answers

How to do POS tagging for Bigrams in Python

Firstly I must admit that I am a newbie to Python or R. Here I am trying to create a file with the list of bi-grams / 2-grams along with their POS tags (NN, VB, etc...). This is used to easily identify meaningful bi-grams and their POS tag…
JKC
  • 2,498
  • 6
  • 30
  • 56
0
votes
1 answer

Proper Noun detection in Acronyms with the POS Tagger

I'm working on a natural language search engine for Strasbourg's CDS. (Astronomical Data Center of Strasbourg) I was wondering how the Stanford Part-Of-Speech tagger was tagging acronyms, as acronym are sometimes tagged as NNP, and sometimes are…
0
votes
1 answer

Training Stanford POS tagger using multiple text files

I have a corpus of about 20000 text files and i want to train the tagger using these text files, which is better,to group these text files into one text file(i don't know if it will affect tagging accuracy or not) or to include all these text files…
Abdallah Sayed
  • 105
  • 1
  • 8
0
votes
0 answers

What tags does Google NL API use in its annotate syntax response?

I am using Google NL API's annotate syntax method and using Brat to visualize the response. Where can I find the exact list of "Tags" or "Labels" for words and dependency relations that are used in Google NL API. "partOfSpeech": { "tag":…
0
votes
1 answer

Python NLTK PoS Tag inaccurate

I've been trying to improve the POS tagger on the NLTK for a few days, but I cannot figure it out. Right now, the default tagger is really inaccurate and tags most words as 'NN'. How can I improve the tagger to make it more accurate? I've already…
Lemon
  • 19
  • 1
0
votes
1 answer

Python - How do you use the tags from pos_tag (NLTK)?

I have been trying to figure out how to use the 'tagged' results from part of speech tagging. Currently I have this test code: When I run it, it returns with this: This is all fine. But I want to be able to use this result and I don't know how to.…
Lemon
  • 19
  • 1
0
votes
2 answers

StanfordNLP POS giving mixed results

I was testing Stanford NLP POS Tagger, I am getting mixed results. SOP(StanfordNLP.getInstance().getPOSMap("WHEAT flour(whole)".toLowerCase())); SOP(StanfordNLP.getInstance().getPOSMap("Whole wheat flour".toLowerCase())); Gives me the following…
user2745862
  • 103
  • 3
  • 8
0
votes
1 answer

How to implement Part Of Speech tagging for a given String array in c#?

i need a simple English POS tagging program to be implemented in c# only.
0
votes
0 answers

Launching Minipar

I would like to use Minipar parser, the POS module and especially the pronoun resolution parser (coreferrence resolution). I use Windows 10, and I downloaded the GATE version containing Minipar : GATE_Developer_8.1. I followed the user manual but…
user6284868