Questions tagged [udpipe]

UDPipe comprises a free C++ library and a binary executable for Natural Language Processing (NLP).

UDPipe is a free C++ library for Natural Language Processing (NLP). UDPipe can do tokenization, parts-of-speech tagging, lemmatization and dependency parsing of raw text.

Binaries for Windows/Linux/OS X are also available, and there exist a web service and a REST API.

For details see http://ufal.mff.cuni.cz/udpipe and https://github.com/ufal/udpipe .

37 questions
0
votes
1 answer

how to keep hashtags and their words as a single token

How to change the default setting in case I would like to keep the hashtag symbol and its word intact ( i.e. #company and not # and company) x_mod <- udpipe_load_model("D:/Users/asongara/Documents/english-ewt-ud-2.3-181115.udpipe") ud_model <-…
0
votes
1 answer

For loop when extracting keywords with udpipe in R

Let's start with a reproducible example, which is a data frame called key composed by 8 columns and 3 rows: key <- structure(c("Make Professional Maps with QGIS and Inkscape", "Gain the skills to produce original, professional, and aesthetically…
antecessor
  • 2,688
  • 6
  • 29
  • 61
0
votes
2 answers

Extracting keywords in each row of a data frame using udpipe in R

I am using the R package udpipe to extract keywords in my data frame. Let's start with some data contained in the package: library(udpipe) data(brussels_reviews) If we look at the structure, we see it contains 1500 comments (rows) and 4…
antecessor
  • 2,688
  • 6
  • 29
  • 61
0
votes
1 answer

Using content_transformer with udpipe_annotate

so I just found out that udpipe has an awesome way of showing correlations, so I started working on it. The code from this site works perfect if I use it on the csv file after importing it and don't make any changes on it. But my problem occurs as…
Robin
  • 198
  • 1
  • 1
  • 11
0
votes
1 answer

Text Similarity using PoS tag

I want to calculate text similarity by using only the words of a specific POS tag. Currently I am calculating similarity using cosine method but it does not take into account POS tagging. A <- data.frame(name = c( "X-ray right leg arteries", …
john
  • 1,026
  • 8
  • 19
0
votes
1 answer

udpipe_accuracy() always gives the same error " The CoNLL-U line '....' does not contain 10 columns!"

This is regarding the R package udpipe for NLP. I am using it to tokenize, tag, lemmatize and perform dependency parsing on text files. I am not sure which template the conllu file is needed for the function udpipe_accuracy I loaded a CSV file of…
Lazarus Thurston
  • 1,197
  • 15
  • 33
-1
votes
1 answer

Running sentiment analysis for google news headlines faced error while using udpipe

Here is my code so far pacman::p_load(dplyr, ggplot2, stringr, udpipe, lattice) gnewsheadlines <- read.csv(file.choose(), stringsAsFactors = F) udmodel_english <- udpipe_load_model(file =…
Param
  • 11
1 2
3