Questions tagged [polyglot]

Multilingual text (NLP) processing toolkit for Python

From the homepage:

Polyglot is a natural language pipeline that supports massive multilingual applications.

Features:

  • Tokenization (165 Languages)
  • Language detection (196 Languages)
  • Named Entity Recognition (40 Languages)
  • Part of Speech Tagging (16 Languages)
  • Sentiment Analysis (136 Languages)
  • Word Embeddings (137 Languages)
  • Morphological analysis (135 Languages)
  • Transliteration (69 Languages)
111 questions
1
vote
0 answers

Why polyglot does not return the un-transliterated text back?

eg = 'bajra bechna बाजरा बेचना' fin='' text = Text(eg) for x in text.transliterate("en"): fin = fin + ' ' + str(x) print(fin) output: बाजरा बेचना I am loosing the initial English text in the output. How can I get all of it in the output?
1
vote
0 answers

Polyglot >> error: input contains invalid UTF-8 around byte 35 (of 62)

My code is: from polyglot.text import Text def getPolyglotPolarity(text): return Text(text).polarity df['PolyPolarity']=df2.apply(getPolyglotPolarity) df2 is a panda data frame read from a huge csv file containing sentences from almost all…
1
vote
1 answer

Polyglot Processor with Local Dataflow Server

I have been trying to work with polyglot and build a simple python processor. I followed the polyglot recipe and I could not get the stream to deploy. I originally deployed the same processor that is used in the example and got the following…
1
vote
0 answers

How to run node.js project with yarn on graalvm?

Which steps should I take to migrate my current yarn project to run on graalvm? This doc describes a simple start but does not mention yarn, running tests, etc.
andreoss
  • 1,570
  • 1
  • 10
  • 25
1
vote
0 answers

Send data from Java to C++ Graalvm

I'm working with GraalVM coding in Java and C++ as polyglot code. I have this code to print in C++ with Java: printMessage.cpp C++ code: #include int imprimemensaje(std::string s); int main() { int imprimemensaje(std::string s); …
Antonio
  • 115
  • 6
1
vote
0 answers

What is the best Python library to get morphemes (including the stem) from a word?

I'm currently using the Polyglot library, however I would to be able to break down words like writing into write and ing. Polyglot is able to break down words like reading successfully into read and ing, but it fails when the stem is word written…
Dominique
  • 374
  • 4
  • 16
1
vote
3 answers

What are the best algorithms to determine the language of text and to correct typos in python?

I am looking for algorithms that could tell the language of the text to me(e.g. Hello - English, Bonjour - French, Servicio - Spanish) and also correct typos of the words in english. I have already explored Google's TextBlob, it is very relevant but…
1
vote
0 answers

Python Polyglot: How to prevent hyphens separating words which belong together

I'm trying to clean up sentences in order to create better word clouds, and I'm having an issue with hyphens splitting up words which belong together. An extreme case is the following where I am dropping all numbers. 2-Mics should be found in the…
Daniel F
  • 13,684
  • 11
  • 87
  • 116
1
vote
1 answer

Polyglot ubuntu not able to download language files

I have installed polyglot in ubuntu system using sudo apt-get install polyglot also installed dependencies sudo apt-get install -y libicu-dev python3-pip when i run polyglot command in console polyglot polyglot.ini download LANG:en gives me…
Rupesh Terase
  • 440
  • 3
  • 14
1
vote
2 answers

Polyglot does not detect multiple languages

I am testing the polyglot package in Python to detect the languages in a mixed languages document. I am not expecting from it the most accurate prediction but to start with the package does not return anything but one language as an answer even for…
Outcast
  • 4,967
  • 5
  • 44
  • 99
1
vote
1 answer

"python setup.py egg_info" failed with error code 1. How to fix this

How do I fix this. I cannot find any directory with "Temp\pip-install-7utykvpt\polyglot" C:\Windows\system32>pip install polyglot Collecting polyglot Using cached…
checkmate
  • 133
  • 1
  • 1
  • 9
1
vote
1 answer

How to Combine 2 Backend (PHP Laravel and Spring)

I am newbie for web programming, and I have 2 websites (still local/intranet). built with PHP Laravel built with Spring Boot What i have searched is (Polyglot, Microservice, API, Webservice) My Questions : How to Combine PHP Laravel and Spring…
1
vote
1 answer

SMIL: t:audio issue in internet explorer 8

I am trying to reanimate project made ten years ago. SMIL2 was successfully used in IE6. I have found that IE8 still supports SMIL, however nor t:media, nor t:audio elements do not play audio/media content any more. Here is simple example that works…
1
vote
0 answers

How to solve ImportError: cannot import name 'ProcessPoolExecutor'

I'm getting this error when trying to using polyglot library for the first time. from polyglot.text import Text, Word word = Text("Preprocessing is an essential step.").words[0] print(word.morphemes) here is the full error message. Traceback…
1
vote
1 answer

Rails 3.0.3 and Polyglot Gem problem

I hope you can help me. I had a rails app running locally, and also uploaded and running on Heroku. All was well. Last week, I started work again and ran into a snag. At the beginning of my day, I opened up Terminal and ran 'rails s' to start…
DougN
  • 337
  • 1
  • 2
  • 12