Questions tagged [language-detection]

Language detection or language identification is the task of identifying the language(s) in a fragment of text.

From Wikipedia:

In natural language processing, language identification or language guessing is the problem of determining which natural language given content is in. Computational approaches to this problem view it as a special case of text categorization, solved with various statistical methods.

...

One of the great bottlenecks of language identification systems is to distinguish between closely related languages. Similar languages like Serbian and Croatian or Indonesian and Malay present significant lexical and structural overlap, making it challenging for systems to discriminate between them.

http://corporavm.uni-koeln.de/vardial/sharedtask.html has input data and results from a recent competition (COLING 2014 VarDial workshop DSL task).
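
As a rough illustration of the text-categorization view described above, here is a minimal sketch that compares character-trigram counts using cosine similarity. The two training sentences, the function names, and the choice of trigrams are assumptions made for illustration only; real systems train on much larger corpora, and this toy version would not separate closely related languages.

```python
from collections import Counter
import math


def trigram_profile(text):
    """Count overlapping character trigrams in lowercased, space-padded text."""
    padded = " " + " ".join(text.lower().split()) + " "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def cosine(a, b):
    """Cosine similarity between two trigram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Toy training samples (hypothetical, far too small for real use).
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and this is the language of the text",
    "de": "der schnelle braune fuchs springt über den faulen hund und das ist die sprache des textes",
}
PROFILES = {lang: trigram_profile(sample) for lang, sample in SAMPLES.items()}


def detect_language(text):
    """Return the language whose trigram profile is most similar to the text."""
    query = trigram_profile(text)
    return max(PROFILES, key=lambda lang: cosine(query, PROFILES[lang]))
```

Calling `detect_language("this is the text")` picks the English profile here because far more of its trigrams occur in the English sample than in the German one.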

142 questions
0 votes, 1 answer

"Daily Limit Exceeded" error in python google.cloud (translate)

I want to DETECT the language of a text (not translate it) using the Google API. This does not require authentication and I don't need a key. The code is as simple as: from google.cloud import translate def detect_language(text): """Detects the…
Elm662
0 votes, 1 answer

window.navigator.userLanguage not found in IDE

I'm working on an Angular 2 app with Visual Studio Code (TypeScript) and I want to read the user language for translation. "navigator.language" works (at least for Chrome,...) but I can't code navigator.userLanguage for IE - my IDE tells me that this…
0 votes, 1 answer

Language detection using tesseract or abby ocr

I have images of documents containing either Arabic or English text. Is it possible to know which of these two languages an image contains?
saurabhiitr
0 votes, 2 answers

Language identification (Opera, Safari, Chrome) in PHP

I'm using $_SERVER["HTTP_ACCEPT_LANGUAGE"] to detect the browser language, which works fine for Firefox and IE: Firefox: de,en-us;q=0.9,en;q=0.7,ru;q=0.6,ro;q=0.4,hu;q=0.3,zh;q=0.1 Internet Explorer: de Unfortunately, it doesn't work for the…
MrG
0 votes, 0 answers

Problems with unicode, beautifulsoup, cld2, and python

I'm trying to detect the language of a number of pages in beautifulsoup/python. This is how I use beautiful soup to generate the text object... soup=BeautifulSoup(content,"html.parser") text=soup.findAll('body')[-1].text This produces a unicode…
neelshiv
0 votes, 0 answers

Java Detect language of Content from large String

I am working on a project where there are PDFs with content in English and Spanish. I am interested only in the English part and want to save it to a database. I am using Apache PDFBox to extract the text. How can I avoid Spanish…
Sunny Gupta
0 votes, 1 answer

Language detection not working in Typo3 6.2.12

I've a website which has 2 languages: German (de) as default and English (en) as translated. I am trying to implement language detection functionality in the website using the extension "rlmp_language_detection" version 7.0.0. What I want to…
Arun Chandran
0 votes, 1 answer

Language Detection in Solr for Nutch documents

How can I use Solr for language identification of documents obtained by crawling with Nutch? I installed Nutch 1.9 and Solr 4.8.1. I added a new core, named "core-test", to Solr by means of Core Admin in the Solr Admin page and I followed the steps…
eljane
0 votes, 2 answers

determination of human language from text:: system structure

I'm using these word lists. Right now I'm only thinking about German, Russian, English, and French. I guess what I'm going to do is put them all as part of a hashmap, one for each language with the word as the key, and a boolean as the value. When…
smatthewenglish
0 votes, 2 answers

java language detection LangDetectException

While working on language detection in Java, I tried to use the langdetect library, but I got this error when running: Exception in thread "main" com.cybozu.labs.langdetect.LangDetectException: need to load profiles. Could someone help me add a profile? I don't…
user3503994
0 votes, 1 answer

.htaccess for language detection, redirecting + clean urls

I'm not very familiar with .htaccess, but I managed to put the following file together. Sadly, it's not working. What it should do: - Detect if a user is French = redirect to example.com/fr - Detect if user is any other language = redirect to…
0 votes, 1 answer

chromium-compact-language-detector Django

I am using the chromium-compact-language-detector to detect language but it is unable to detect Japanese in the string. text = '1/15 HR Div.Q&CS Dept. 全体MTG 開催 1月15日(水)、赤溜オーディトリアムにてHR Div.Q&CS Dept.の全体MTGが開催されました。 ' cld.detect(smart_str(text),…
user1839132
0 votes, 2 answers

htaccess language detection returns in endless redirects - who finds the bug?

I have a survey community with 3 languages, using the WPML plugin for WordPress to handle the languages. Unfortunately, the language detection works only via PHP. I want to use htaccess, since it is faster and the user wouldn't notice a delay. The…
Boxhorn
0 votes, 1 answer

Language Detection on Mobile Devices

I am working on an application that needs to detect and switch languages. I have been looking for some best practices on how to properly do this on mobile devices. If there was a way to access the phone (iPhone, Android, Windows) and determine the…
0 votes, 1 answer

android auto-language detection

I tried this library, suggested in one of the posts on Stack Overflow. I've added the lib's jar to my build path, but I'm not able to initialize the DetectorFactory class with the languages' profiles. This is the class handling the detection, as…
Ranco