Questions tagged [chardet]

chardet is a python module for encoding detection

chardet is a python module for encoding detection.

See pypi project page.

36 questions
0
votes
1 answer

Java chardet that detects iso-8859-2

Is there a Java version of the python chardet that detects iso-8859-2? I've tried the Mozilla universalchardet and jchardet and neither worked, they both guessed windows-1252 but the python chardet that comes with Linux detected it just fine.
casolorz
  • 8,486
  • 19
  • 93
  • 200
0
votes
1 answer

juniversalchardet is defective on www.wikipedia.org

I'm trying to use juniversalchardet to auto-detect encoding of a saved webpage, my first test use www.wikipedia.org, which uses UTF-8 encoding according to HTTP response header (this information is lost after being saved to disk) This is my scala…
tribbloid
  • 4,026
  • 14
  • 64
  • 103
0
votes
1 answer

rchardet gem support for ISO-8859-1, and Windows-1252

I would like to know whether rchardet supports encoding for ISO-8859-1, and Windows-1252. I have seen the documentation but I didn't get proper info on this.
Kranthi
  • 1,377
  • 1
  • 17
  • 34
0
votes
1 answer

failure in installing chardet

I download a chardet module,placed it in d:\\and want it installed in python, so I use the cmd : c:\\Python27\python.exe d:\\chardet\setup.py the win command says that: Traceback (most recent call last): File "d:\\chardet\setup.py", line 13,…
ray
  • 71
  • 3
  • 11
0
votes
1 answer

How do I encode files to UTF-8 for Rails 3?

I've been working on outlook imports (linked in exports to outlook format) but I'm having troubles with encoding. The outlook format CSV I get from exporting my LinkedIn contacts are not in UTF-8. Letters like ñ cause an exception in the…
Riley Guerin
  • 305
  • 3
  • 11
0
votes
1 answer

Italian dected as iso-8859-2

I am using chardet to detect encoding of text files including Italian. The problem is it consistently detects their encoding as iso-8859-2 while the correct detection would be iso-8859-1. Does anybody know a fix? My local language is set to Polish?…
twowo
  • 621
  • 1
  • 8
  • 15
1 2
3