Questions tagged [non-ascii-characters]

ASCII stands for 'American Standard Code for Information Interchange'. ASCII is a character-encoding scheme based on the ordering of the English alphabet. Since ASCII only contains definitions for 128 characters, numerous other encoding schemes have been created to include characters from other alphabets and other symbols.

1055 questions
8
votes
7 answers

Pandas DataFrame's accented characters appearing garbled in Excel

With: # -*- coding: utf-8 -*- at the top of my .ipynb, Jupyter is now displaying accented characters correctly. When I export to csv (with .to_csv()) a pandas data frame containing accented characters: ... the characters do not render properly…
Pyderman
  • 14,809
  • 13
  • 61
  • 106
8
votes
2 answers

Char (non ascii) in Matlab

I have three characters (bigger than 127) and I need to write it in a binary file. For some reason, MATLAB and PHP/Python tends to write a different characters. For Python, I have: s = chr(143)+chr(136); f = open('pythonOut.txt',…
8
votes
1 answer

Server implementation of RFC 2388 multipart POST conflict with RFC 2047?

I'm trying to implement RFC 2388 on a HTTP server to support multipart POST. I am looking at the specification specifically at the content-disposition's "name" parameter. Under section 3 of RFC 2388 it states: Field names originally in non-ASCII…
user986139
7
votes
2 answers

haskell: output non-ascii characters

I'd like to output non-ascii characters in WinGHCi, but this is what I get: Prelude> "δ" "\948" Prelude> putStr "\948" *** Exception: : hPutChar: invalid argument (character is not in the code page) I am using WinGHCi 7.0.3 on windows xp.…
Alex
  • 135
  • 7
7
votes
0 answers

Finding non-Ascii character

Possible Duplicate: How Do I grep For non-ASCII Characters in UNIX I'm struggling trying to find an answer to how I can find a non-ascii character in a very large file of xml data. I do not want to convert the non-ascii characters, I just want to…
Walinmichi
  • 385
  • 2
  • 5
  • 12
7
votes
5 answers

grep/regex can't find accented word

I'm trying mount a regex that get some words on a file where all letters of this word match with a word pattern. My problem is, the regex can't find accented words, but in my text file there are alot of accented words. My command line is: cat…
GodFather
  • 3,031
  • 4
  • 25
  • 36
7
votes
3 answers

Regex for accent insensitive replacement in python

In Python 3, I'd like to be able to use re.sub() in an "accent-insensitive" way, as we can do with the re.I flag for case-insensitive substitution. Could be something like a re.IGNOREACCENTS flag: original_text = "¿It's 80°C, I'm drinking a café in…
7
votes
3 answers

Powershell Invoke-RestMethod incorrect character

I'm using Invoke-RestMethod to get page names from an application I'm using. I notice that when I do a GET on the page it returns the page name like so This page â is working However the actual page name is This page – is working Here's how my…
JoeRod
  • 899
  • 8
  • 20
  • 30
7
votes
3 answers

Python encoding/decoding problems

How do I decode strings such as this one "weren\xe2\x80\x99t" back to the normal encoding. So this word is actually weren't and not "weren\xe2\x80\x99t"? For example: print "\xe2\x80\x9cThings" string = "\xe2\x80\x9cThings" print…
Brana
  • 1,197
  • 3
  • 17
  • 38
7
votes
1 answer

Accents in HTML5 datalist

In a form, I have a text input with a datalist. In the datalist, I have accented terms that won't appear in my text input. For example : If I type "a", terms with "ä" or "à" won't appear in suggestion... The best would be the whole to be accent…
VeZoul
  • 500
  • 6
  • 19
7
votes
3 answers

Unicode support in Web standard fonts

I need to decide whether to render geometric symbols in a web GUI (e.g. arrows and triangles for buttons, menus, etc.) as Unicode symbols (MUCH easier and color-independent) or GIF/PNG files (lots of hassle I would like to avoid). However, I have…
Pekka
  • 442,112
  • 142
  • 972
  • 1,088
7
votes
1 answer

How to match accented characters with a regex in Python?

I need the solutions to this question, except for Python! I've tried installing the regex library for Python, as apparently that enables the use of POSIX expressions in Python's regexes, but nevertheless I guess it does not include Unicode…
wrongusername
  • 18,564
  • 40
  • 130
  • 214
7
votes
2 answers

Python: block character will not print

In IDLE, print(chr(219)) (219's the block character) outputs "Û". Is there any way to get it to output the block character instead? This might actually be some sort of computer-wide problem, as I cannot seem to get the block character to print…
Name McChange
  • 2,750
  • 5
  • 27
  • 46
7
votes
1 answer

LINQ-to-Entities with 8-bit ASCII data

I have an interesting issue handling 8-bit "ASCII" characters in LINQ-to-Entities and am hoping someone can give me a tip. I have inherited a SQL Server 2000 database that has some pseudo-encrypted columns where they just XOR'd the string with 0xFF.…
jwh20
  • 646
  • 1
  • 5
  • 15
7
votes
2 answers

Report Lab can't handle hebrew (unicode)

I am trying to generate the pdf from following python programming but generated output doesn't display hebrew letters correctly # -*- coding: utf-8 -*- from reportlab.pdfgen import canvas def hello(c): c.drawString(100,100, "מה שלומך") c =…
user634615
  • 647
  • 7
  • 17