ASCII stands for 'American Standard Code for Information Interchange'. It is a character-encoding scheme based on the ordering of the English alphabet. Because ASCII defines only 128 characters, numerous other encoding schemes have been created to cover characters from other alphabets and additional symbols.
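As a rough illustration of the 128-character limit mentioned in the description, the Python sketch below separates ASCII from non-ASCII characters by code point. The helper name `non_ascii_chars` is hypothetical and not part of any library.

```python
def non_ascii_chars(text):
    """Return the characters whose code point lies outside the ASCII range 0-127."""
    return [ch for ch in text if ord(ch) > 127]

sample = "caf\u00e9"                 # "café", with the accent written as an escape
outside = non_ascii_chars(sample)    # only the accented letter falls outside ASCII
is_plain = "plain".isascii()         # str.isascii() is available from Python 3.7
```

Encoding such text with `sample.encode("ascii")` raises `UnicodeEncodeError`, which is why the other encodings mentioned in the description exist.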
Questions tagged [non-ascii-characters]
1055 questions
8
votes
7 answers
Pandas DataFrame's accented characters appearing garbled in Excel
With:
# -*- coding: utf-8 -*-
at the top of my .ipynb, Jupyter is now displaying accented characters correctly.
When I export to csv (with .to_csv()) a pandas data frame containing accented characters:
... the characters do not render properly…

Pyderman
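One commonly suggested explanation for this symptom is that Excel does not assume UTF-8 when a CSV file has no byte order mark (BOM). The sketch below uses only the standard library to write a CSV with the `utf-8-sig` codec, which prepends a BOM. The file name and rows are made up for illustration, and since the excerpt is truncated it is an assumption that this matches the asker's situation. pandas' `to_csv` also accepts an `encoding` argument, so `encoding="utf-8-sig"` would be the analogous setting there.

```python
import csv
import os
import tempfile

def write_csv_with_bom(path, rows):
    """Write rows as UTF-8 with a leading BOM (bytes EF BB BF)."""
    with open(path, "w", encoding="utf-8-sig", newline="") as f:
        csv.writer(f).writerows(rows)

# Hypothetical output location and data, for illustration only.
path = os.path.join(tempfile.mkdtemp(), "accents.csv")
write_csv_with_bom(path, [["name"], ["Jos\u00e9"], ["Zo\u00eb"]])

with open(path, "rb") as f:
    raw = f.read()   # raw bytes, so the BOM can be inspected directly
```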
8
votes
2 answers
Char (non ascii) in Matlab
I have three characters (with values greater than 127) and I need to write them to a binary file.
For some reason, MATLAB and PHP/Python tend to write different characters.
For Python, I have:
s = chr(143)+chr(136);
f = open('pythonOut.txt',…

user1983388
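A possible source of such differences, assuming Python 3, is that `chr(143)` produces a text character rather than a byte, so writing it through a text-mode file encodes it. The sketch below writes the two values from the excerpt as raw bytes instead. The output path is hypothetical, and the excerpt is truncated, so the asker's actual code may differ.

```python
import os
import tempfile

payload = bytes([143, 136])          # the two values from the excerpt, as raw bytes

# Hypothetical output path, for illustration only.
path = os.path.join(tempfile.mkdtemp(), "pythonOut.bin")
with open(path, "wb") as f:          # "wb": binary mode, no text encoding applied
    f.write(payload)

with open(path, "rb") as f:
    written = f.read()

# For comparison: the same code points encoded as UTF-8 text take two bytes each.
as_utf8 = (chr(143) + chr(136)).encode("utf-8")
```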
8
votes
1 answer
Server implementation of RFC 2388 multipart POST conflict with RFC 2047?
I'm trying to implement RFC 2388 on an HTTP server to support multipart POST.
I am looking specifically at the content-disposition "name" parameter in the specification.
Under section 3 of RFC 2388 it states:
Field names originally in non-ASCII…
user986139
7
votes
2 answers
haskell: output non-ascii characters
I'd like to output non-ascii characters in WinGHCi, but this is what I get:
Prelude> "δ"
"\948"
Prelude> putStr "\948"
*** Exception: <stdout>: hPutChar: invalid argument (character is not in the code page)
I am using WinGHCi 7.0.3 on Windows XP.…

Alex
7
votes
0 answers
Finding non-Ascii character
Possible Duplicate:
How Do I grep For non-ASCII Characters in UNIX
I'm struggling to work out how to find a non-ASCII character in a very large file of XML data. I do not want to convert the non-ASCII characters, I just want to…

Walinmichi
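One way to locate such characters without converting them, sketched in Python under the assumption that the file can be read line by line as text, is to scan each line with a character class that excludes the ASCII range. The function name `find_non_ascii` and the sample lines are hypothetical.

```python
import re

# Matches any single character outside the ASCII range 0x00-0x7F.
NON_ASCII = re.compile(r"[^\x00-\x7F]")

def find_non_ascii(lines):
    """Yield (line_number, column, character), both 1-based, for each non-ASCII character."""
    for line_no, line in enumerate(lines, start=1):
        for match in NON_ASCII.finditer(line):
            yield line_no, match.start() + 1, match.group()

# Made-up sample standing in for lines of a large XML file.
sample = ["<a>plain</a>\n", "<b>caf\u00e9</b>\n"]
hits = list(find_non_ascii(sample))
```

Because the function is a generator over lines, an open file object (opened with the file's actual encoding) can be passed in place of `sample`, so the whole file never has to be held in memory.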
7
votes
5 answers
grep/regex can't find accented word
I'm trying to build a regex that picks out words from a file where every letter of the word matches a word pattern.
My problem is that the regex can't find accented words, but my text file contains a lot of accented words.
My command line is:
cat…

GodFather
7
votes
3 answers
Regex for accent insensitive replacement in python
In Python 3, I'd like to be able to use re.sub() in an "accent-insensitive" way, as we can do with the re.I flag for case-insensitive substitution.
Could be something like a re.IGNOREACCENTS flag:
original_text = "¿It's 80°C, I'm drinking a café in…

Antoine Dusséaux
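Python's `re` module has no accent-insensitive flag, so one workaround is to remove accents from both the pattern and the text before substituting. The sketch below does this with `unicodedata`; the helper names `strip_accents` and `sub_ignoring_accents` are hypothetical, and the sample sentence is a shortened stand-in for the truncated one in the excerpt.

```python
import re
import unicodedata

def strip_accents(text):
    """Decompose characters (NFD) and drop the combining marks that carry the accents."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

def sub_ignoring_accents(pattern, repl, text, flags=0):
    """Run re.sub on accent-stripped copies of the pattern and the text."""
    return re.sub(strip_accents(pattern), repl, strip_accents(text), flags=flags)

result = sub_ignoring_accents("cafe", "tea", "I'm drinking a caf\u00e9")
```

A limitation of this simple approach is that the returned string has lost its accents everywhere, not just in the replaced part; preserving them would need a mapping from stripped positions back to the original text.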
7
votes
3 answers
Powershell Invoke-RestMethod incorrect character
I'm using Invoke-RestMethod to get page names from an application I'm using. I notice that when I do a GET on the page it returns the page name like so
This page â is working
However the actual page name is
This page – is working
Here's how my…

JoeRod
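The symptom in the excerpt is consistent with UTF-8 bytes being decoded as ISO-8859-1 (Latin-1), although whether that is what happens in the asker's request is an assumption, since the excerpt is truncated. The Python sketch below reproduces the effect for the en dash and shows that it can be reversed when no bytes were lost.

```python
original = "This page \u2013 is working"     # \u2013 is the en dash

# Decode the UTF-8 bytes with the wrong codec: the dash's three bytes
# (E2 80 93) become "â" followed by two invisible control characters.
garbled = original.encode("utf-8").decode("latin-1")

# Reverse the mistake: recover the bytes, then decode them as UTF-8.
repaired = garbled.encode("latin-1").decode("utf-8")
```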
7
votes
3 answers
Python encoding/decoding problems
How do I decode strings such as "weren\xe2\x80\x99t" back to the normal encoding, so that the word reads weren't and not "weren\xe2\x80\x99t"?
For example:
print "\xe2\x80\x9cThings"
string = "\xe2\x80\x9cThings"
print…

Brana
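The escape sequences in the excerpt are UTF-8 byte values. The excerpt uses Python 2 syntax; the sketch below assumes Python 3, where such data is a `bytes` object and `decode("utf-8")` turns it into text.

```python
raw_word = b"weren\xe2\x80\x99t"       # bytes E2 80 99 encode a right single quotation mark
raw_quote = b"\xe2\x80\x9cThings"      # bytes E2 80 9C encode a left double quotation mark

word = raw_word.decode("utf-8")        # text with a typographic apostrophe
quote = raw_quote.decode("utf-8")      # text starting with a typographic opening quote
```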
7
votes
1 answer
Accents in HTML5 datalist
In a form, I have a text input with a datalist.
In the datalist, I have accented terms that won't appear in my text input. For example:
If I type "a", terms with "ä" or "à" won't appear in the suggestions...
The best would be the whole to be accent…

VeZoul
7
votes
3 answers
Unicode support in Web standard fonts
I need to decide whether to render geometric symbols in a web GUI (e.g. arrows and triangles for buttons, menus, etc.) as Unicode symbols (MUCH easier and color-independent) or GIF/PNG files (lots of hassle I would like to avoid).
However, I have…

Pekka
7
votes
1 answer
How to match accented characters with a regex in Python?
I need the solutions to this question, except for Python! I've tried installing the regex library for Python, as apparently that enables the use of POSIX expressions in Python's regexes, but nevertheless I guess it does not include Unicode…

wrongusername
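In Python 3 the built-in `re` module treats `\w` as Unicode-aware for `str` patterns, so accented letters can be matched without POSIX classes. The sketch below builds a letters-only class from `\w` by excluding digits and the underscore; the sample text is made up, and the asker's Python version is not stated in the excerpt.

```python
import re

# [^\W\d_] reads as "a word character that is neither a digit nor an underscore",
# which for str patterns in Python 3 includes accented letters.
LETTERS = re.compile(r"[^\W\d_]+")

words = LETTERS.findall("caf\u00e9 na\u00efve 123 under_score")
```

This relies on the accented letters being precomposed; text that stores accents as separate combining marks may need `unicodedata.normalize("NFC", text)` first.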
7
votes
2 answers
Python: block character will not print
In IDLE, print(chr(219)) (219's the block character) outputs "Û".
Is there any way to get it to output the block character instead?
This might actually be some sort of computer-wide problem, as I cannot seem to get the block character to print…

Name McChange
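The value 219 is the block character in the old IBM PC code page 437, whereas `chr(219)` in Python 3 returns Unicode code point 219, which is "Û". A small sketch of the difference follows; whether the result displays correctly still depends on the font and console in use.

```python
latin_char = chr(219)                    # Unicode code point U+00DB, "Û"
block = bytes([219]).decode("cp437")     # byte 219 read as code page 437: U+2588 FULL BLOCK
same_block = "\u2588"                    # the block written directly as a Unicode escape
```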
7
votes
1 answer
LINQ-to-Entities with 8-bit ASCII data
I have an interesting issue handling 8-bit "ASCII" characters in LINQ-to-Entities and am hoping someone can give me a tip.
I have inherited a SQL Server 2000 database that has some pseudo-encrypted columns where they just XOR'd the string with 0xFF.…

jwh20
7
votes
2 answers
Report Lab can't handle hebrew (unicode)
I am trying to generate a PDF from the following Python program, but the generated output doesn't display Hebrew letters correctly:
# -*- coding: utf-8 -*-
from reportlab.pdfgen import canvas
def hello(c):
    c.drawString(100,100, "מה שלומך")
c =…

user634615