Questions tagged [non-ascii-characters]

ASCII stands for 'American Standard Code for Information Interchange'. ASCII is a character-encoding scheme based on the ordering of the English alphabet. Since ASCII only contains definitions for 128 characters, numerous other encoding schemes have been created to include characters from other alphabets and other symbols.
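
As a rough illustration of the 128-character limit, the sketch below lists the characters in a string whose code points fall outside the ASCII range 0-127. The helper name `non_ascii_chars` and the sample word are assumptions made for this example, not part of any library.

```python
def non_ascii_chars(text):
    """Return the characters of text whose code point is outside ASCII (0-127)."""
    return [ch for ch in text if ord(ch) > 127]


# "avi\u00e3o" is the word "avião"; the escape keeps the source file pure ASCII.
sample = "avi\u00e3o"

# Characters that ASCII cannot represent.
outside_ascii = non_ascii_chars(sample)

# In UTF-8, each of those characters takes more than one byte,
# so the byte length differs from the character count.
char_count = len(sample)
byte_count = len(sample.encode("utf-8"))

print(outside_ascii, char_count, byte_count)
```

The gap between character count and byte count is the root of many questions under this tag, such as length mismatches and decoding errors.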

1055 questions
5 votes, 1 answer

How to remove accent from string in WP7

I want to remove accents (diacritics) from a string in Windows Phone 7. The solution here works for .NET (desktop version). However, the WP7 string has no Normalize method. Someone suggested converting the string to bytes, but I don't know what they mean. How to…
onmyway133
5 votes, 2 answers

PHP-REGEX: accented letters matches non-accented ones, and vice versa. How to achieve this?

I want to do typical highlight code. So I have something like: $valor = preg_replace("/(".$_REQUEST['txt_search'].")/iu", "\\1", $valor); Now, the request word could be something like…
Lightworker
4 votes, 1 answer

Non-ascii characters added form input only with Safari Browser

I'm facing a weird problem with the Safari browser for Windows. I have an HTML5 drag-n-drop upload form based on jQuery. It works fine with all the browsers except Safari where, for files with specific extensions, the file name is padded with…
Vasilis
4 votes, 1 answer

MySQL Workbench Import CSV Character Decoding/Encoding Error

I was trying to import a CSV file into a MySQL table using MySQL Workbench. However, I kept getting this character decoding error: Unhandled exception: 'charmap' codec can't decode byte 0x81 in position 418: character maps to <undefined> So then I…
4 votes, 1 answer

libcaca - changing ascii glyphs to Katakana

I am creating a video effect that is supposed to look like the one in the "Matrix" movie, but a bit different (the "Matrix"-like video output will be mixed with an altered alpha channel with real video, so it will look half real, half with digits). I am using simply…
4 votes, 2 answers

Python returns length of 2 for single non-ascii character string

I am trying to get the span of selected words in a string. When working with the İ character, I noticed the following behavior of Python: len("İ") Out[39]: 1 len("İ".lower()) Out[40]: 2 # when `upper()` is applied, the length stays the…
lux7
4 votes, 1 answer

Chinese character too large for enclosing character literal type

I'm trying to assign the Chinese character 牛 as a char value in C++. On Xcode, I get the error: "Character too large for enclosing character literal type." When I use an online IDE like JDoodle or Browxy, I get the error: "multi-character…
K Man
4 votes, 1 answer

How to search a string ignoring accent characters (e.g. ã = a)

Possible Duplicate: Programatic Accent Reduction in JavaScript (aka text normalization or unaccenting) I'm trying to search for a string while ignoring accents, so in my example, if I search for avião or aviao, I always get both results. Here's a start…
4 votes, 2 answers

getline() doesn't read accented characters correctly

I'm trying to read accented characters from the user with getline(), but they are not printed correctly. I tried including some libraries, such as locale, but it was in vain. Here's my code: #include #include #include…
Levy
4 votes, 4 answers

How to find/replace non printable / non-ascii characters using Python 3?

I have some lines in a .csv file that are jamming up a database import because of funky characters in some field of the line. I have searched and found articles on how to replace non-ASCII characters in Python 3, but nothing works. When I open…
user10664542
4 votes, 2 answers

Selecting character code table in ESC/POS command

I need to print non-English characters on receipts using a thermal POS receipt printer. The Xprinter XP-58III thermal POS receipt printer supports generic ESC/POS commands. As far as I know, this should be done by setting the character code table. In my case, target…
Lexx Luxx
4 votes, 1 answer

How to display/print Chinese characters with knitr::spin

I have recently begun using knitr::spin() to quickly convert my code into HTML reports, and so far it has been amazing. However, I have run into a problem when attempting to include Chinese characters in either the text or the code sections. Here is…
4 votes, 4 answers

In which encoding is 0xDB a currency symbol?

I received some files and, sadly, cannot get any information about how they were generated. I need to parse these files. The file is entirely ASCII except for one character: 0xDB (219 in decimal). Obviously (from looking at the file) this character…
NoozNooz42
4 votes, 2 answers

How to compress Non-ASCII characters to 1 byte in C for Linux?

I have a list of Turkish words, and I need to compare their lengths. But since some Turkish characters are non-ASCII, I can't compare the lengths correctly. Non-ASCII Turkish characters take up 2 bytes. For example: #include #include…
Atreidex
4 votes, 2 answers

Pandas DataFrame: remove � (unknown-character) from strings in rows

I have read a CSV file into Python 2.7 (Windows machine). The Sales Price column seems to be a mixture of strings and floats, and some rows contain a euro symbol €. Python sees € as �. df = pd.read_csv('sales.csv', thousands=',') print df Gender Size …
DanG