Questions tagged [non-ascii-characters]

ASCII stands for 'American Standard Code for Information Interchange'. ASCII is a character-encoding scheme based on the ordering of the English alphabet. Since ASCII only contains definitions for 128 characters, numerous other encoding schemes have been created to include characters from other alphabets and other symbols.

1055 questions
3
votes
3 answers

Validate email address in Javascript, and compatible with non-ASCII characters

There are many regexes which can be used to validate email address, but most of them aren't compatible with non-ASCII characters. Once an email address contains non-ASCII characters like 'Rδοκιμή@παράδειγμα.δοκιμή' or '管理员@中国互联网络信息中心.中国', they can't…
3
votes
1 answer

JAXB xjc tool and non-ascii characters

I have an xsd scheme which contains non-ascii characters. I want to generate classes but force the xjc tool NOT to convert those characters to unicode format (\uXXXX), and just leave them as is. Trouble is when it puts a value of enum in unicode…
glaz666
  • 8,707
  • 19
  • 56
  • 75
3
votes
4 answers

How can I copy from the browser and paste to vim without unicode problems

This happens to me all the time: I copy something from a rich text screen (usually a browser) and then paste it into vim. Usually its a code block and then when I go to compile or run or what have you I get all kind of bazaar errors. I scratch my…
dsummersl
  • 6,588
  • 50
  • 65
3
votes
4 answers

inserted accent into SQL

i have the probleme with accent (example: é,à,è, etc). when i inserted directly on SQL Server 2008 Management INSERT INTO LETTRE_VOIT (LIB_PORT) VALUES ('Payé') it work well. but when i load this syntax on C#, it not work. try { …
user609511
  • 4,091
  • 12
  • 54
  • 86
3
votes
3 answers

Incorrect placement of accent mark on site with UTF8 encoding (wth cyrillic words only)

I am developing webpage with contains some cyrillic text with stress marks. To place stress mark I use utf8 character U0301 (COMBINING ACUTE ACCENT). In every editor I have already used and every cyrillic-related webpage I have already seen this…
glukoz
  • 31
  • 3
3
votes
4 answers

Printing non-ascii characters in python/jinja

The following code works correctly: from jinja2 import Template mylist = ['some text \xc3'] template = Template('{{ list }}') print template.render(list=mylist) When I run it, it outputs: ['some text \xc3'] Yet, when I try to print the actual…
chaimp
  • 16,897
  • 16
  • 53
  • 86
2
votes
1 answer

Read Accented Characters from JSON or XML format in PHP

I have a XML file which have accented characters like æøåêèé. If I simply read the file using fread I can read these characters easily but if I use simplexml_load_string or DOMDocument I am not able to read these characters. Same is the case with my…
Gunjan Nigam
  • 1,363
  • 4
  • 10
  • 18
2
votes
1 answer

(e)grep: accented characters not recognised as part of a word

I would like to use (e)grep to match a whole word using the -w switch. I've set the locale, but accented characters are being treated as word boundaries as in this example: $…
user918938
2
votes
1 answer

array mismatch not finding accented characters

Well... I am stumped, but I'm sure there's a simple solution I hope! I use Joomla and the following code looks at the Joomla article title being pulled into the page and puts the appropriate flag image to the left of the title.
user991830
  • 864
  • 5
  • 17
  • 35
2
votes
1 answer

NSData to NSString with åöä

I'm downloading a webpage using NSMutableURLRequest but having problem putting that very webpage into a NSString. NSString *username = @"my_username"; NSString *password = @"my_password"; NSURL* url = [NSURL URLWithString:[NSString…
user872661
  • 251
  • 2
  • 13
2
votes
2 answers

PHP - Parsing xml with accents

I'm trying to parse an xml file that contains accents, but I get this error "String could not be parsed as XML". Unfortunately I can't work on the xml file, as it is downloaded from an external source, so I was wondering if there's any easy way to…
don
  • 4,113
  • 13
  • 45
  • 70
2
votes
2 answers

Detect if a string contains "Letter with Tilde" or "Letter + Combining Tilde"

i have this two example strings: $a = 'Anão'; $b = 'Anão'; They visually look the same, but the 3rd character is different: On string $a is Unicode 227 (latin small letter a with tilde) and on string $b is Unicode 97 (latin small letter a) +…
2
votes
1 answer

How to fix Visual Studio Code distorting characters on Windows?

I have problems with Swedish national characters when using Rust in Visual Studio Code in Windows 11. It can be shown with the following program: fn main() { let abc = " ååå ööö äää"; println!("<---{}--->", abc); } When the program is run from the…
2
votes
0 answers

android KeyEvent ACTION_MULTIPLE and getCharacters deprecation

According to the android docs, the KeyEvent ACTION_MULTIPLE (=2) constant and getCharacters() functions are deprecated at API 29. They claim these features are no longer used by the input system. I want to know the correct way to receive accented…
2
votes
2 answers

How to transform "Umlaute" (ü) to "ue" in the string?

I have a string which is looking like this for example Lübeck Nürnberg Österreich with "Umlaute". Now I want to split it and add the new "words" on the string like this Lübeck Nürnberg Österreich Luebeck Nuernberg Oesterreich. So ä should be ae, ö…
B0BBY
  • 1,012
  • 6
  • 24