Questions tagged [non-ascii-characters]

ASCII stands for 'American Standard Code for Information Interchange'. ASCII is a character-encoding scheme based on the ordering of the English alphabet. Since ASCII only contains definitions for 128 characters, numerous other encoding schemes have been created to include characters from other alphabets and other symbols.

1055 questions
3
votes
1 answer

ASCII Character to Decimal Value in R

Given a word I need to find the decimal values of each letter in that word and store it in an array. I used strtoi function to achieve this. But later found out below two functions which are supposed to give same output are giving different result.…
Bikash Behera
  • 434
  • 5
  • 12
3
votes
0 answers

Can wstring_convert just replace invalid characters?

I am currently working on a tool to extract archives from a game for the purpose of data mining. I currently extract metadata from the archives (number of files per archive, filenames, packed/unpacked sizes, etc.) and write them to a std::wstring…
3
votes
1 answer

JavaScript: Strings are equal but comparing returns false

I'm currently developing theme for Textual IRC and I want to compare the "Topic is ..." messages to the topic displayed in the channels topic bar, to delete them if the are the same. The topic that causes problems has both Umlaute and a URI in it…
JHolub
  • 290
  • 3
  • 15
3
votes
1 answer

Using ldapsearch with accented characters

I'm using ldapsearch command to query an OpenLDAP server and, while working fine for most uses, isn't very smart regarding special characters as found in languages like spanish or french. I mean, if I execute the query: ldapsearch -h myhost…
jesjimher
  • 1,175
  • 1
  • 12
  • 17
3
votes
3 answers

Formatting tabular data using unicode characters

I need to produce a calculation trace file containing tabular data showing intermediate results. I am currently using a combination of the standard ascii pipe symbols (|) and dashes (-) to draw the table lines: E.g. Numerator | Denominator |…
Wheelie
  • 3,866
  • 2
  • 33
  • 39
3
votes
8 answers

Print non-ASCII characters into a CSV file

I'm trying to create a CSV file using php. How can I print non ascii characters?
Rui Gonçalves
  • 2,423
  • 6
  • 30
  • 42
3
votes
2 answers

How to find all variations (accented, etc) of a searched string in MySQL table?

I have a big MySQL, InnoDB table which holds data in utf8mb4_unicode_ci. I have records from almost every languages those are based on latin alphabet. I am easily able to search words with accented characters and find results with different…
frankish
  • 6,738
  • 9
  • 49
  • 100
3
votes
1 answer

accents on chrome and explorer works different

Im developing a php and mysql app. When i use internet explorer it works fine i see the information with the acent.. p.e Visión but with chrome it works diferent. I see something like Visión it saves records on my utf8_general_ci database. The code…
Mary
  • 31
  • 1
3
votes
2 answers

How to efficiently remove non-ASCII characters and numbers, but keep accented ASCII characters

I have several strings like this: s = u'awëerwq\u0645\u0631\u062d\u0628\u0627\u043c\u0438\u0440bròn 1990 23x4 + &23 \'we\' we\'s mexicqué' s "awëerwq مرحباмир bròn 1990 23x4 + &23 'we' we's mexicqué" I couldn't found a way to remove non-printable…
Nacho
  • 792
  • 1
  • 5
  • 23
3
votes
2 answers

A QLineEdit/QComboBox search that ignores diacritics

I have an application where people can enter names of places in a form. This being Europe, we have to deal with names that includes diacritics like Orléans, Köln, Liège, Châteauroux. When people enter names I want them to be able to type characters…
JvO
  • 3,036
  • 2
  • 17
  • 32
3
votes
3 answers

How to check if a string contains accented Latin characters like é in Ruby?

Given: str1 = "é" # Latin accent str2 = "囧" # Chinese character str3 = "ジ" # Japanese character str4 = "e" # English character How to differentiate str1 (Latin accent characters) from rest of the strings? Update: Given str1 = "\xE9" # Latin…
sbs
  • 4,102
  • 5
  • 40
  • 54
3
votes
1 answer

Escape html & non-ascii chars with javascript

OK, so I need to replace all <, & and > plus all non-ascii characters with their html-entity counterparts. I've tried Underscore.string.escapeHTML but that didn't seem to touch the non-ascii chars. For example I need this:
föö…
Seppo420
  • 2,041
  • 2
  • 18
  • 37
3
votes
1 answer

AngularJS non-ascii property name support

I don't know how to use non-ascii property name in AngularJS. I could print a value by using a['property_name'] instead of a.property_name, but I couldn't use the same way in 'orderBy'. If I click on 'name', sorting would happen, but if I click on…
margincall
  • 483
  • 1
  • 6
  • 24
3
votes
2 answers

Python convert binary file into string while ignoring non-ascii characters

I have a binary file and I want to extract all ascii characters while ignoring non-ascii ones. Currently I have: with open(filename, 'rb') as fobj: text = fobj.read().decode('utf-16-le') file = open("text.txt", "w") …
Helen Che
  • 1,951
  • 5
  • 29
  • 41
3
votes
1 answer

Read non-ASCII MS Access database (Punjabi text using Gurmukhi (ਗੁਰਮੁਖੀ) alphabet) in MATLAB

I have a Microsoft Access database which contains columns in Punjabi language using using Gurmukhi (ਗੁਰਮੁਖੀ) script. When I read the database in MATLAB the Punjabi words are displayed as ?????. How can I read in the data correctly? here is the…