Questions tagged [codepages]

Code page is another term for character encoding. It consists of a table of values that describes the character set for a particular language.

Code page is another term for , popular with some vendors (IBM, Microsoft, Oracle, etc). It consists of a table of values that describes the character set for a particular language.

See also http://en.wikipedia.org/wiki/Code_page

458 questions
6
votes
2 answers

What codepage encodes a 'ç' as '?º' (0x3f 0xba)

Today I received a file from a customer that I have to read, but it contains strange characters. Using known names, I can guess the meaning of some characters. For example: Realname | Encoded as | sign |…
GvS
  • 52,015
  • 16
  • 101
  • 139
6
votes
2 answers

How to discover what codepage to use when converting RTF hex literals to Unicode

I'm parsing RTF 1.5+ files generated by Word 2003+ that may have content from other languages. This content is usually encoded as hex literals (\'xx). I would like to convert these literals to unicode values. I know my document's code page by…
Malcolm
  • 5,125
  • 10
  • 52
  • 75
6
votes
1 answer

Programmatically change the default code page in Windows XP? (from Delphi)

Could anyone advise how to programmatically change the default Windows XP code page (I'm doing this from Delphi)? (This would be the equivalent of going into Control Panel -> Regional Settings -> Language for non-Unicode applications). In this…
Duncan
  • 858
  • 1
  • 11
  • 29
5
votes
2 answers

Why chars become useless? libcurl c++ Utf-8 encoded html;

First of all sorry for my bad english. I have done my research but there isn't any related answers to solve my problem. I have understood and learnt about CodePages Utf 8 and other stuff about in c or c++, and also know that strings can hold…
uoakinci
  • 47
  • 3
5
votes
3 answers

How do I convert arabic letters in htmlentities symbols?

I need convert arabic letters in htmlentities symbols. Codepage: ISO-8859-1. سك - this is arabic symbol for example. htmlentities("سك") returns: سÙ� How can I get from this symbol the html-entities سك?
Anatoly
  • 141
  • 1
  • 2
  • 8
5
votes
1 answer

MS Access *.MDB Conversion to MySQL or SQLite, Problem in data Encoding

Greetings, i'll present my case: I'm in linux UBUNTU i have several Jet3 .MDB (MS Acess Database) about 500MB each, in which the datas encoded in cp1256/WINDOWS-1256 i have made the sqlite databases by following this article to do the conversion…
Indra Lukmana
  • 303
  • 3
  • 11
5
votes
2 answers

Perl Unicode glitch

In this output, why am I getting extra newlines after printing non-ASCII Unicode characters? Platform is Windows Vista and problem occurs after chcp 65001 but not after chcp 850 C:\>chcp 850 Active code page: 850 C:\>perl…
RedGrittyBrick
  • 3,827
  • 1
  • 30
  • 51
5
votes
2 answers

Xamarin UI Test "1 is not a supported code page."

I have a problem with my computer when trying to run xamarin UI tests. It's a xamarin forms project and I'm building and testing on android devices. I'm using visual studio community 2017 and windows. I have tried both in my own project and in the…
5
votes
1 answer

What is the difference between sys.stdout.encoding, locale.getpreferredencoding(), and sys.getdefaultencoding()?

I am new to python and really confused about this encoding stuff. So far, I've read about the following types of "encoding": import sys import locale print (sys.stdout.encoding) print (locale.getpreferredencoding()) print…
user3685285
  • 6,066
  • 13
  • 54
  • 95
5
votes
1 answer

Which string encoding do the VBA built-in file operations use?

Example: Open "C:\...\someFile.txt" For Output As #1 Print #1, someString Close #1 If someString contains non-ASCII characters, how are they encoded? (UTF-8, Latin-1, some codepage depending on the Windows locale, ...) On my system, the code above…
Heinzi
  • 167,459
  • 57
  • 363
  • 519
5
votes
1 answer

chcp 65001 codepage results in program termination without any error

Problem The problem arises when I want to input Unicode character in the Python interpreter (for simplicity, I have used a-umlaut in the example, but I have first encountered this for Farsi characters). Whenever I use Python with the CHCP 65001 code…
psychob
  • 63
  • 1
  • 6
5
votes
2 answers

Delphi Standard Input Output Code page

Delphi 2010 When reading from file using readLn procedure, by default i get unicode string converted from code page 1251 (code page of windows). How can i change this and read from file in other code page, in 1252 or UTF16 ?
Astronavigator
  • 2,021
  • 2
  • 24
  • 45
5
votes
2 answers

Print a list that contains Chinese characters in Python

My code looks like : # -*- coding: utf-8 -*- print ["asdf", "中文"] print ["中文"] print "中文" The output in the Eclipse console is very strange: ['asdf', '\xe4\xb8\xad\xe6\x96\x87'] ['\xe4\xb8\xad\xe6\x96\x87'] 中文 My first question is: why did the…
user958547
4
votes
4 answers

How to change the CP_ACP(0) of windows ANSI apis in an application?

I try to draw text using a dll library which has only interfaces of ANSI version encapsulated windows ANSI apis, but I need to store string data using utf-8. I don't want to convert strings using MultiByte/WideChar functions so I want an approach to…
legendlee
  • 568
  • 4
  • 12
4
votes
2 answers

In which Windows version did Windows ANSI Western (cp 1252) first appear?

I would like as definitive a reference as possible for which version of Windows introduced the Windows ANSI Western character encoding. My prime suspects are Windows 1.0 (common sense) and Windows 3.1. Windows 3.1 was claimed by implication by a…
Cheers and hth. - Alf
  • 142,714
  • 15
  • 209
  • 331
1 2
3
30 31