Questions tagged [codepages]

Code page is another term for character encoding, popular with some vendors (IBM, Microsoft, Oracle, etc.). It consists of a table of values that describes the character set for a particular language.

See also http://en.wikipedia.org/wiki/Code_page
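As a rough illustration of the "table of values" idea, the sketch below decodes the same single byte under three code pages that ship with Python's standard codecs. The helper name `decode_under` and the choice of byte 0xE9 are assumptions made for this example only.

```python
def decode_under(byte_value: int, codepage: str) -> str:
    """Look up one byte value in the given code page's table."""
    return bytes([byte_value]).decode(codepage)


if __name__ == "__main__":
    # The same byte maps to a different character in each table.
    for codepage in ("cp1252", "cp437", "cp500"):
        print(codepage, repr(decode_under(0xE9, codepage)))
```

The byte stays the same; only the lookup table changes, which is why text read with the wrong code page shows up as unexpected characters.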

458 questions
3
votes
1 answer

Encoding strangeness with Cp500 (LF & NEL)

Recently I had a strange issue with the Cp500 (EBCDIC) encoding during a transformation from bytes to String and then back from String to bytes. The issue is that one specific character LINE FEED - LF - 0x25 is, during this transformation, being…
Jean-Michel Garcia
  • 2,359
  • 2
  • 23
  • 44
3
votes
2 answers

Determine if character is printable

I want to develop a hex-dump-view and have problems with characters which are not printable in the current active ANSI codepage (CP_ACP). How do I detect them and print a dot instead? My function currently looks like this: function HexChar(j: byte):…
Daniel Marschall
  • 3,739
  • 2
  • 28
  • 67
3
votes
1 answer

Specifying codepage for PInvoke string marshalling using C#

I am calling a DLL using PInvoke. The DLL's function returns a C string in codepage 437. Is there a way to have the .NET marshaling convert the string to Unicode, or could someone suggest which parameters I should give to DllImport() and MarshalAs()…
Pigrew
  • 231
  • 3
  • 8
3
votes
1 answer

Extended ASCII in C#

I want to store some of the extended ASCII characters in a dictionary for lookup, but I am having a little trouble with the conversion. The current method I have to store these characters works for all the non-graphical looking ASCII characters 0x20…
lakedoo
  • 385
  • 1
  • 4
  • 13
3
votes
1 answer

How to create a UTF-8 file from a StringBuilder object in Java

I have a problem with file encoding. I have a method which exports my DB to an XML in a format I created. The problem is that the file is created with ANSI encoding and I need UTF-8 encoding (some Spanish characters aren't shown properly on…
Alberto
  • 139
  • 1
  • 3
  • 14
3
votes
1 answer

What encoding to use for Zip archive comments?

What encoding is supposed to be used for Zip archive comments? I know that the encoding of file comments (and file names) is determined by Bit 11 of the files' General Purpose Bit Flags. However, there is nothing like that for Zip archive comments,…
Malarial
  • 51
  • 5
3
votes
1 answer

Advantage Database Index Collation Sequence

I am converting a Delphi program from the BDE to Advantage Database. On weekends I work on a Win 7 machine using Delphi XE. During the week I work on a Win XP machine using Delphi 7. Advantage tables work fine on the Win 7 machine but when copied…
3
votes
1 answer

8086 Assembly Int 21h and Extended ASCII characters

I need some help with an assignment. I need to process a plain text file in ASCII and return how many characters of each code are present (how many a's, how many b's, and so on). It works now almost perfectly. I now have the problem that, if there…
dhcarmona
  • 402
  • 2
  • 10
  • 29
3
votes
2 answers

What encoding/code page is cmd.exe using when it is started?

A nearly identical question was asked before. A good explanation of code pages was given in the reply, but it did not answer the question in my mind: What controls the code page used when cmd.exe is started? On my system, it gets changed somehow. In…
user1462402
  • 39
  • 1
  • 2
2
votes
5 answers

UTF-8 vs code page 1252 in Visual Studio 2008 for HTML and JavaScript that includes European characters

I have been developing a parser that takes JavaScript as input and creates a compressed version of that JavaScript as output. I found initially that the parser failed when attempting to read the input JavaScript. I believe this has something to do…
mark smith
  • 20,637
  • 47
  • 135
  • 187
2
votes
2 answers

Are non-latin numerals in Windows SBCS codepages used by any Microsoft libraries to represent numerical data in C strings?

I'm trying to write a parser for "text" files which I know will be encoded in one of the Windows single byte code pages. These files contain text representations of basic data types, and the spec I have for these representations is lacking, to say…
Samuel Harmer
  • 4,264
  • 5
  • 33
  • 67
2
votes
0 answers

How can I create a codepage under Windows?

I want to create a codepage under Windows, but I do not know where to start. Thanks.
Jichao
  • 40,341
  • 47
  • 125
  • 198
2
votes
1 answer

charset conversion with icu or iconv

In my CGI library, I'm using a converter in order to convert from an IANA-registered charset to native wide Unicode (UTF-16/32, depending on platform). With ICU, are all the sets and aliases listed in http://www.iana.org/assignments/character-sets…
moshbear
  • 3,282
  • 1
  • 19
  • 33
2
votes
1 answer

Change default code page for debug console in VS10

Currently it defaults to my system locale, which is 932 (Japanese Shift-JIS) in my case, but I want it to be 65001 (UTF-8) by default. I can change the default for a given program by inserting a SetConsoleOutputCP line somewhere in the code and then…
Nikolai
  • 3,053
  • 3
  • 24
  • 33
2
votes
1 answer

Decode funny ISO encoding of e-mail message

I have an e-mail body (char[] buffer, retrieved via POP3), it apparently uses some markup for ISO code pages, e.g. a subject like Daß ißt ään schlümmer Test is encoded as =?iso-8859-1?Q?Da=DF_i=DFt_=E4=E4n_schl=FCmmer_TDest Is there anything…
peterchen
  • 40,917
  • 20
  • 104
  • 186
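
The `=?iso-8859-1?Q?...` markup in the excerpt above looks like the encoded-word syntax for mail headers (charset, then `Q` for a quoted-printable style encoding, then the text). A minimal sketch of decoding such a value with Python's standard `email.header` module follows; the helper name `decode_encoded_words` and the shortened, well-formed sample string are assumptions for illustration, not the asker's data.

```python
from email.header import decode_header, make_header


def decode_encoded_words(value: str) -> str:
    """Decode encoded words in a header value into a plain Unicode string."""
    # decode_header splits the value into (bytes, charset) parts;
    # make_header reassembles them using each part's declared charset.
    return str(make_header(decode_header(value)))


if __name__ == "__main__":
    # Hypothetical sample: a complete encoded word terminated by "?=".
    sample = "=?iso-8859-1?Q?Da=DF_i=DFt?="
    print(decode_encoded_words(sample))
```

In the Q encoding an underscore stands for a space and `=XX` is a byte in hexadecimal, interpreted in the code page named at the start of the encoded word.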