Questions tagged [codepages]

Code page is another term for character encoding, popular with some vendors (IBM, Microsoft, Oracle, etc.). It consists of a table of values that describes the character set for a particular language.

See also http://en.wikipedia.org/wiki/Code_page
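As a small illustration of "a table of values", the sketch below decodes one and the same byte under three Windows code pages using Python's built-in codecs. The byte value 0xDE and the choice of code pages are picked purely for illustration; they are not taken from the tag description.

```python
# One byte, looked up in three different code page tables.
raw = bytes([0xDE])

western = raw.decode("cp1252")   # Windows Western European
turkish = raw.decode("cp1254")   # Windows Turkish
cyrillic = raw.decode("cp1251")  # Windows Cyrillic

# The byte is identical; only the lookup table changes the result.
for name, char in [("cp1252", western), ("cp1254", turkish), ("cp1251", cyrillic)]:
    print(f"{name}: U+{ord(char):04X} {char}")
```

The same stored byte therefore shows up as a different character depending on which code page the reader assumes, which is the root of most questions under this tag.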

458 questions
2
votes
1 answer

Which character encoding is the IPython terminal using?

I used to think I had this whole encoding stuff pretty figured out. I seem to be wrong because I can't explain what's happening here. What I was trying to do is to use the tabulate module to print a nicely formatted table using from tabulate import…
Tim Pietzcker
  • 328,213
  • 58
  • 503
  • 561
2
votes
4 answers

How do you convert posted "english" characters from international PCs in ASP.NET? (ex 2205)

I have a WebForm search page that gets occasional hits from international visitors. When they enter in text, it appears to be plain ASCII a-z, 0-9 but they are printed in bold and my "is this text" logic can't handle the input. Is there any easy way…
James White
  • 2,062
  • 2
  • 24
  • 36
2
votes
2 answers

Remove characters from C# string not belonging to a specific code page

In C# I have a string that goes on to be inserted into a db table using codepage 37 US. So for instance, the '€' will cause the insert operation to fail. What is a good way to clean my string of characters not represented in code page 37 and…
2
votes
1 answer

Why does ReadLn mis-interpret UTF8 text when non-unicode page is Korean (949)?

In Delphi XE2 I can only read and display Unicode characters (from a UTF8 encoded file) when the system locale is English using the AssignFile and ReadLn() routines. Where it fails: if I set the system locale for non-Unicode applications to Korean…
Duncan
  • 858
  • 1
  • 11
  • 29
2
votes
1 answer

Why am I getting "UnicodeEncodeError: 'charmap' codec can't encode character" error message

I encountered a problem with a Python script I wrote while running in a Windows CMD window, and boiled the essence of the problem down to the following SSCCE: The Python script (x.py) import sys in_file = open (sys.argv[1], 'rt') for line in…
Moshe Rubin
  • 1,944
  • 1
  • 17
  • 37
2
votes
3 answers

ListView showing wrong text VB6.0

I've been testing an application using a listView which works well with Western European Characters. However if the locale is in Turkish the characters appear to be wrong in the listview. For example the character 'Ş' will appear as 'Þ'. I can see…
dan983
  • 454
  • 2
  • 5
  • 19
2
votes
1 answer

Corrupted Hebrew: saved as ANSI - convert back to UTF-8

I suspect some data has been saved (on Windows machines) as ANSI. Therefore, the original Hebrew characters got lost and what we see is stuff like ùéôåãé äòéø. Is the information lost or is there a possibility to map back the characters, knowing…
NoIdeaHowToFixThis
  • 4,484
  • 2
  • 34
  • 69
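A minimal sketch of one way such text can sometimes be mapped back, under two assumptions that the question excerpt does not confirm: that the original bytes were Hebrew in Windows-1255, and that they were merely displayed as Windows-1252 rather than lossily converted. The variable names are hypothetical.

```python
# Sample string as it appears in the question excerpt.
garbled = "ùéôåãé äòéø"

# Assumption: these characters are Windows-1255 bytes misread as Windows-1252.
# Re-encoding with cp1252 recovers the raw bytes; cp1255 reinterprets them.
raw_bytes = garbled.encode("cp1252")
recovered = raw_bytes.decode("cp1255")

print(recovered)                   # Hebrew text, if the assumption holds
print(recovered.encode("utf-8"))   # the same text as UTF-8 bytes
```

If the data passed through a conversion that replaced unmappable characters (for example with `?`), the round trip raises an error or yields wrong text, and the information is not recoverable this way.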
2
votes
1 answer

What is the most robust way to determine the current codepage from a shell script?

I'd like to determine the environment's current codepage at runtime from a Unix shell script. What's the most reliable way of doing this? I'm looking into parsing environment variable $LC_ALL, but it isn't always set to a useful value, and its…
rewbs
  • 1,958
  • 4
  • 22
  • 34
2
votes
1 answer

GHCi: incorrect text output despite the right font and codepage

Windows 8.1 x64 Russian. I create and fill the %AppData%\ghc\ghci.conf file: :! title GHCi (Haskell interpreter) putStrLn $ replicate 30 '*' putStrLn "© Андрей Бушман, 2014" -- The sample of some not English chars... :set prompt "\x03BB: " This…
Andrey Bushman
  • 11,712
  • 17
  • 87
  • 182
2
votes
1 answer

Read a UTF8 file (created on notepad) and convert to CP850 string

I'm trying to read a UTF8 file and convert it to CP850 (to send to a printer device). My test string is "ATIVAÇÃO" A T I V A Ç Ã O 0x41 0x54 0x49 0x56 0x41 0xC3 0x87 0xC3 0x83 0x4F My Java code: private static void…
fabriciols
  • 959
  • 1
  • 12
  • 25
2
votes
4 answers

How to find if a character belongs to a particular codepage using C++ or calling WinAPI

How can we find if a character belongs to a particular codepage? Or how can we determine whether a character fits into the currently active IME for an application?
Prakash
  • 742
  • 7
  • 19
2
votes
1 answer

How to detect keyboard input code page

I need to detect the code page the keyboard input is using while the user is entering data into application fields. I tried to use System.Text.Encoding.Default.CodePage; but it gives the code page of what is configured in regional settings. Then I…
user1651105
  • 1,727
  • 4
  • 25
  • 45
2
votes
0 answers

Convert charset into codepage id

While parsing a .RTF file, I run into classic charset identifiers (i.e. 204 = Cyrillic). I'd like to convert them to the appropriate codepage id, so I can instantiate the correct Encoding object. There is a nice table on MSDN that lists the available…
Paul-Jan
  • 16,746
  • 1
  • 63
  • 95
2
votes
1 answer

Verifying if a String is fully supported in a specific codepage

Below I have a small test program that can verify if the content of a String is fully supported in codepage CP852 or not. Is there a more elegant way of doing this? public class CodepageTest { public static void main(String[] args) { try…
Ebbe M. Pedersen
  • 7,250
  • 3
  • 27
  • 47
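For comparison, the same kind of check can be sketched in Python, where a strict encode raises an error on the first unsupported character. This is not the asker's Java program; the helper names `is_supported` and `unsupported_chars` are hypothetical, and CP852 is used only because the question mentions it.

```python
def is_supported(text: str, codepage: str = "cp852") -> bool:
    """Return True if every character of text can be encoded in codepage."""
    try:
        text.encode(codepage)  # strict mode: raises on unmappable characters
    except UnicodeEncodeError:
        return False
    return True


def unsupported_chars(text: str, codepage: str = "cp852") -> list[str]:
    """List the characters of text that codepage cannot represent."""
    return [ch for ch in text if not is_supported(ch, codepage)]


print(is_supported("Łódź"))        # Central European letters
print(is_supported("price: 5 €"))  # euro sign
print(unsupported_chars("price: 5 €"))
```

The second helper is slower than a single encode call, but it reports which characters are the problem rather than only that one exists.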
2
votes
1 answer

Why do some character codes lack a character in Windows-125* encodings?

In Windows-125* code pages (and actually, in corresponding ISO* ones too), certain character codes don't have a corresponding character defined. What was the premise behind leaving some of the character codes without a character? My understanding…
colemik
  • 1,417
  • 2
  • 20
  • 24