Questions tagged [non-unicode]

Unicode is intended to be a universal character set for describing all the characters required for written text incorporating all writing systems, technical symbols and punctuation. But unicode isn't supported on every system, and many other character sets exist.

This tag concerns encoding questions dealing with non-unicode character sets. It can be about conversion from/to unicode, or dealing with special characters on systems not supporting unicode.

Some commons character sets:

  • ASCII. 7-bits. Only non-accented latin characters.
  • ISO-8859-1 (a.k.a. Latin-1). 8-bits. Occidental latin characters.
  • ISO-8859-15 (a.k.a Latin-9). Like ISO-8859-1, but with some additional characters like euro sign.
  • CP-1252. 8-bits. Occidental latin characters used by Windows.
  • CP-850. 8-bits. Occidental latin characters used by DOS.
63 questions
0
votes
1 answer

How can I save unicode columns in non-unicode Oracle database with VARCHAR2?

How can I save unicode characters in a Oracle database, configured in a non-unicode charset (this configurations is impossible to change)? The text will be saved in VARCHAR2 columns (it can't be NVARCHAR2).
John Assymptoth
  • 8,227
  • 12
  • 49
  • 68
-1
votes
2 answers

Non unicode to Unicode conversion, for any font!

I have a html file with text encoded in a non-unicode font. I need to convert that file to unicode. I searched for a convertor. But, most of the convertors work for only a list of fonts, not for all fonts. My font is very specific, text is in…
user625118
  • 29
  • 1
  • 6
-1
votes
1 answer

Non-Unicode to unicode conversion of a txt file

Given a txt file with non-unicode text, I am able to detect its charset as 1251. Now, I would like to convert into unicode. byte[] bytes1251 = Encoding.GetEncoding(1251).GetBytes(File.ReadAllText("sampleNU.txt")); String str =…
John
  • 693
  • 1
  • 12
  • 37
1 2 3 4
5