Questions tagged [character-set]

A character set maps a set of characters to specific numeric values, e.g. ASCII, UTF-8 and ISO-8859-1.

A character set maps a set of characters to specific numeric values.

Modern computer languages, editors and tools facilitate encoding and decoding of data between internal representations of data and specific character sets. Examples include ASCII, UTF-8 and ISO-8859-1.

Consideration should be given to using the appropriate character set for transmission and persistence of data, particularly text that can contain special characters (such as European languages like French or German) or be in a completely different script (such as Japanese) - see internationalisation (also referred to as i18n).

120 questions
3
votes
1 answer

Checking CharacterSet for single UnicodeScalar yields strange behaviour

While working with CharacterSet I've come across an interesting problem. From what I have gathered so far CharacterSet is based around UnicodeScalar; you can initialise it with scalars and check if a scalar is contained within the set. Querying the…
Michael Waterfall
  • 20,497
  • 27
  • 111
  • 168
2
votes
2 answers

Parsing of CSV file using Node/Express spits out weird \x001 codes

I'm using Node and Express to fetch a .CSV file from a URL that I want to parse. The process of downloading it works just fine. But when I use csv-parser to parse the file the output in the console looks like this: Just tonnes of lines of weird…
2
votes
1 answer

Is there a way to list all categories in perluniprops?

perluniprops lists the Unicode properties of the version of Unicode it supports. For Perl 5.32.1, that's Unicode 13.0.0. You can obtain a list of the characters that match a category using Unicode::Tussle's unichars. unichars '\p{Close_Punctuation}'…
alvas
  • 115,346
  • 109
  • 446
  • 738
2
votes
2 answers

Getting Arabic characters as ??? in PHP from JDE

I am trying to fetch our Arabic values from JDE Database using the following connection string: $dsn = "Driver={SQL Server};Server=10.10.10.27;Database=JDE;charset=utf8"; $username = "username"; $password = "password"; $string =…
2
votes
1 answer

ESC/POS Termal printer UTF-8 charset set up

I am trying to print some currency with Citaq v8 (it has a termal printer), but it prints ?, because of the printer character set. I have byte array (UTF-8) data to print. But the printer has default some character set. How can I change the default…
2
votes
1 answer

Laravel SQLSTATE[01000]: Warning: 1265 Data truncated for column for urdu or arabic character which are RTL mean right to left

I am stuck in an error for many days, I want to insert data in database in urdu or arabic language , the problem I am facing is strange the insert query changes the position of variables which throws data truncated error , I have used the same work…
mutahir
  • 97
  • 3
  • 11
2
votes
2 answers

How to print a content of the CharacterSet.decimalDigits?

I tried to print a content of the CharacterSet.decimalDigits with: print(CharacterSet.decimalDigits) output: CFCharacterSet Predefined DecimalDigit Set But my expectation was something like this: [1, 2, 3, 4 ...] So my question is: How to print…
Blazej SLEBODA
  • 8,936
  • 7
  • 53
  • 93
2
votes
0 answers

Using Greek Code Page (1253) in locale Turkish (1254) Computer

In my server application I am trying to parse my responses with UTF-8 in Greek but since my local char set is 1254 thus I can not reach every Greek character. I tried setting my threads local as 1253 but it did not work. I want to know if there is…
cangermi
  • 29
  • 2
2
votes
0 answers

How does the default character set get initialized in the JVM on Linux

When I run a java application on "Linux CentOS 7" Charset.defaultCharset() is returning "US-ASCII". If I pass the JVM argument -Dfile.encoding="UTF-8" I am able to change the character set to UTF-8. But my question is how does the default character…
Nick
  • 629
  • 1
  • 6
  • 9
2
votes
1 answer

How to display symbols i.e. degree (°) same in all systems

Our application has database Oracle 12c and Client in Java. I wonder when using plsql developer for oracle then symbols i.e.° (degree) are displayed using system's language setting or plsql developer setting or oracle server character setting? We…
2
votes
1 answer

How to get tablename in hindi

I am working with Oracle 11g and my database character set is AL32UTF8. I am able to store hindi values in a table but I want the tablename in hindi EXAMPLE: create a table `राकी` column names are ऋतु,अवधि value1 गर्मी,सुबह value2 सर्दी,शाम How…
Sk.mallika
  • 33
  • 4
2
votes
0 answers

Creating an instance of Informix with UTF8 character set

Overview: I am trying to create an Informix instance with a character set of UTF8. Details: I understand that the default character set in informix when creating an instance is ISO Latin 1 (819). However, I am trying to convert my informix…
2
votes
2 answers

How can I know the character set of HTML content by HTTP headers?

I know the parameter charset= in the HTTP header:Content-Type can be used to determine the character set of the HTML content. But if the parameter is missing in the Content-Type header, how can I know the character set of the HTML content? I also…
peter
  • 379
  • 2
  • 6
  • 16
2
votes
1 answer

Oracle: Conversion from WE8ISO8859P1 to AL32UTF8

I am trying to consolidate some databases and I have some problems with CHARACTER Sets. My database looks like this: Source Database Oracle Database 11g Enterprise Edition Release 11.2.0.3.0 - 64bit Production NLS_CHARACTERSET …
adimoise91
  • 558
  • 1
  • 7
  • 26
2
votes
2 answers

Oracle Database Character Set

Hello how do I change or alter the character set of a database for it to support nihongo characters
cedric
  • 3,107
  • 15
  • 54
  • 65