Questions tagged [character-set]

A character set maps a set of characters to specific numeric values, e.g. ASCII, UTF-8 and ISO-8859-1.

A character set maps a set of characters to specific numeric values.

Modern computer languages, editors and tools facilitate encoding and decoding of data between internal representations of data and specific character sets. Examples include ASCII, UTF-8 and ISO-8859-1.

Consideration should be given to using the appropriate character set for transmission and persistence of data, particularly text that can contain special characters (such as European languages like French or German) or be in a completely different script (such as Japanese) - see internationalisation (also referred to as i18n).

120 questions
1
vote
1 answer

Binary data type collation with Characters in MariaDB

I am trying to get a deep understanding about RDBMS, I am learning MariaDB. Struggling to get the Binary data type and how its collation works. I understand the binary(n) data type takes n bytes. This means the smallest memory that could be…
theMyth
  • 121
  • 6
1
vote
1 answer

How to identify non utf8 symbols in a range of text?

In our database we have Latin and Cyrillic alphabets and all is fine with those characters. But, sometimes people enter characters that appear as ? or squares. Is there a way to find those specific symbols, because looking manually through a couple…
DarkBlade
  • 47
  • 6
1
vote
1 answer

MySQL: 1366 Incorrect string value: '\xE0' in utf8mb4 column

I'm getting errors when certain characters are being added to a table... even when the column is has utf8mb4 character set. For example: SQLSTATE[HY000]: General error: 1366 Incorrect string value: '\xE0' for column 'surname' The data in question…
Chuck Le Butt
  • 47,570
  • 62
  • 203
  • 289
1
vote
2 answers

MySQL CHAR_LENGTH(str) sometimes produces different outputs for the same String literal when used with different character set introducers

This is weird. As per MySQL developer website - MySQL CHAR_LENGTH(str) Returns the length of the string str, measured in characters. A multibyte character counts as a single character. This means that for a string containing five 2-byte characters,…
Payel Senapati
  • 1,134
  • 1
  • 11
  • 27
1
vote
1 answer

What encoding should we use in java to read files in US7ASCII encoding

I am trying to parse US7ASCII file using java, using the below code: FileInputStream fileInputStream = new FileInputStream(file); InputStreamReader inputStreamReader = new InputStreamReader(fileInputStream, charSetName); In line 2, the 2nd…
Zeus
  • 319
  • 2
  • 13
1
vote
1 answer

How to encode a STRING variable into a given code page

I've got a string variable containing a text that I need to encode and write to a file, in UTF-16LE code page. Currently the following code generates a UTF-8 file and I don't see any option in the statement OPEN DATASET to generate the file in…
Sandra Rossi
  • 11,934
  • 5
  • 22
  • 48
1
vote
0 answers

Can I use the PHP session_id as name for a temporary directory?

I have a straightforward question for which I have not been able to find an answer anywhere. I would like to use session_id() as the name for a temporary directory (during file upload) which I then rename to something else when all files have been…
Pete
  • 105
  • 1
  • 9
1
vote
0 answers

Why does appcmd list fail for åäö in Azure agent job?

We've got an Azure DevOps release pipeline that executes the "Manage IIS Website" task to create or update a site + app pool on a Windows 2016 server, English version. The names of the app pool and the site contain "ö". This has worked fine before,…
1
vote
1 answer

Will existing indexes be affected when changing character_set and collation of MySQL db

We have an database where the default character set for tables and columns is set to utf8 encoding But with character set encoding of utf8 , we are unable to save emojis To support saving of emojis, a) We had to change the character set of…
1
vote
0 answers

Choosing an Oracle Database Character Set

Does anyone know what is Oracle Database Character Set Name for Polish?
Anh Nguyen
  • 41
  • 5
1
vote
0 answers

SQL - "Ilegal mix of collations" when using a variable in FIND_IN_SET function

The problem I use the native function FIND_IN_SET with a variable @myVar for the list of string, like : FIND_IN_SET(col, @myVar) If I set @myVar from a direct value : SET @myVAr = 'one,two,three'; ... FIND_IN_SET(col, @myVar) => It works great If…
Flyout91
  • 782
  • 10
  • 31
1
vote
2 answers

BLOB to XMLType XML parsing failed in oracle

I can't convert BLOB to XMLType in oracle database. I've tried this: select XMLType( BLOB_COLUMN, 1 /* this is character set ID. 1 == USASCII | ISO-8859-2 char ID?*/ ) as XML from my_table; I got this error message: ORA-31011: XML…
Ztrew
  • 48
  • 1
  • 7
1
vote
2 answers

oracle client can not display the thai language

I have already set up the oracle client NLS_LANG=THAI_THAILAND.TH8TISASCII in regedit>Computer\HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\ORACLE\KEY_OraClient10g_home1. And the database server CHARACTERSET also is…
alex
  • 11
  • 2
1
vote
1 answer

How do I make an NSPredicate that filters local characters properly?

My app is using a Realm database to store a number of items. With a SearchBar, I search (case insensitive) for an item that may already be stored in the database (and if not, it will be added). The item names often contain one of the swedish…
Stalle
  • 67
  • 1
  • 6
1
vote
1 answer

error_log flooded by "charset not supported, assuming utf-8" messages

The issue: Wordpress blog's error log is flooded by "charset not supported, assuming utf-8" messages; grows 0 bytes to 450 Mb in 24 hrs (~28k page views, if stats are correct). Details: I have a Wordppress-powered blog hosted on shared hosting…
flamey
  • 2,311
  • 4
  • 33
  • 40