Questions tagged [character-set]

A character set maps a set of characters to specific numeric values, e.g. ASCII, UTF-8 and ISO-8859-1.

A character set maps a set of characters to specific numeric values.

Modern computer languages, editors and tools facilitate encoding and decoding of data between internal representations of data and specific character sets. Examples include ASCII, UTF-8 and ISO-8859-1.

Consideration should be given to using the appropriate character set for transmission and persistence of data, particularly text that can contain special characters (such as European languages like French or German) or be in a completely different script (such as Japanese) - see internationalisation (also referred to as i18n).

120 questions
0
votes
1 answer

C character coding on windows console

I am having trouble in understanding the character set for printing on the console in for a Windows C programme. I have not found any question answering this directly (if there should be one a link would be appreciated). When looking through some…
Bert
  • 57
  • 6
0
votes
2 answers

Is it safe to change the character set for a column from utf8 to ascii?

The database for an application I manage use UUIDs. These UUIDs are stored as char(36) with a utf8 character set. From a strict performance point of view, I know this is not optimal. The general recommendation seem to be to use a binary(16) column…
sbrattla
  • 5,274
  • 3
  • 39
  • 63
0
votes
0 answers

Why does CONCAT'ed value in MariaDB query result in Illegal mix of collations (utf8mb4_bin,NONE) collation?

We have a query that CONCATs a bunch of values together, and then searches against it, and we're getting this error: Illegal mix of collations (utf8mb4_bin,NONE) and (utf8mb4_general_ci,COERCIBLE) for operation 'like' The strange thing is that on…
ilasno
  • 714
  • 1
  • 13
  • 31
0
votes
0 answers

Restrict Visual Studio 2019 to allow only latin characters in code

How can I restrict Visual Studio 2019 to allow only latin characters in identifiers? That means, I want method names and variable names to be in latin characters, and not for example in greek characters. The following code compiles in Visual Studio…
Alex
  • 419
  • 1
  • 5
  • 14
0
votes
0 answers

Visual Basic, MySQL Restored Backup Not recognising images

5 years ago I had a Visual Basic program that could save images into a MySQL database and retrieve it later. It all worked. When COVID struck this project was canned. Now they want it back... I've got an SQL backup file of the entire database that I…
Richard
  • 1
  • 1
0
votes
0 answers

A few questions about character sets and mapping (translation phase 1)

The questions below are about Character sets (C11, 5.2.1 Character sets) and mapping (C11, 5.1.1.2 Translation phases, 1). The list: Can a source character set as an extension include control characters, representing other than horizontal tab,…
pmor
  • 5,392
  • 4
  • 17
  • 36
0
votes
1 answer

Diamond with question mark (replacement character) in VSCode terminal

Replacement characters appear instead of some symbols of error text (the error itself doesn't matter) in build active file terminal. The default system language on my computer is set to Russian, all file encodings are set to UTF-8, file doesn't…
Shenyuan
  • 23
  • 1
  • 5
0
votes
1 answer

is there some characters that has the same encoding regardless which character set is used ? and if yes which are they?

just asking if there is some characters that has the same encoding in all character sets and what are those characters exactly if there is a list of them ? for example they may be the 0-9 characters or may be all English characters ... don't know !
0
votes
1 answer

Output buffer empty in iconv , while converting from ISO-8859-1 to UTF-8

In linux I have created a file with Turkish characters and changed file characterset to "ISO-8859-9". With below cpp, I am trying to convert it to UTF-8. But iconv returns empty outbuffer. But "iconv" returns "inbytesleft" as "0" means conversion…
0
votes
1 answer

Informix JDBC - fail to run sql update using JDBC, but ok when run using dbaccess

The value to update contains 1 Arabic-like character ie ݠ . Unicode point number is U+0760 (I get it from https://unicode.scarfboy.com/) If the SQL is run using dbaccess the update works. If the character is replaced with different arabic character,…
user3737906
  • 89
  • 1
  • 9
0
votes
0 answers

MySQL character set CLI and Workbench show different results

Q: Character sets shown by the MySQL CLI command lines and my MySQL workbench show different results. Why is that? I've run into MySQL 'chcp' error and cannot view my server status after updating my MySQL workbench, current version is 8.0.27. So…
0
votes
0 answers

Why "character_set_connection" influences insert result?

I change column character set to utf8mb4 to support emojis, but when character_set_connection/character_set_client is set to utf8, inserting data returns "Incorrect string value xxx for column xxx at row 1". I can not successfully insert data until…
ccc
  • 65
  • 1
  • 5
0
votes
1 answer

Initialize object for unicode fonts

I wrote a class object to access mathematical alphanumeric symbols from the unicode block as described on https://en.wikipedia.org/wiki/Mathematical_Alphanumeric_Symbols # San-serif LATIN_SANSERIF_NORMAL_UPPER = (120224,…
alvas
  • 115,346
  • 109
  • 446
  • 738
0
votes
2 answers

How to store Unicode characters in an array?

I'm writing a C++ wxWidgets calculator application, and I need to store the characters for the operators in an array. I have something like int ops[10] = {'+', '-', '*', '/', '^'};. What if I wanted to also store characters such as √, ÷ and × in…
iKebab897
  • 73
  • 2
  • 7
0
votes
1 answer

How can we save emojis in SQL database to preserve their uniqueness for equality comparisons?

We have an issue when we save emojis to our database. We have altered our table column to a character set and collation that should save enough bytes to distinguish between emojis. This was our query to modify the table column: ALTER TABLE TableName…