Questions tagged [utf-8]

UTF-8 is a variable-width character encoding of the Unicode character set in which each code point is encoded as one to four bytes. Unlike some other encodings such as UTF-16, UTF-8 is backward compatible with 7-bit ASCII, and can be processed to some degree by applications that are only aware of bytes.
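As a rough illustration of the byte-level behaviour described above, the following Python sketch counts how many bytes UTF-8 uses per character and checks that pure ASCII text is byte-for-byte identical in both encodings. The helper name `utf8_byte_lengths` and the sample characters are assumptions made for this example, not part of the tag description.

```python
def utf8_byte_lengths(text):
    """Return (character, UTF-8 byte count) pairs for each code point in text."""
    return [(ch, len(ch.encode("utf-8"))) for ch in text]


# Sample characters (chosen for illustration): A, e-acute, euro sign, an emoji.
samples = "A\u00e9\u20ac\U0001F600"
for ch, size in utf8_byte_lengths(samples):
    print(f"U+{ord(ch):04X} takes {size} byte(s) in UTF-8")

# ASCII-only text produces exactly the same bytes under ASCII and UTF-8.
ascii_text = "plain ASCII"
print(ascii_text.encode("utf-8") == ascii_text.encode("ascii"))

# Every byte of a multibyte sequence has its high bit set, so a
# byte-oriented tool never confuses it with a 7-bit ASCII character.
print(all(b >= 0x80 for b in "\u00e9\u20ac".encode("utf-8")))
```

Because non-ASCII characters never reuse byte values below 0x80, tools that only split on ASCII delimiters (newlines, slashes, commas) can usually pass UTF-8 through untouched.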

Full support of UTF-8 for searching, collation, word parsing, etc. does require support for Unicode concepts such as characters (as opposed to bytes), normalisation, and supplementary characters. Many application and OS problems with "special characters", such as accented European letters or the ideographs used in Japanese and Chinese, derive from mismatched character encodings.
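A minimal Python sketch of two of the issues mentioned above: decoding UTF-8 bytes with the wrong codec, and comparing strings that differ only in normalisation form. The function names `misdecode` and `same_text` are hypothetical helpers written for this example, and Latin-1 is just one possible "wrong" encoding.

```python
import unicodedata


def misdecode(text, wrong_codec="latin-1"):
    """Encode text as UTF-8, then decode it with a mismatched codec."""
    return text.encode("utf-8").decode(wrong_codec)


def same_text(a, b):
    """Compare two strings after normalising both to NFC."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# U+00FC (u with diaeresis) is two bytes in UTF-8; read as Latin-1,
# each byte becomes its own character.
print(misdecode("\u00fc"))

# The same visible letter can be one code point or a base letter
# followed by a combining accent.
composed = "\u00e9"      # precomposed e-acute
decomposed = "e\u0301"   # "e" + combining acute accent
print(composed == decomposed)           # plain comparison
print(same_text(composed, decomposed))  # comparison after normalisation
```

Here the mismatched decode prints `Ã¼`, the plain comparison prints `False`, and the normalised comparison prints `True`.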

104 questions
0
votes
1 answer

Setting locale permanently fails on VPS machine Ubuntu server 16.04

I tried everything that I could find, including dpkg-reconfigure locale, reinstalling packages completely, adding LC_ALL and other settings to etc/environment, tried adding to /etc/locale.conf , to .bashrc... Removing from ssh-d... There is no…
Aleksandar Pavić
0
votes
1 answer

knife ignores client_key in ~/.chef/knife.rb

I am getting errors like WARN: Failed to read the private key /etc/chef/client.pem: # ERROR: Your private key could not be loaded from /etc/chef/client.pem Check your…
chicks
0
votes
1 answer

German Umlauts (but only uppercase) are broken

I have a Proxmox 3 server. I log in via SSH. Umlauts (öäüÜÖÄ) work well, whether upper- or lowercase. Then I log in to my OpenVZ node (Debian wheezy) via: vzctl enter 100 In this node, on the CLI, umlauts do not work; in a text file only lowercase…
suther
0
votes
1 answer

Special chars not displayed correctly

I have a problem when I copy a file from Server A (Debian 7.4) to Server B (Debian 7.4). Polish special characters in files are not displayed correctly on Server B (in mcedit). I created a file with Polish special characters on Server A. It is displayed…
marag
0
votes
3 answers

how does one quote unicode characters in mysql prompt or in SQL in general?

I have a weird Unicode character in my MySQL database. The value looks like this: card issuer bank didnt approve your payment, so what should be an apostrophe is a weird Unicode character, presumably from Windows. I want to replace it, but don't know how…
Aleksandar Ivanisevic
0
votes
1 answer

cygwin find -ls behaviour like ls

I use Cygwin find with the -ls option (find . -ls) to get a list of files with their full paths. Because UTF-16 (Windows XP in this case) is converted to en_UTF-8 under Cygwin, special characters such as accented letters are translated into two octal-escaped values (e.g. à is…
NeronLeVelu
0
votes
2 answers

how to set mysql to use utf8 by default?

I would like our CentOS server's MySQL setup to use utf8 by default. Can you tell me which file(s) need to be configured and how? I know it's a pretty vague question but I'm not familiar with server management. Let me know if you need more…
pixeline
0
votes
1 answer

install mysql client and mysql-server with default charset utf8 to fedora 11

I only have utf8 characters to put into my db, so everything in the db is utf8. I tried 'yum install mysql mysql-server -with-charset=utf8' but it is not right. If I install using 'yum install mysql mysql-server' I get the default latin1…
Weiwei
0
votes
1 answer

UTF-8 encoding gone missing after enabling mysqlnd

I've enabled mysqlnd on my PHP 5.4.26 server using MariaDB 10.0.8. My SQL config contained this: [mysqlnd] character-set-server = utf8 collation-server = utf8_general_ci It was all working fine when no mysqlnd was enabled. Since enabling it all…
Spacedust
0
votes
1 answer

HTTP proxy to deal with different encoding

I am working on a web crawler, so I parse HTML pages. My problem is that sometimes the page encoding is not UTF-8 (ISO, exotic Windows[0-9], etc.) and my analyser fails. I tried many solutions in PHP/Java/NodeJS to convert the content but there is always…
Thomas Decaux
0
votes
2 answers

Setting locale in Ubuntu 12.04 (openvz)

I have a fresh installation of Ubuntu 12.04 as virtual openVZ guest. The defined locale for LC_* in basic setup is POSIX. I'd like to change this to en_US.UTF-8. So I installed language_pack_en_base and language_pack_en, ran dpkg-reconfigure locales…
0
votes
1 answer

Do SMTP servers implement 8BITMIME with anything other than UTF-8?

RFC 5321 specifies that the body of an email be encoded as 7-bit US-ASCII. The 8BITMIME extension allows the body to be encoded as 8-bit. An SMTP server can advertise that it supports 8BITMIME in its response to the EHLO…
james.garriss
0
votes
1 answer

MySQL: Specified key too long ..max is 1000 bytes?

I'm trying to create a table in MySQL with the statement: CREATE TABLE tags (Tag VARCHAR(200) NOT NULL default '', Login VARCHAR(255) NOT NULL default '', PRIMARY KEY (Tag, Login)) ENGINE=MyISAM; However I'm getting an error: ERROR 1071 (42000):…
Server Fault
0
votes
2 answers

Force .js files saved in ANSI encoding to show in UTF-8 on IIS 7.5

I'm migrating a web system that currently runs on Windows Server 2003 with IIS 6 to IIS 7.5 on Windows Server 2008. This system generates JavaScript files with accented characters in ANSI (Portuguese - Brazil). These scripts show, for example, alert…
Xcarpa
0
votes
2 answers

Why doesn't Firefox render this AWstats generated html?

XML Parsing Error: not well-formed Location: https://awstats.example.org/reports/www.example.org/2011/06/awstats.www.example.org.xml Line Number 603, Column 34: - Toile du Qu\uffffbec363363 The…
jldugger