Questions tagged [utf-8]

UTF-8 is a variable-width character encoding of the Unicode character set in which each code point is encoded as one to four bytes. Unlike some other encodings such as UTF-16, UTF-8 is backward compatible with 7-bit ASCII: every ASCII file is also a valid UTF-8 file. As a result, UTF-8 text can be processed to some degree by applications that are only aware of bytes.
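As a rough illustration of the description above, the following Python sketch prints how many bytes a few sample characters occupy in UTF-8 and checks that pure ASCII text produces identical bytes in both encodings. The helper name `utf8_lengths` and the sample characters are chosen here purely for illustration.

```python
def utf8_lengths(text):
    """Return the number of UTF-8 bytes used by each character in text."""
    return [len(ch.encode("utf-8")) for ch in text]

# Sample characters: 'A', e-acute, euro sign, and an emoji outside the BMP.
samples = ["A", "\u00e9", "\u20ac", "\U0001F600"]

for ch in samples:
    encoded = ch.encode("utf-8")
    print(f"U+{ord(ch):04X} -> {len(encoded)} byte(s): {encoded.hex()}")

# ASCII-only text encodes to exactly the same bytes in ASCII and UTF-8,
# which is why byte-oriented tools can often handle UTF-8 unchanged.
ascii_text = "plain ASCII"
same_bytes = ascii_text.encode("ascii") == ascii_text.encode("utf-8")
print(same_bytes)
```

This is also why a tool that inspects only bytes may report a UTF-8 export as plain ASCII when the data happens to contain no characters outside the ASCII range.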

Full support of UTF-8 for searching, collation, word parsing and similar tasks requires support for Unicode concepts such as code points, normalisation and supplementary characters. Many application and OS problems with "special characters", such as accented European letters or the ideographs used in Japanese and Chinese, derive from mismatched character encodings: text written in one encoding is read as if it were in another.
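A minimal Python sketch of two of these issues, assuming Latin-1 as the mismatched encoding: decoding UTF-8 bytes with the wrong codec produces the familiar "strange characters", and two visually identical strings compare equal only after normalisation. The function names `mojibake` and `same_text` are hypothetical helpers, not part of any library.

```python
import unicodedata

def mojibake(text, wrong_encoding="latin-1"):
    """Encode text as UTF-8, then decode the bytes with the wrong codec."""
    return text.encode("utf-8").decode(wrong_encoding)

def same_text(a, b):
    """Compare two strings after normalising both to NFC."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)

# e-acute stored as UTF-8 (bytes C3 A9) but read as Latin-1
# shows up as two unrelated characters.
garbled = mojibake("\u00e9")
print(garbled)

# Precomposed e-acute versus 'e' followed by a combining acute accent:
# different code point sequences, same text once normalised.
precomposed = "\u00e9"
decomposed = "e\u0301"
print(precomposed == decomposed)
print(same_text(precomposed, decomposed))
```

Normalising to a single form (NFC here) before comparing or indexing is one common choice; which form is appropriate depends on the application.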

104 questions
0
votes
2 answers

LINUX: Can a file which is shown as ASCII text contain UTF-8 data

Linux version: Red Hat Enterprise Linux ES release 4. I need to confirm whether an extract from a database has been correctly output with UTF-8 encoding. I created the file using the mechanism specified by the database vendor, but when I did $ file…
SQLMike
0
votes
1 answer

Unable to change encoding to UTF-8 in Open Office

I see the following when I look at a document in Finnish: http://files.getdropbox.com/u/175564/oo-bug-finnish.png (screenshot). The problem seems to be in the encoding. I am not sure which encoding OO uses by default. I can see the documents in terminal…
0
votes
1 answer

Got error while encoding text in linux

I am using Python code to encode text before inserting it into a database on a remote server (Linux). Since the text contains special characters, I need to encode them to Unicode. But I got the following error while encoding the text. UnicodeDecodeError: 'ascii' codec…
khati
0
votes
2 answers

Ubuntu 8.04LTS Folder name encoding issue

I will explain my problem from the beginning. I have an Ubuntu server 8.04LTS and I use a Java application with Tomcat 6.0.18 and JRE 1.6. The problem is that characters such as (é è ç û) are displayed as other, strange characters. I…
ME-KJ
0
votes
1 answer

SQL Error (1064) when importing data from SQL file

I have a MySQL database, which was originally set up with the default latin1 character set and latin1_swedish_ci collation. I was using the database like this for some time, until I noticed strange characters on my production web site, which is…
unpossible
0
votes
1 answer

How to read Chinese character from MySQL to PHP page by resin server?

My data is in Simplified Chinese, stored in a MySQL database with all tables and the character set set to UTF-8. I am reading it into my PHP page, which runs under a Resin server on Windows XP Professional. My Resin server version is 4.0.15; PHP version…
HexaHow
0
votes
1 answer

Ubuntu Server 10.10: Preventing a serial console from using UTF-8

I have an Ubuntu 10.10 Server box with a Digital VT220 connected over a serial line. (That's a real VT220, not a virtual one. An actual beige CRT thing.) It works great apart from the fact that Ubuntu is sending it UTF-8, so occasionally you get a…
David Glover
0
votes
1 answer

mount and sync non english folder & file names

I'm trying to rsync a folder whose name contains non-English characters, and it breaks the whole rsync. How can I copy folders even if their names contain non-English characters?
amirash
0
votes
1 answer

Problem with diacritics on psql 9.0 (PostgreSQL)

I have two instances of PostgreSQL installed on my server: 8.3 and 9.0. There seems to be a problem with Polish diacritic characters (like ółęąśżźć) in the PostgreSQL 9.0 client, psql. When I connect to a DB (either 8.3 or 9.0) with psql 8.3 - I can…
grzaks
0
votes
3 answers

All password with '$' inside (phpmyadmin) won't work [UTF-8 problem]

I set up a dedicated server by following a tutorial. In PHP I set: mbstring.language=UTF-8 mbstring.internal_encoding=UTF-8 mbstring.http_input=UTF-8 mbstring.http_output=UTF-8 mbstring.detect_order=auto But each time there is a $ in the password (i…
Tristan
0
votes
1 answer

How can I convert a large ANSI text file to UTF-8?

For a database migration I need a tool which can convert a 1 GB file from Ansi codepage to UTF8 on Windows. Maybe I can use sed (given that I know all search and replace values), but is there something included in GNU or Windows tools?
mjn
0
votes
1 answer

sql and web encoding problem

Guys, I've got an encoding problem I believe. I have upgraded from phpBB2 to phpBB3. The old databases were in latin1, the new ones have utf8 encoding. Already during the upgrade process some rows of the DB were only read partly into the new…
Marki
0
votes
0 answers

Apache reverse proxy does not encode comma in UTF-8

Server version: Apache/2.4.6 (CentOS) Server built: Nov 5 2018 01:47:09 centos-release-7-6.1810.2.el7.centos.x86_64 We are trying to solve a reverse proxy problem and we need your help. The Apache reverse proxy should send a %2C in the URL but instead sends a…
Mina
-1
votes
1 answer

filenames, ASCII unicode escaped sequences to UTF8

I'm not sure if I've grasped the issue here so if I haven't just say so and I'll edit the title. My problem is the following: I have an Ubuntu 12.04 server (UTF-8 locale) to which users upload files via a web app or through shell. So I have no…
D.Mill