Questions tagged [utf-8]

UTF-8 is a variable-width character encoding of the Unicode character set in which each code point is encoded as one to four bytes. Unlike some other encodings such as UTF-16, UTF-8 is backward compatible with 7-bit ASCII, and can be processed to some degree by applications that are only aware of bytes.
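
As a rough illustration of the ASCII compatibility and variable byte length described above, here is a minimal Python sketch. The sample characters, the request line, and the helper name `utf8_length` are chosen purely for illustration and do not come from any question on this page except the Cyrillic path, which is borrowed from one of the excerpts below.

```python
# Sample code points: ASCII letter, accented Latin letter (U+00E9),
# CJK ideograph (U+4E2D), and a supplementary-plane emoji (U+1F600).
samples = ["A", "\u00e9", "\u4e2d", "\U0001F600"]


def utf8_length(ch: str) -> int:
    """Return how many bytes UTF-8 uses to encode a single code point."""
    return len(ch.encode("utf-8"))


lengths = {ch: utf8_length(ch) for ch in samples}

# Pure ASCII text is byte-for-byte identical in ASCII and UTF-8.
ascii_text = "GET /index.html"
same_bytes = ascii_text.encode("ascii") == ascii_text.encode("utf-8")

# Byte-oriented code can split on ASCII delimiters safely, because bytes
# below 0x80 never occur inside a multibyte UTF-8 sequence.
path = "/категория/новина.html".encode("utf-8")
parts = path.split(b"/")

if __name__ == "__main__":
    print(lengths)
    print(same_bytes)
    print([p.decode("utf-8") for p in parts])
```

This is why tools that only understand bytes can often pass UTF-8 through untouched, while still being unable to count or compare characters correctly.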

Full support of UTF-8 for searching, collation, word parsing, etc. requires support for Unicode concepts such as characters, normalisation, and supplementary characters. Many application and OS problems with "special characters", such as accented European letters or the ideographs used in Japanese and Chinese, derive from mismatched character encodings.
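
The two failure modes mentioned above can be sketched in a few lines of Python. This is only an illustration under assumed inputs: the sample string and the variable names are hypothetical, and Latin-1 stands in for whichever wrong encoding a real system might assume.

```python
import unicodedata

original = "caf\u00e9"  # "café" with a precomposed e-acute

# Mismatched encodings: UTF-8 bytes misread as Latin-1 turn one
# accented letter into two unrelated characters (mojibake).
utf8_bytes = original.encode("utf-8")
garbled = utf8_bytes.decode("latin-1")

# If nothing else has altered the data, reversing the wrong step
# recovers the original text.
repaired = garbled.encode("latin-1").decode("utf-8")

# Normalisation: the same visible text can be stored as different
# code point sequences, which compare unequal until normalised.
composed = "\u00e9"      # single code point
decomposed = "e\u0301"   # "e" followed by a combining acute accent
equal_raw = composed == decomposed
equal_nfc = (
    unicodedata.normalize("NFC", composed)
    == unicodedata.normalize("NFC", decomposed)
)

if __name__ == "__main__":
    print(repr(garbled), repr(repaired))
    print(equal_raw, equal_nfc)
```

Searching and collation generally need to compare normalised text, which is something byte-level handling alone cannot provide.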

104 questions
1
vote
1 answer

What is the fastest way to convert MySQL columns to UTF8 from latin1 columns whose data is actually UTF8?

I have a MySQL table that has a number of columns with the latin1 character set. Because of driver issues, the columns actually contain utf8 data. I want to leave the data as-is but change the column character set to UTF8. Every solution I've…
Daniel Beardsley
  • 547
  • 6
  • 12
1
vote
1 answer

Cyrillic URLs in Apache access log

My website is configured to include Cyrillic URLs for SEO reasons, using some RewriteRules. It all works very well and the links look like this: http://site.com/категория/новина.html Now, the problem I have is that when Apache logs the request…
duckyflip
  • 111
  • 4
1
vote
2 answers

Informix "Database locale information mismatch"

I have Informix 11.5 running on my Win-2003 box, with a few databases running in it. System databases have locale en_us.819. My custom databases have locale en_us.57372 (UTF8). There is also an application deployed to JBoss 4.0.2 which has a few datasources…
lmmortal
  • 13
  • 1
  • 1
  • 3
1
vote
1 answer

Importing UTF8 data into MySQL from a file causes truncation

I have a file called person.sql with 1 line: INSERT INTO Person (sFullname) VALUES ('Thomas Ågren'); If I run the following command from the shell: mysql --default-character-set=utf8 -uusername -ppassword mydatabase < person.sql I get truncated…
Rob Sobers
  • 333
  • 1
  • 4
  • 13
1
vote
1 answer

How to serve HTTP filenames with special characters?

Take the following blog page as a case: http://www.roney.com.br/2010/06/20/estados-do-brasil-um-pais-que-precisa-se-unir/ (Careful: it has tons of YouTube videos embedded, so it loads slowly.) It is a Brazilian web page, written in Portuguese but hosted…
1
vote
2 answers

Using Chinese Characters With Mod_Rewrite

I'm trying to create a rule using Chinese characters. #RewriteRule ^zh(.*) /中文版$1 [L,R=301] creates error 500 when I change the file to UTF-8. #RewriteRule ^zh(.*) /%E4%B8%AD%E6%96%87%E7%89%88$1 [L,R=301] redirects to…
Moak
  • 734
  • 3
  • 10
  • 31
1
vote
1 answer

rclone quits FTP server after asking for UTF8

I need to programmatically copy data from an FTP server with rclone. The config is as simple as: [exampleftp] type = ftp host = ftp.example.com user = myuser pass = mypass If I try to copy anything, I get: Failed to create file system for…
cornuz
  • 437
  • 1
  • 7
  • 17
1
vote
0 answers

Mail bounces: SMTPUTF8 is required, but was not offered

My mail server periodically sends out a newsletter to my clients. This is handled by a Perl script on the server. A few of the recipients' names contain non-ASCII characters like ä, é, ö, etc. On my server, Postfix is set up with smtputf8_enabled =…
Zippy1970
  • 247
  • 1
  • 4
  • 12
1
vote
1 answer

utf-8 using exchange behind postfix workaround?

We have a problem with mail bouncing due to UTF-8. Our setup is smtpin (Postfix) ---> KEMP load balancer ---> Exchange 2013. We have a company that needs to send mail to some of our users, and they are using mailgun.com to send the mails. They are using…
Robert
  • 11
  • 1
0
votes
0 answers

Nginx to immediately drop connection if request contains UTF?

I run some web application behind nginx under heavy load. This application doesn't accept requests containing UTF characters. I want to somehow protect it from unwanted HTTP traffic. How can I do that?
0
votes
1 answer

Domino Server Incoming Mail Mime Encoding

I recently installed FP10 on the Domino 9 server and changed http.cnf to add AddType image/svg+xml .svg. I am not running IHS. Incoming mail, however, now has an encoded Subject: =?UTF-8?Q?Breaking_News_-_LeBron:_Trump_is_u?=…
teleman
  • 113
  • 5
0
votes
0 answers

URL with Hebrew chars returns gibberish (404) from Apache

I am moving a WordPress site with a WPML template to a new server. When I click on a URL that contains Hebrew letters I get this result from Apache (the URL on the new page is correct and displays the right letters in Hebrew): Not Found The requested URL…
Dani
  • 531
  • 3
  • 11
  • 24
0
votes
2 answers

re: UTF-8, PHP and XML Mysql

This is related to https://stackoverflow.com/questions/1791082/utf-8-php-and-xml-mysql, which I am still trying to get my head around. I have a couple of separate questions that will hopefully help me understand how to resolve the issues I am…
icelizard
  • 732
  • 3
  • 10
  • 20
0
votes
0 answers

Reverse proxy to force UTF8 requests upstream

I'm trying to debug a problem in a web application that I didn't write and cannot alter, which seems to have serious encoding problems whenever it receives requests that are not in UTF8. Now, to prove that this in fact is the problem (and to mitigate…
Matteo Italia
  • 385
  • 3
  • 15
0
votes
0 answers

IIS 6.0 UTF8 support for local email robért@domain.com

Is there any way for IIS 6.0 to handle the local part of the email with UTF-8 characters - robért@domain.com I have the correct code. mMailMessage.To.Add(New…
Moojjoo
  • 87
  • 1
  • 10