Questions tagged [mojibake]

Garbled text that is the result of bytes being decoded using an incorrect coding.

Mojibake is the phenomenon which occurs when text is decoded from a byte stream using the wrong character encoding, resulting in a sequence of characters which is unreadable. The term "mojibake" is derived from Japanese where it literally means "unintelligible sequence of characters".

Example mojibake:

اÙ"إعÙ"ان اÙ"عاÙ"Ù

References:

150 questions
3
votes
1 answer

Why can I not view foreign language characters in my mysql DB?

I am inserting the following characters into my DB: 汉字 / 漢字 This is the meta tag on the page that is inserting the characters: I have altered all the columns in my table that is…
zeckdude
  • 15,877
  • 43
  • 139
  • 187
3
votes
1 answer

Encoding issue of a character in utf-8

I get a link from a web page by using beautiful soup library through a.get('href'). In the link there is a strange character ® but when I get it became ®. How can I encode it properly? I have already added at the beginning of the page # -*- coding:…
Mazzy
  • 13,354
  • 43
  • 126
  • 207
3
votes
5 answers

Why are these strange characters appearing in mcrypt?

I encrypt and decrypt successfully, but when I decrypt the value, there appears strange characters at the end of the string, "���". The initial $_POST['value'] do not have any blank space or any strange character. How can I solve this? I encrypt…
David
2
votes
1 answer

Compiled C++ program riddled with mojibake?

#include "StdAfx.h" #include #include #include "mysql_connection.h" #include #include #include #include #include…
2
votes
1 answer

Passing UTF-8 string to f:viewParam in JSF

I tried to pass UTF-8 String as f:viewParam value but value shown as garbage string, i added EncodingFilter in web.xml for setting UTF-8 to request and response as below HttpServletResponse response = (HttpServletResponse)…
Ashkan
  • 320
  • 2
  • 17
2
votes
1 answer

Character encoding not working with PrimeFaces CellEditor component

I'm having some trouble editing a table using I use encoding='windows-1252' to be able to use Swedish characters (å, ä, ö). Creating an entity works fine but when I edit it in a using it commits characters…
Dennis S
  • 858
  • 1
  • 18
  • 32
2
votes
1 answer

Greek and special characters show as mojibake - how to decode?

I'm trying to figure out how to decode some corrupt characters I have in a spreadsheet. There is a list of website titles: some in English, some in Greek, some in other languages. For example, Greek phrase ΕΛΛΗΝΙΚΑ ΝΕΑ ΤΩΡΑ shows as ŒïŒõŒõŒóŒùŒôŒöŒë…
2
votes
1 answer

Python 3 - Hebrew coding problems

I have a server written in FastAPI, Python 3.8.13 that receives data through a form from an external service, which may include Hebrew letters. Until recently, the data arrived through a back proxy server that was written in Python 2.7 and…
Peleg
  • 119
  • 1
  • 1
  • 10
2
votes
1 answer

What causes the ГѓВ pattern in this Mojibake?

Google ГѓВ (UTF-8: D0 93 D1 93 D0 92) and you'll see a few examples of what seems to be Mojibake. A specific example is ö becoming ГѓВ¶. What kind of encodings did the original ö go through to become ГѓВ¶? How would I figure this out?
Aly
  • 847
  • 1
  • 6
  • 30
2
votes
2 answers

encoding question in perl

I have an encoding question and would like to ask for help. I notice if I choose "UTF-8" as encoding, there are (at least) two double quotes " and “. But when I choose "ISO-8859-1" as the encoding, I see the latter double quote becomes ¡°, or…
Qiang Li
  • 10,593
  • 21
  • 77
  • 148
2
votes
2 answers

python3 decode str to utf8

I have a str variable in Python 3.6, which is as following: \xc3\xa4\xc2\xb8\xc2\xad\xc3\xa5\xc2\x9b\xc2\xbd\xc3\xa6\xc2\xb0\xc2\x91\xc3\xa7\xc2\x94\xc2\x9f\xc3\xa9\xc2\x93\xc2\xb6\xc3\xa8\xc2\xa1\xc2\x8c I want to decode the str to chinese, I…
tdycss
  • 139
  • 2
  • 13
2
votes
1 answer

Certain Arabic text gets incorrectly shown while other Arabic text gets showed normally?

I'm developing an app with Arabic text in it.. My phone supports Arabic so the text gets displayed correctly.. the weird problem is that: if I copy an Arabic text that i want from a.txt file and put it into an EditText, the EditText displays weird…
Omar
  • 7,835
  • 14
  • 62
  • 108
2
votes
1 answer

unprintable python unicode string

I retrieved some exif info from an image and got the following: { ... 37510: u'D2\nArbeitsamt\n\xc3\x84nderungsbescheid' ...} I expected it to be { ... 37510: u'D2\nArbeitsamt\nÄnderungsbescheid' ... } I need to convert the value to a str, but i…
matlantis
  • 160
  • 5
2
votes
0 answers

Passing foreign characters via Socket.IO

I am doing a very simple thing, just sending a message to my Flask app via Socket.IO . It works like a charm with English, but some other languages break somewhere in the process. Minimal working example follows. testapp.py: from flask import…
2
votes
1 answer

UTF8 missmatch in script

I have issues with a Python script. I just try to translate some sentences with the google translate API. Some sentences have problems with special UTF-8 encoding like ä, ö or ü. Can't imagine why some sentences work, others not. If I try the API…
keschra
  • 289
  • 1
  • 3
  • 17
1 2
3
9 10