Questions tagged [cp1251]

Windows-1251 (a.k.a. code page CP1251) is a popular 8-bit character encoding, designed to cover languages that use the Cyrillic script such as Russian, Bulgarian, Serbian Cyrillic and other languages. It is the most widely used for encoding the Bulgarian, Serbian and Macedonian languages. In modern applications, Unicode is a preferred character set.

Windows-1251 or code page CP1251 is a popular 8-bit character encoding, designed to cover languages that use the Cyrillic script such as Russian, Bulgarian, Serbian Cyrillic and other languages. It is the most widely used for encoding the Bulgarian, Serbian and Macedonian languages. In modern applications, Unicode is a preferred character set.

Source and more information: Windows-1251 on the English Wikipedia

76 questions
1
vote
2 answers

Builder c++ Rave Reports encoding problem with cyrillic

When i try save rave project in pdf\html file, have incorrect encoding. When choose format and press SAVE, it ussually save in iso-8859-1 code. But i need cp1251 (cyrillic). For example "Ïëîùàäü" instead of "Площадь".
1
vote
1 answer

blob with conversion to 8bit cp1251 or cp1252

I need a solution with encoding utf to 8-bit cp1251 or cp1252 using blob I managed to change the https://github.com/b4stien/js-csv-encoding including windows 1251, but there are insoluble problems: Unfortunately noscript does not allow loading…
nnovic
  • 11
  • 1
  • 5
1
vote
1 answer

Sending "windows-1251"-encoded string in JSON from python to javascript

What I need to do is best descriped as example. Previously, I had the following code: content = u'\n' + ... # with open(file_name, 'w') as f: f.write(content.encode('cp1251')) f.close; Now I…
Edgar Navasardyan
  • 4,261
  • 8
  • 58
  • 121
1
vote
1 answer

How to make Cyrillic characters display correctly when converting to PDF with itextpdf?

Does any body knows how to make itextpdf work with Cyrillic symbols? I have code: Font normal = FontFactory.getFont(String.valueOf(Font.FontFamily.HELVETICA), "CP1251", false, 13); Document doc = new Document(PageSize.A4, 36, 36, 36, 65); …
Lena
  • 99
  • 4
  • 14
1
vote
1 answer

Python: decode russian string

I recieved a list of tuples from a mySQL database. When I try to print an item, here is the result: Далоев ÐлекÑандр This is cp1251, according to https://2cyr.com/decode/?lang=ru I have tried lots of variations of…
Snobby
  • 1,067
  • 3
  • 18
  • 38
1
vote
1 answer

Parsing cp1251 pdf to text in python

Is there any way to extract text from the pdf file with russian text (cp1251)? For parsing pdf files I am using pdfminer package. I tried to specify encoding in the argument to pdfminer.converter.TextConverter class but it didn't help.
1
vote
2 answers

How to transcode Windows-1251 to UTF-8?

How to transcode Windows-1251 to UTF-8? Will such function do it? function win_to_utf($s) { for($i=0, $m=strlen($s); $i<$m; $i++) { $c=ord($s[$i]); if ($c<=127) {$t.=chr($c); continue; } if ($c>=192 && $c<=207) {$t.=chr(208).chr($c-48);…
Rella
  • 65,003
  • 109
  • 363
  • 636
1
vote
1 answer

case sensitive LIKE on VARCHAR field with cp1251 encoding

I have an InnoDB table with VARCHAR(250) cp1251_general_ci field named comment. I'm trying to search on this field, case sentative. SELECT comment FROM body_legend WHERE comment LIKE '%ТТ%' GROUP BY comment works as expected, but its case…
Sergey Novikov
  • 4,096
  • 7
  • 33
  • 59
1
vote
1 answer

Base64 String to Windows1251 (cyrillic symbols)

I have a trouble to convert email attachment(simple text file in windows-1251 encoding with latin and cyrillic symbols) to String. I.e I have a problem with converting cyrillic. I got attachment file as base64 encoded String like…
user3283133
  • 11
  • 1
  • 3
1
vote
1 answer

Parsing different unicode files using BeautifulSoup

I have this particular HTML page having codec Now When I am trying to parse this particular file using BeautifulSoup, it always returns NULL object. I can convert it…
Koustav
  • 733
  • 1
  • 6
  • 21
1
vote
0 answers

UTF-8 symbols and cp1251 site

There is a bunch of old sites created with cp1251 charset. I've been asked to enhance its functionality by implementing support for non-cp1251 symbols for user input. It's a bit too late to convert everything to utf-8, because there is lots of old…
CamaroSS
  • 493
  • 2
  • 6
  • 16
1
vote
1 answer

Converting between ISO-8559-1 and cp1251

My Android app uses an open-source library that only accepts text data in an ISO-8859-1 encoding. I have a few users from Eastern Europe who would like to enter cp1251-encoded text. This seems to be a limitation of the open-source library, as…
ktambascio
  • 434
  • 4
  • 17
1
vote
1 answer

Mobify.js build encoding

How can i do "mobify build" with result encoding WINDOWS-1251? By default the character set of mobify.js is UTF-8, but the character set in the client site is WINDOWS-1251. If I use iconv, enconv etc. then it helping me with setting encoding is…
1
vote
2 answers

ITextSharp: parse html with cyrillic/international words

I try to parse html file and to generate pdf. I use code document.Open(); HtmlPipelineContext htmlContext = new HtmlPipelineContext(null); htmlContext.SetTagFactory(Tags.GetHtmlTagProcessorFactory()); ICSSResolver cssResolver =…
PlushEngineCell
  • 775
  • 1
  • 7
  • 14
0
votes
0 answers

Troubles with CP1251 printed file from Word

I have a bunch of PDF (1.4) files printed from Word with Adobe Distiller 6. Fonts are embedded (Tahoma and Times New Roman, which I have on my Linux machine) and encoding says "ANSI" and "Identity-H". Now by ANSI, I assume that regional code-page is…
theta
  • 24,593
  • 37
  • 119
  • 159