Questions tagged [transliteration]

Transliteration refers to the process of mapping letters or glyphs from one character encoding to another

Transliteration is the conversion of letters from one alphabet to another one, like from Greek to Latin. But it may as well be just a simplification within one alphabet, for example omitting any diacritics found in that alphabet or substituting special characters with a sequence of characters without diacritics.

257 questions
3
votes
1 answer

Latin<->Han Conversion in ICU?

I am just getting started implementing ICU transforms using ICU4C in a C++ program. I am particularly looking at transliteration to and from Chinese. According to this document, the package supports both "Han-Latin" and "Latin-Han" conversion. As…
NatHillard
  • 306
  • 2
  • 10
3
votes
1 answer

Japanese Transliteration in Node.js and Kakasi

I have written a little wrapper for Kakasi that is like the following: Kakasi.prototype.transliterate = function (data) { var self = this; return new Promise(function (resolve, reject) { var args; …
loretoparisi
  • 15,724
  • 11
  • 102
  • 146
3
votes
2 answers

map from Char to String in Haskell

I'm trying to figure out how to improve the transliteration from German umlauts to ASCII for id identifiers in Pandoc. Currently there is only a mapping Char -> Maybe Char, that converts ä into a and ß into Nothing etc., but the most common…
Wolf
  • 9,679
  • 7
  • 62
  • 108
3
votes
0 answers

Reverse Transliteration from Language X to English

I wanted to convert text in an Indian regional language "Kannada" to it's corrusponding pronunciation in English. E.g. the Kannada word ಅಂಗಸನ್ನೆ would be transliterated to "aṅgasanne" in English. I believe this process is called reverse…
Ajay H
  • 794
  • 2
  • 11
  • 28
3
votes
1 answer

Emacs transliteration?

Is there some way to get transliteration to work in Emacs, sort of like it does in Gmail now? I am particularly interested in getting it to work in Cyrillic. For reference, Gmail does something like the following: I can type svoboda and it will…
Tikhon Jelvis
  • 67,485
  • 18
  • 177
  • 214
3
votes
1 answer

Transliterate Removal of Symbols

I am trying to transliterate ®, ©, ', and ™ into blank characters meaning completely removing them when they are slugified. The following is what I tried to do: var tr = require('transliteration'); var slugify =…
mayvn
  • 191
  • 1
  • 9
3
votes
2 answers

Python - Transliterate German Umlauts to Diacritic

I have a list of unicode file paths in which I need to replace all umlauts with an English diacritic. For example, I would ü with ue, ä with ae and so on. I have defined a dictionary of umlauts (keys) and their diacritics (values). So I need to…
Crazy Otto
  • 125
  • 2
  • 13
3
votes
4 answers

Convert text to and from Serbian cyrillic letters

How do I add the Serbian Cyrilic alphabet into my HTML, to make my browser recognize it? I need to, for example, make “Povrce” into “Поврће”. I just need a code so when I type “Поврће” or “Povrće”, the browser can show it.
Ivan
  • 816
  • 2
  • 9
  • 14
3
votes
1 answer

Google AJAX Transliteration API: Is it possible to make all input fields in the page transliteratable?

I've used "Google AJAX Transliteration API" and it's going well with me. http://code.google.com/apis/ajaxlanguage/documentation/referenceTransliteration.html Currently I've a project that I need all input fields in every page (input & textarea tags)…
Omranic
  • 1,392
  • 3
  • 17
  • 32
3
votes
3 answers

Transliteration in Java. Redefine each char in a string

The aim of a method is a transliteration of strings, like: афиваў => afivaw. The problem is: I cannot use charAt method to redefine because there are some letters that demand to be transliterated as two symbols 'ш' => "sh". I try this: public…
Rudziankoŭ
  • 10,681
  • 20
  • 92
  • 192
3
votes
2 answers

Can I use the y operator to do a non-one-to-one transliteration in Perl?

The y operator in Perl does character-by-character transliteration. For example, if we do y/abc/dfg to the string "foobar", we get "foofdr". But what if I want to transliterate "ā" to "ei" and "ä" to "a:" and "ō" to "әu" and "o" to "ɒ". I tried the…
Mike
  • 1,841
  • 5
  • 24
  • 34
3
votes
3 answers

Ruby character transliteration

What's the current best way to transliterate characters to 7-bit ASCII in Ruby? Most of questions I've seen on SO are 3 or 4 years old and the solutions don't fully work. I want a method that will work for a wide range of Latin alphabets and, for…
Old Pro
  • 24,624
  • 7
  • 58
  • 106
3
votes
3 answers

Using ICONV on mysql select - transform all characters to standard ascii

I want to search in my database for polish cities which commonly consist of a lot of special characters. I want to migrate those to ASCII, so that even those without a polish keyboard can do a regular search. In PHP it looks like $ascii =…
MatthiasLaug
  • 2,924
  • 6
  • 28
  • 43
3
votes
2 answers

php preg_grep and umlaut/accent

I have an array that consists of terms, some of them contain accented characters. I do a preg grep like this $data= array('Napoléon','Café'); $result = preg_grep('~' . $input . '~i', $data); So if user type in 'le' I would also want the result …
user1906418
  • 139
  • 1
  • 9
3
votes
2 answers

How can I transliterate Hindi text between Hindi and Latin characters using Java?

How to convert a hindi meanig written in english alphabet to Hindi using Java? eg. Input text is: anil NE lath marke apko Ganga me hi Fenk diya. in Hindi Output text is: अनिल ने लात मार्के आपको गंगा में ही फेंक दिया How to convert using…
Sonia Gupta
  • 95
  • 2
  • 9