Questions tagged [diacritics]

A Diacritic is "a mark near or through an orthographic or phonetic character or combination of characters indicating a phonetic value different from that given the unmarked or otherwise marked element" -- Merriam-Webster

From Wikipedia:

A diacritic (/daɪ.əˈkrɪtɨk/; also diacritical mark, diacritical point, diacritical sign) is a glyph added to a letter, or basic glyph. The term derives from the Greek διακριτικός (diakritikós, "distinguishing"). Diacritic is both an adjective and a noun, whereas diacritical is only an adjective. Some diacritical marks, such as the acute ( ´ ) and grave ( ` ) are often called accents. Diacritical marks may appear above or below a letter, or in some other position such as within the letter or between two letters.

The main use of diacritics in the Latin alphabet is to change the sound value of the letter to which they are added. Examples from English are the diaeresis in naïve and Noël, which show that the vowel with the diaeresis mark is pronounced separately from the preceding vowel; the acute and grave accents, which indicate that a final vowel is to be pronounced, as in saké and poetic breathèd, and the cedilla under the "c" in the borrowed French word façade, which shows it is pronounced /s/ rather than /k/. In other Latin alphabets, they may distinguish between homonyms, such as French là "there" versus la "the," which are both pronounced [la]. In Gaelic type, a dot over consonants indicates lenition of the consonant in question. In other alphabetic systems, diacritics may perform other functions. Vowel pointing systems, namely the Arabic harakat ( ـَ, ـُ, ـُ, etc.) and the Hebrew niqqud ( ַ, ֶ, ִ, ֹ , ֻ, etc.) systems, indicate sounds (vowels and tones) that are not conveyed by the basic alphabet. The Indic virama ( ् etc.) and the Arabic sukūn ( ـْـ ) mark the absence of a vowel. Cantillation marks indicate prosody. Other uses include the Early Cyrillic titlo ( ◌҃ ) and the Hebrew gershayim ( ״ ), which, respectively, mark abbreviations or acronyms, and Greek diacritics, which showed that letters of the alphabet were being used as numerals.

In orthography and collation, a letter modified by a diacritic may be treated either as a new, distinct letter or as a letter–diacritic combination. This varies from language to language, and may vary from case to case within a language.

In some cases, letters are used as "in-line diacritics" in place of ancillary glyphs, because they modify the sound of the letter preceding them, as in the case of the "h" in English "sh" and "th".

More information

1105 questions
-1
votes
2 answers

match special characters from input to stored in database

I saved some German characters on the database by entity (eg. föo => föo). Now I have a query that search for a match just using LIKE %search_word%. The search_word will probably be an input containing special German characters. Can you suggest…
Leandro Garcia
  • 3,138
  • 11
  • 32
  • 44
-2
votes
1 answer

Remove accents in python

I'm trying to remove all the accents in a string in python using unidecode and it work pretty well import unidecode print(unidecode.unidecode('ááíãôç')) it returns aaioac The problem is that i need to keep the 'ç' character aaiaoç Is there some…
-2
votes
1 answer

Badly displayed accented characters

On a CMS TYPO3 website I created 15-20 years ago and have been updating regularly, a few days ago, noticed a new problem: that the accented characters were all displayed badly. I hadn't changed or updated anything for a few weeks. How do I get…
-2
votes
1 answer

Filter search with Javascript, character with accent equal to coomon

I´m doing something just like this example Is there a way to equal on the search the characters with or without accents? example áéíóú equal to aeiou.
-2
votes
1 answer

Convert Accents to raw data SQL

I have a column( BLOB data) in which accents are used. I would like to search for all the rows have a particular accent( not all the accents and replace them, a particular one). Suppose I want to search l'application Data I have used below query but…
-2
votes
2 answers

Codeigniter + Mysql = accents not being saved properly

I have a database that I run using MySql and codeigniter that has a mix of english and french names in it. The french names, like Andrée, come back as Andrée, and other various obviously incorrect. How do I prevent this from happening, and that we…
JonYork
  • 1,223
  • 8
  • 31
  • 52
-3
votes
1 answer

Virtual key codes

I have a c++ console program. Ho can I simulate the "é" character ? Code: // Set up a generic keyboard event. ip.type = INPUT_KEYBOARD; ip.ki.wScan = 0; // hardware scan code for key ip.ki.time = 0; ip.ki.dwExtraInfo = 0; ip.ki.wVk = 0x45;…
T Jayceon
  • 11
  • 4
-3
votes
2 answers

Remove certain characters from a string with JavaScript

I wrote a function in JavaScript, that replaces accents in a string to certain characters. function textToURL(str) { str = str.replace(/á/gi,'a'); str = str.replace(/é/gi,'e'); str = str.replace(/í/gi,'i'); str =…
Kobanric
  • 23
  • 3
-4
votes
1 answer

I need to replace all ü Characters with ue from my Textfile. If Im trying this I get "ou out. How can I remove the " from ou?

I need to replace all ü Characters with ue from my Textfile. I tried the following: set -e 's/ü/\"ue/g Output: "ue If im writing it like this I became E: set -e 's/ü/\ue/g Output: E My actual problem is: How can I remove the " from "ue?
1 2 3
73
74