ASCII non ASCII translation Python 2.7

Question

i got a problem trying to encore non ASCII characters. I have this function :

#function to treat special characters 
tagsA=["À","Á","Â","à","á","â","Æ","æ"]
tagsC=["Ç","ç"]
tagsE=["È","É","Ê","Ë","è","é","ê","ë"]
tagsI=["Ì","Í","Î","Ï","ì","í","î","ï"]
tagsN=["Ñ","ñ"]
tagsO=["Ò","Ó","Ô","Œ","ò","ó","ô","œ"]
tagsU=["Ù","Ú","Û","Ü","ù","ú","û","ü"]
tagsY=["Ý","Ÿ","ý","ÿ"]

def toASCII(word):
    for i in range (0, len(word),1):
        if any(word[i] in s for s in tagsA):
           word[i]="a"
        if any(word[i] in s for s in tagsC):
           word[i]="c"
        if any(word[i] in s for s in tagsE):
           word[i]="e"
        if any(word[i] in s for s in tagsI):
           word[i]="i"
        if any(word[i] in s for s in tagsN):
           word[i]="n"
        if any(word[i] in s for s in tagsO):
           word[i]="o"
        if any(word[i] in s for s in tagsU):
           word[i]="u"
        if any(word[i] in s for s in tagsY):
           word[i]="y"
    print word
    return word

i get this error usually : UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 1: ordinal not in range(128)

tried to change encoding to utf8 but it doesn't change the issue.

 # -*- coding: utf-8 -*-

@ArnavBorborah well i treat character by character for the whole string, you can leave the step blank (it will be set to 1 automatically) — Samuel, Feb 26 '18 at 13:36
This is a pretty poor approach. You will need to handle A LOT of edge cases. Python has support for this built in. Why do you not use those functions? — OptimusCrime, Feb 26 '18 at 13:37
The UnicodeDecodeError is *by far* not the only thing wrong with this. — Jongware, Feb 26 '18 at 13:40

score 2 · Answer 1 · answered Feb 26 '18 at 13:43

2

You can use the unicodedata module to remove all the accents from string.

Ex:

import unicodedata
print unicodedata.normalize('NFKD', u"ÀÁ").encode('ASCII', 'ignore')

Output:

AA

answered Feb 26 '18 at 13:43

Rakesh

81,458
17
76
113

ASCII non ASCII translation Python 2.7

1 Answers1