I have a dataset from Vietnam. But when I read it in R, the string variables are imported incorrectly. I used stri_trans_general
from the stringi package, however it works on only a few columns.
I checked the raw dataset and it seems those few columns were broken when the dataset was exported, looking e.g. like this:
"Du?c ch?t m?i"
I thus obtain strings with ?
or >
characters instead of actual words.
How I can repair these words in order to obtain the correct Vietnamese words?