When building websites for non-english speaking countries
you have tons of characters that are out of the scope.
For the database I usally encode it on either utf-8 or latin-1.
I would like to know if there is any issue with performance, speed resolution, space optimization, etc.
For the fixed texts that are on the html between using for example
á or á
which looks exactly the same: á or á
The things that I have so far for using it with utf-8:
Pros:
- Easy to read for the developers and the web administrator
- Only one space ocupied on the code instead of 4-5
- Easier to extract an excerpt from a text
- 1 byte against 8 bytes (according to my testings)
Cons:
- When sending files to other developers depending on the ide, softwares, etc that they use to read the code they will break the accent in things like: é
- When an auto minification of code occurs it sometimes break it too
- Usually breaks when is inside an encoding
The two cons that I have a bigger weight than the pros by my perspective because the reflect on the visitor.