I am about to convert a series of MySQL databases to the utf8mb4 character set. Most of the databases are using the utf8 character set.
This article points out two potential problem areas that should be addressed prior to conversion:
- Character maximum lengths of string types are reduced (example: tinytext drops from 85 to 63 characters)
- Index key maximum character widths are reduced (191 down from 255) because maximum key lengths are 767 bytes. This maximum was raised to 3072 bytes in MySQL 5.7.7 according to this page but the situation would still occur if keys are larger than 768 characters.
Are there scripts or programs that exist that I can use to check the databases for these issues? I would like to find all of the potential problems prior to attempting the conversion.
Follow-up question: Are there any other scenarios that should be identified prior to a utf8mb4 character set conversion?