I have a long list of names in a spreadsheet that I am working with in R. There are a few classic issues with names (corporate or personal), as in the example below.
DU PONT JEAN
DUPONT JEAN
DUPON T JEAN
DUPONT JEAN
DUPONT J
DU-PONT JEAN
DU POTN JEAN
I am trying to fix a few things, such as removing spaces within names or keeping only the first letter of the first name, but the results are not very satisfying.
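To make the kind of cleanup I mean concrete, here is a minimal base-R sketch using the example names above. It is only an illustration under assumptions: `normalize_name` is a hypothetical helper, and the distance threshold of 3 is an arbitrary choice that would need tuning on real data. It strips everything except letters and then groups the results by edit distance with `adist()` and `hclust()`.

```r
# Example names from the list above
names_raw <- c("DU PONT JEAN", "DUPONT JEAN", "DUPON T JEAN", "DUPONT JEAN",
               "DUPONT J", "DU-PONT JEAN", "DU POTN JEAN")

# Hypothetical helper: upper-case, then drop every character that is not A-Z
# (this removes spaces and hyphens, and would also drop accented letters)
normalize_name <- function(x) {
  gsub("[^A-Z]", "", toupper(x))
}

keys <- normalize_name(names_raw)

# Pairwise generalized Levenshtein distances between the cleaned keys
d <- adist(keys)

# Single-linkage clustering; names whose chain of distances stays within
# the (assumed) threshold of 3 end up in the same group
groups <- cutree(hclust(as.dist(d), method = "single"), h = 3)

data.frame(name = names_raw, key = keys, group = groups)
```

With a threshold this loose, short names that are genuinely different could be merged, and single linkage can chain unrelated names together, which is part of why I am not satisfied with this kind of approach.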
As these are very common issues, I wonder whether there is a piece of code or a package to deal with them?