I am trying to read a PDF using Camelot library and store it to a dataframe. The resulting dataframe has garbled/bad characters in string fields.
Eg: 123Rise – Tower & Troe's Mech–
I want to remove ONLY the Garbled characters and keep everything else including symbols.
I tried regex such as these [^\w.,&,'-\s] to only keep desirable values. But I'm having to add every special character which need not be removed into this. I cannot ditch Camelot library as well.
Is there a way to solve this ??