I want to extract from pdf but pypdf2
doesn't extract all the information and textract
was unable to install in 3.7 due to following error:
UnicodeDecodeError: 'charmap' codec can't decode byte 0x8d in position 1671: character maps to <undefined>