
I have a PDF file written in Russian. I am trying to convert the table in the PDF to a CSV file. I am able to create the CSV file, but the text in it is garbled: the Russian characters come out as question marks. I used this Python code:

    import tabula

    df = tabula.read_pdf("IPLmatch.pdf", pages='all', encoding='UTF-8')
    print(df)
    df.to_csv('output.csv')

The result I am getting is something like this:

    ???????, ... ???????? ???????? ??????????? ?????????,
    0 ??? ???????? ... ??????????? ? ???????????
    1 NaN ... NaN
    2 NaN ... ??? ???????? ??????? ??????
    3 NaN ... ???????????? (??.?) ?????????-
    4 NaN ... ???

The expected output is the DataFrame (and the CSV file) with the text in Russian.
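For illustration, here is a minimal sketch of one way question marks like these can arise. It assumes (this is not confirmed by the post) that somewhere in the pipeline the Cyrillic text is encoded with a Western code page such as cp1252, which has no Russian letters. The helper names `write_rows_utf8` and `read_rows_utf8` are hypothetical and use only the standard library, not tabula.

```python
import csv

russian = "Привет"

# Assumption: the text passes through a charset without Cyrillic letters
# (cp1252 is used here as an example). Each letter is replaced by "?".
garbled = russian.encode("cp1252", errors="replace").decode("cp1252")

# UTF-8 can represent Cyrillic, so the text survives a round trip.
intact = russian.encode("utf-8").decode("utf-8")


def write_rows_utf8(rows, path):
    """Write rows to a CSV file as UTF-8 with a BOM (hypothetical helper)."""
    with open(path, "w", newline="", encoding="utf-8-sig") as f:
        csv.writer(f).writerows(rows)


def read_rows_utf8(path):
    """Read the rows back using the same encoding (hypothetical helper)."""
    with open(path, newline="", encoding="utf-8-sig") as f:
        return list(csv.reader(f))
```

If that assumption holds, two things may be worth trying: tabula-py's `read_pdf` accepts a `java_options` argument, and passing `java_options="-Dfile.encoding=UTF8"` is a commonly suggested way to make the Java side emit UTF-8; and pandas' `to_csv` accepts an `encoding` argument, where `"utf-8-sig"` tends to help spreadsheet programs detect the encoding. Whether either resolves this particular file is untested here.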

  • Please clarify your specific problem or provide additional details to highlight exactly what you need. As it's currently written, it's hard to tell exactly what you're asking. – Community Sep 13 '22 at 09:16

0 Answers