I have a PDF with about 50 pages of scanned tables. I need to get it into R so I can clean the data and export it as a .csv. I have experience scraping text-based PDFs with tabulizer, but I've never worked with scanned PDFs before, and tabulizer can't read them.
Looking around online, the farthest I've gotten is reading the scanned PDF into R as a single character object, but this scrambles the layout: the table columns end up misaligned and out of order. Even if the formatting were preserved, I don't know how I would turn the character object into a final data frame.
Is there a way to convert the scanned PDF into a readable (text-based) PDF so I can scrape it with tabulizer? Or is there another method for scraping scanned PDFs into tables?
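To make the second half of the question concrete, here is a rough sketch of one possible route: OCR each page to text, then split each text line into columns. It is not a tested solution for this file. It assumes the pdftools and tesseract packages are installed, that the OCR output separates columns with runs of two or more spaces (the preserve_interword_spaces option may or may not achieve that on a given scan), and that every table row has the same number of columns. The function names ocr_pdf_pages and text_to_df and the file name are made up for illustration.

```r
# Hypothetical helper: render each page to PNG, then OCR it.
# Assumes pdftools and tesseract are installed; not called below.
ocr_pdf_pages <- function(pdf_path, dpi = 300) {
  png_files <- pdftools::pdf_convert(pdf_path, format = "png", dpi = dpi)
  # Ask Tesseract to keep the gaps between words (assumption: this helps
  # keep columns apart; results depend on scan quality).
  engine <- tesseract::tesseract(
    "eng", options = list(preserve_interword_spaces = 1)
  )
  tesseract::ocr(png_files, engine = engine)  # one string per page
}

# Hypothetical helper: turn one page of text into a data frame.
# Lines that do not split into exactly n_cols fields are dropped.
text_to_df <- function(page_text, n_cols, header = TRUE) {
  lines <- trimws(strsplit(page_text, "\n", fixed = TRUE)[[1]])
  lines <- lines[nzchar(lines)]
  fields <- strsplit(lines, "[[:space:]]{2,}")
  fields <- fields[lengths(fields) == n_cols]
  if (length(fields) == 0) return(data.frame())
  mat <- do.call(rbind, fields)
  if (header) {
    df <- as.data.frame(mat[-1, , drop = FALSE], stringsAsFactors = FALSE)
    names(df) <- mat[1, ]
  } else {
    df <- as.data.frame(mat, stringsAsFactors = FALSE)
  }
  # Convert columns that look numeric; leave the rest as character.
  df[] <- lapply(df, utils::type.convert, as.is = TRUE)
  df
}

# Made-up text standing in for one page of OCR output.
sample_text <- "Name   Year   Value\nAlpha  2019   1.5\nBeta   2020   2.25\nnoise line\n"
df <- text_to_df(sample_text, n_cols = 3)
print(df)

# With a real file (not run here):
# pages  <- ocr_pdf_pages("scanned_tables.pdf")
# tables <- lapply(pages, text_to_df, n_cols = 3)
# write.csv(do.call(rbind, tables), "tables.csv", row.names = FALSE)
```

If the columns still run together after OCR, tesseract::ocr_data() returns each word with its bounding box, which might allow grouping words into columns by horizontal position instead of by spacing.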