0

With {pdftools} package, we can read pdf into R environment. But it reads by lines rather by blocks. So, when there are multiple columns, the result becomes a mess.

For example, we like to have it in this way. enter image description here

but it comes in this way enter image description here

Have tried to read it into data.table, and split it into 2 columns. But the attempt failed, since the code can't tell if the single space is for the words or the columns.

Please advise.

Grec001
  • 1,111
  • 6
  • 20
  • I'd found a post about this issue. Please find it here. https://stackoverflow.com/questions/72229791/scraping-two-column-pdf – Grec001 Aug 11 '23 at 03:35
  • the concept was easy that it uses long spaces as the separator. it's rigid however it seemed the only option we have for now. – Grec001 Aug 13 '23 at 05:13

0 Answers0