Questions tagged [pdfparser]

a standalone PHP library, provides various tools to extract data from a PDF file

See https://github.com/smalot/pdfparser

39 questions
0
votes
1 answer

PdfParser in Codeigniter

I'm trying to use the PdfParser library ( PDFparser link ) in Codeigniter, so first of all I have downloaded, unzipped and installed the library under third_party/vendor folder, additional I have downloaded, unzipped and installed the tcpdf library…
Federico
  • 319
  • 2
  • 14
0
votes
0 answers

search pdf files content in large folder using php and pdf parser library

I'm using pdf parser to search in folder contains a lot of pdf files to search within pdf files content. the code is working fine for only maximum 3 files with small sizes: $keyword = "Calibri"; //the keyword is dynamic $dir = new…
hany
  • 49
  • 1
  • 10
0
votes
1 answer

How to read from PDF using Selenium webdriver and Java

I am trying to read the contents of a PDF file using Java-Selenium. Below is my code. getWebDriver is a custom method in the framework. It returns the webdriver. URL urlOfPdf = new URL(this.getWebDriver().getCurrentUrl()); BufferedInputStream…
0
votes
0 answers

Illegal string offset '/Root' in FPDF FPDI Parser issue

I have multiple pdf files of one user. So, I have given an option to our user to see all his document in a single PDF file. For this, I am using FPDI pdf parser to merge all pdf files in a single file and then show to user. But I am getting below…
Dhirender
  • 604
  • 8
  • 23
0
votes
1 answer

Continue a script after an exception is thrown PHP

I'm using the PDFParser and encountering a thrown exception that breaks my script even if I put it in a try/catch block like below. Meaning, the exception gets echoed but the "Caught exception: " part does not. I thought this was exactly what…
tim peterson
  • 23,653
  • 59
  • 177
  • 299
-1
votes
1 answer

java.lang.UnsatisfiedLinkError: no lcms in java.library.path: [/usr/lib/jvm/java-11-openjdk/lib/server

I am using PDF parser class from apache tika parser jar which is working fine with openjdk 8 but same code is failing when i have updated the openjdk to 11. i have tried updating the tika parser version to latest but still code is failing with the…
DeadPool
  • 40
  • 8
-1
votes
1 answer

How to decode PDF multi-filter

The PDF internal structure has been analyzed. A stream with a single filter works fine. But in the case of multiple filters as shown below, which one should be turned off first? I don't know how to turn it off. The header is broken due to an…
-1
votes
1 answer

pdfparser says: 'Secured pdf file are currently not supported.'

I am using pdfparser to parse and read text from pdfs on PHP. It works fine for some pdf files. But, for some files it throws an error saying: 'Secured pdf file are currently not supported.'. When I try to open the files that pdfparser says are…
SAN
  • 75
  • 2
  • 11
1 2
3