Questions tagged [pdf-reader]

An application or API that understands and parses PDF syntax allowing the manipulation or inspection of the objects contained in the PDF.

An application or API that understands and parses PDF syntax allowing the manipulation or inspection of the objects contained in the PDF.

If possible, use more specific tags, e.g.:

267 questions
1
vote
1 answer

Parsing/Reading a PDF Document using iText7 C#

I'm trying to upgrade my code by using iText7 libraries. Previously I used iTextSharp libraries But looks like iText7 is totally new I tried Reading a pdf Document but facing an exception in between "Pdf Header Not Found". Here's my code byte[]…
Aakash Singh
  • 97
  • 2
  • 13
1
vote
2 answers

read_pdf FileNotFoundError: [Errno 2] No such file or directory: in Python

I am trying to scrape tables from pdf with read_pdf in python. I am using read_pdf but it doesn't do the job. Also, to mention, I do this in MAC with Jupiter notebook. This is what I do: from tabula import read_pdf file =…
GaB
  • 1,076
  • 2
  • 16
  • 29
1
vote
1 answer

How to read a PDF file using Amazon Polly?

In a AWS blog (second line of first paragraph), it's mentioned that we can convert the text in a pdf document to speech. I tried to find the documentation related to pdf reading, but still not able to get any solution.
RVRJ
  • 135
  • 1
  • 8
1
vote
1 answer

AppContainer Incompatibility in Protected Mode - Adobe Acrobat Reader

After the latest Windows 10 update KB4524098 and KB4525237 Adobe Reader suddenly doesn't start properly anymore, and says: Adobe Acrobad Reader cannot open inside an AppContainer in Protected Mode due to an incompatibility with your system…
gloschtla
  • 41
  • 3
  • 10
1
vote
0 answers

How to send binary data over JSON object which I've got by reading PDF file?

I'm reading a PDF file content using php fread function. Now I want to post this data through JSON object. So, how can I send this data(content of PDF file)? I'm using POSTMAN to send pdf file by selecting binary in header option and getting…
rhl9119
  • 33
  • 5
1
vote
1 answer

Where to save PDF files to open with external App

I need to crate an app that opens selected PDF files via external app. I've tried putting all the files in the assets folder, but I didn't find a way to create a file-provider that will read from the assets folder. I prefer a solution that doesn't…
etamar211
  • 53
  • 7
1
vote
1 answer

Solving "EOF market not found error" PyPDF2

I am using PyPDF2 and tika to extract text from .pdf and .htm files. I am running into the following error: "PyPDF2.utils.PdfReadError: EOF marker not found" I have seen multiple posts on the issue yet none included a solution. Here's the code I'm…
segilmez
  • 35
  • 5
1
vote
0 answers

Extract data(both text and images) from Image file

I have a PDF file(Content as image in PDF), i need to extract text and images from the PDF file. I have tried PDF converter libraries in Laravel, but none is worked. So i have converted that PDF into image with Imagick, after that using TesseractOCR…
user3668438
  • 165
  • 5
  • 18
1
vote
0 answers

[ERR_INVALID_ARG_TYPE]: The "listener" argument must be of type Function at _addListener (events.js:180:11)

Update : The issue was solved after a clean installation of NodeJS. I am trying to create a PDF file downloader in NodeJS. This is my code const file = fs.createWriteStream(filePath); const sendReq =…
Sayooj
  • 375
  • 3
  • 13
1
vote
0 answers

PdfRenderer throws SecurityException everytime if user select first Pdf document which is protected

I have used below code please checked it. PdfRenderer.Page page = null; Bitmap mBitmap; try { descriptor = getSeekableFileDescriptor(documentPath); renderer = new PdfRenderer(descriptor); page =…
1
vote
1 answer

How to open a PDF file in pdf reader with vaadin?

A new Programmer here. I was searching about open a PDF file with vaading. I can open it in the browser or download it without problem, but i need to open in a tab of my application, or at least open it in the predefined pdf reader and the Desktop…
Daitona
  • 11
  • 2
1
vote
1 answer

How to get Tags from pdf document in c#

I am using iTextSharp to read a pdf document and it is getting read successfully. Now I want to get Tags from a pdf document but I don't know how to get tags using iTextSharp. Code is given below class Program { static void Main(string[] args) …
Bilal Malik
  • 79
  • 1
  • 1
  • 10
1
vote
1 answer

Error handling when opening a PDF in Node

I am attempting to open a 3rd party generated PDF that I know will fail occasionally. I am trying both pdf2json and pdfreader, and am encountering the same issue, which I'm not sure if it how I am attempting to handle the libraries in a…
Chris
  • 105
  • 1
  • 10
1
vote
1 answer

word count PDF files when walking directory

Hello Stackoverflow community! I'm trying to build a Python program that will walk a directory (and all sub-directories) and do a accumulated word count total on all .html, .txt, and .pdf files. When reading a .pdf file it requires a little…
1
vote
1 answer

How to read a PDF file from a url of external site in C#

I want to be able to download a pdf that is accessed by a routed url (external site) in my asp.net application. is there a way to do it? The Current Situation: The routed url is in the company's internal site (joomla site)…