Questions tagged [pdf-reader]

An application or API that understands and parses PDF syntax, allowing manipulation or inspection of the objects contained in the PDF.

If possible, use more specific tags.

267 questions
4
votes
0 answers

How to identify vertical text with yob/pdf-reader?

I have some code that uses the yob/pdf-reader gem to fetch the text from a PDF file and search it for certain strings. Recently I ran into the issue that vertical text can interfere with my logic for identifying text that belongs to the same line. My…
Jonas
  • 515
  • 4
  • 13
4
votes
1 answer

Google document api iframe bandwidth limit

On my site I have an iframe like this that opens a PDF: <iframe style="width: 100%; height: 895px; border: 0px; position: relative; top: 56px;" src="https://docs.google.com/gview?url=http://www.example.com/3155309.pdf&embedded=true"> I'm…
user2437305
  • 53
  • 1
  • 9
4
votes
3 answers

Extracting entire pdf data with python pdfminer

I am using pdfminer to extract data from PDF files in Python. I would like to extract all the data present in a PDF, irrespective of whether it is an image, text, or anything else. Can we do that in a single line (or two if needed, without much…
sunil reddy
  • 135
  • 1
  • 1
  • 8
4
votes
3 answers

how can we extract text from pdf using itextsharp with spaces?

I am using the method below to extract PDF text line by line. The problem is that it does not read the spaces between words and figures. What could be the solution for this? I just want to create a list of strings, where each string in the list holds one text line…
shailendra
  • 165
  • 2
  • 3
  • 8
4
votes
3 answers

Android pdf reader

I want to use the Android PDF library http://andpdf.sourceforge.net/, but I have the same error. Log: ST='file 'no file selected' not found' ST='reading page 1, zoom:1.0' My classes: public class Reader extends PdfViewerActivity { public int…
Yuriy Aizenberg
  • 373
  • 9
  • 28
4
votes
2 answers

How to read pdf file from document directory in iPhone?

I am currently working on an iPhone application. I have a PDF file in the resource folder (a local PDF file), and I can read that file (paper.pdf) successfully. Below is how I read the local PDF file, for your reference. Example: CFURLRef pdfURL =…
SampathKumar
  • 2,525
  • 8
  • 47
  • 82
4
votes
1 answer

Pdf library for android?

I am developing a small application in which I have to show a PDF file of a newspaper exactly as it is shown at the link below. This is a link to the Press Reader app, which also shows the newspaper. I have searched a lot on the internet but cannot find any…
Amandeep singh
  • 1,865
  • 7
  • 21
  • 41
3
votes
1 answer

ModuleNotFoundError: Module not found: Error: Can't resolve 'canvas' in '/home/pegasus/Documents/Final_Website/blog/node_modules/pdfjs-dist/build'

When I installed react-pdf I got the following error: ModuleNotFoundError: Module not found: Error: Can't resolve 'canvas' in '/home/pegasus/Documents/Final_Website/blog/node_modules/pdfjs-dist/build' I'm trying to render a PDF on a Gatsby website.…
NobinPegasus
  • 545
  • 2
  • 16
3
votes
2 answers

No implementation found for method getPage on channel flutter_plugin_pdf_viewer

I am getting the list of PDFs from Firebase Storage. I am using advance_pdf_viewer to view a PDF, but as soon as the file is passed to the PDF reader, my emulator crashes with the following error. E/AndroidRuntime(31096):…
learner
  • 49
  • 1
  • 6
3
votes
2 answers

How to read an existing pdf file using the itextpdf.jar file in java/Android

I want to read a PDF file from my SD card using Java/Android. I imported the itextpdf5.1.1.jar file into Eclipse. I am able to read a file if I create a new file from an existing one, like this: public void readPdfFile(String pFilename){ …
3
votes
2 answers

What is the best way to extract text contained within a table in a pdf using python?

I'm constructing a program to extract text from a pdf, put it in a structured format, and send it off to a database. I have roughly 1,400 individual pdfs that all follow a similar format, but nuances in the verbiage and plan designs that the…
Parker Scott
  • 33
  • 1
  • 3
3
votes
1 answer

Fastest way to check that a PDF is corrupted (Or just missing EOF) in Ruby?

I am looking for a way to check if a PDF is missing an end of file character. So far I have found I can use the pdf-reader gem and catch the MalformedPDFError exception, or of course I could simply open the whole file and check if the last character…
user3037691
  • 135
  • 1
  • 3
  • 9
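
For the question above, a minimal sketch using only the Ruby standard library, assuming the goal is simply to test whether the `%%EOF` trailer marker appears near the end of the file without parsing the whole document. The helper name `pdf_has_eof_marker?` and the 1024-byte tail window are assumptions made for illustration. This is not a full validity check: a file can contain the marker and still be malformed in other ways.

```ruby
# Hypothetical helper (not part of the pdf-reader gem): returns true when
# the "%%EOF" marker appears within the last `tail_bytes` bytes of the file.
# Only the tail is read, so the cost does not grow with the file size.
def pdf_has_eof_marker?(path, tail_bytes: 1024)
  File.open(path, "rb") do |file|
    # Start reading at most `tail_bytes` before the end (or at byte 0 for
    # files shorter than the window).
    start = [file.size - tail_bytes, 0].max
    file.seek(start)
    file.read.include?("%%EOF")
  end
end
```

A caller could run this cheap check first and fall back to a full parse (for example, rescuing the gem's MalformedPDFError) only for files that pass it.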
3
votes
0 answers

how to enable annotation of adobe pdf reader control

I tried the Adobe Reader control in a Windows .NET standalone application, but annotation is disabled. Is there any way to enable annotation in the Adobe PDF Reader control? private void HighlightTextToolStripMenuItem_Click(object sender, EventArgs e) …
Harsh
  • 41
  • 6
3
votes
1 answer

How to parse pdf in Ruby

I have been trying to write a simple Ruby program that parses a simple PDF file and extracts the text I am interested in. I found that pdf-reader is quite a good gem for parsing PDF files. I have read through the examples given in that gem and some tutorials…
Raghavendra Nilekani
  • 396
  • 2
  • 10
  • 22
3
votes
2 answers

How to Detect table start in itextSharp?

I am trying to convert a PDF to a CSV file. The PDF file has data in tabular format, with the first row as the header. I have reached the point where I can extract text from a cell, compare the baseline of text in the table, and detect a newline, but I need to compare…
NetDeveloper
  • 509
  • 1
  • 9
  • 20