Questions tagged [xpdf]

Xpdf is an open-source PDF viewer for the X Window System and Motif.

Xpdf runs on practically any Unix-like operating system. Xpdf can decode LZW and read encrypted PDFs.

More details at http://en.wikipedia.org/wiki/Xpdf

71 questions
0
votes
0 answers

pdftotext for Firebase Functions?

I need to extract text from a PDF, and for that I am using pdftotext (poppler) on my local system (Ubuntu 18.04 LTS). Now I want to deploy the function to Firebase Functions, but Firebase Functions does not have it by default. Is there any portable…
0
votes
1 answer

What is the difference between Ghostscript and Xpdf when converting PDF to PNG?

In my project I use Ghostscript to convert a PDF file to a PNG image. The problem is that it cannot correctly process fonts that are not embedded in the PDF file, so the output was wrong. When using pdftoppm (an Xpdf tool) the problem was fixed. So I want to…
Trần Hồng
0
votes
1 answer

Convert PDF to HTML in PHP similar to DocuSign

We are developing a website that needs to convert PDF files into HTML because some of the PDFs contain a form (not necessarily a fillable PDF; these PDFs are printed to be filled in). So we want them to be filled in through our website instead of printing…
fsnight
0
votes
1 answer

Trying to scrape a PDF in R, my code will only scrape 6 out of 9 pages and I'm not sure why. Am I missing something in my code?

I'm trying to scrape a couple of PDFs in R. PDF1 has 9 pages and PDF2 has 12 pages. When I run the code below it scrapes both PDFs, but only up to page 6 and nothing after that. Is there a reason for this? Is something missing in my code? library(tm) read…
Jlingz14
0
votes
1 answer

make linker for pdftohtml causes compiler errors

I'm getting the error below when I run make to compile xpdf. It seems to have to do with paper sizes not being detected. I've not seen this kind of error before; usually it's the math library that causes the compiler to complain. Is anyone experienced with…
bit
0
votes
0 answers

Can xpdf accept variables as path arguments in PowerShell?

I'm getting an error: .\pdftotext.exe : pdftotext version 4.00 At C:\Users\eakinsa\Desktop\Style Guide Report\styleGuideReport.ps1:62 char:88 + ... Style Guide Report\Includes\bin32\"; .\pdftotext.exe $selectedFile }) + …
A. Eakins
0
votes
1 answer

How to get chars/words/lines/blocks coordinates

I'm doing pdftotext -bbox file.pdf and that produces word-level output. Is there a way to output coordinates on the character/phrase/line/block level? I'm interested in knowing if either the poppler or xpdf version of pdftotext can do this.
0
votes
1 answer

Error occurs when distilling with Adobe, but not in Ghostscript

I have a PostScript file. When I open it with Ghostscript it shows output with no error. But when I try to distill it with Adobe it stops with the following error. %%[ Error: undefined; OffendingCommand: show; ErrorInfo: MetricsCount --nostringval--…
Khizer But
0
votes
1 answer

How to create Type 3 glyphs in PostScript

%%LanguageLevel: 3 8 dict begin /FontName /T3_414 def /FontType 3 def /FontBBox [ 0 0 1 1 ] def /FontMatrix [0.011368 0 0 0.011368 0 0 ] def /Encoding 256 array def 0 1 255 {Encoding exch /.notdef put} for Encoding 97 /g7516 put /BuildGlyph…
Kbstar
0
votes
1 answer

Type 3 fonts conversion

I am parsing Type 3 glyph fonts from PDF to PostScript. The input file has inline images whose data streams have a FlateDecode filter applied. The filter has predictor 15. Can anybody help with how to take the image streams from PDF to PostScript? This is how…
Kbstar
0
votes
0 answers

Why does Xpdf not work on Windows?

I am writing a program using invoice2data, which requires me to use Xpdf. I am coding on Windows. Though I have installed it, I get an error message saying pdftotext is not installed. Can someone help me understand why it does not work? I also changed the…
Herchelle
0
votes
1 answer

invalid input '📧' in 'utf8towcs' when using tm and pdftools

My work was going along smoothly, but I encountered problems due to some of my PDF files containing weird symbols ("📧"). I have reviewed the older discussion, but none of those solutions worked: R tm package invalid input in 'utf8towcs'. This is my…
0
votes
0 answers

How do you add PDF files to a list in R?

I am trying to analyze PDF files in R, following the instructions on data.library.virginia.edu/reading-pdf-files-into-r-for-text-mining/. The first few steps worked out fine, but I realize that I have to somehow connect the PDF files to the list in…
0
votes
1 answer

xpdf batch file: how to change file name?

I'm using a simple batch file calling the xpdf engine to convert a PDF to a TXT file. Right now, the resulting txt file's name is the same as the PDF's, except the extension has been changed to .txt of course. However, I want to add some text behind…
user7443687
0
votes
1 answer

Process the data of an image, such as a PDF or something else, using PDFCreator

Hey all. Maybe you can help me with my project. I'm using PDFCreator as a virtual printer to print some images to a file. It can be a PDF or any type of image. But I need to extract data from it. Can it be done? I'm using C#.
Guy