Questions tagged [pdf-to-html]

79 questions
1
vote
0 answers

pdf2htmlEX cannot save font to

I get an error when converting some PDF files: Internal Error: File Offset wrong for ttf table (name-data), -1 expected 174 Save Failed Cannot save font to C:\Users\test\AppData\Local\Temp//pdf2htmlEX-a14136/__tmp_font1.ttf I'm using Windows…
WP8_CT
  • 166
  • 1
  • 2
  • 13
1
vote
3 answers

How to get HTML code from PDF, DOCX, DOC using PHP

I want to convert any PDF, DOCX, or DOC file into HTML code using PHP, with the same styling as in the PDF. I have not found a proper solution. Config::set('pdftohtml.bin', 'C:/poppler-0.37/bin/pdftohtml.exe'); …
viralchampanery
  • 397
  • 1
  • 7
  • 16
1
vote
1 answer

Converting multiple files using pdf2htmlEX

How do you use pdf2htmlEX on multiple files or on a folder that contains pdf files? I am able to convert single files just fine, but I obviously don't want to run a command 100 times for 100 files. I couldn't find anything in the documentation and…
Procyon82
  • 45
  • 6
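For the batch conversion asked about above, one possible approach is to loop over the folder and call pdf2htmlEX once per file. This is a minimal Python sketch, not a feature of pdf2htmlEX itself. It assumes pdf2htmlEX is on the PATH; `build_commands`, `convert_folder`, and the `html_out` folder name are hypothetical names introduced here, and `--dest-dir` is the pdf2htmlEX option that sets the output directory.

```python
import subprocess
from pathlib import Path


def build_commands(folder, dest_dir="html_out"):
    """Return one pdf2htmlEX argument list per *.pdf file in `folder`.

    `dest_dir` is a hypothetical output folder name passed to --dest-dir.
    """
    pdfs = sorted(Path(folder).glob("*.pdf"))
    return [["pdf2htmlEX", "--dest-dir", str(dest_dir), str(pdf)] for pdf in pdfs]


def convert_folder(folder, dest_dir="html_out"):
    """Run the commands one after another (assumes pdf2htmlEX is on PATH)."""
    for cmd in build_commands(folder, dest_dir):
        # check=True raises CalledProcessError if a conversion fails
        subprocess.run(cmd, check=True)
```

Calling `convert_folder("pdfs")` would then convert every PDF in a folder named `pdfs` (also a placeholder). Building the commands separately from running them makes it easy to print and inspect them first.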
1
vote
2 answers

pdf2htmlEX cannot open or read file

I installed Docker and run pdf2htmlEX through it: alias pdf2htmlEX="docker run -ti --rm -v ~/pdf:/pdf bwits/pdf2htmlex pdf2htmlEX" pdf2htmlEX -h pdf2htmlEX --zoom 1.3 test.pdf This is my path and the PDFs contained inside: ~/Desktop/pdf$ ls…
Sean
  • 267
  • 4
  • 9
1
vote
2 answers

Convert PDF to HTML using ABCpdf

I am looking for a method to convert a PDF document into a corresponding HTML document using ABCpdf. Kindly let me know if it is feasible. FYI, my PDF document has rich text along with images.
1
vote
1 answer

PDF to HTML and HTML to PDF solution in PHP

I need to convert a PDF document to HTML and, after editing the HTML, convert it back to PDF. I use the 'pdftohtml' Ubuntu command (pdftohtml - program to convert pdf files into html, xml and png images) as in the PHP code below
Nadimul De Cj
  • 484
  • 4
  • 16
1
vote
1 answer

Convert PDF to HTML/XML using C#

I want to convert PDF to HTML using C#. Can anyone please suggest some ideas?
Sur
  • 83
  • 2
  • 6
1
vote
4 answers

PDF to HTML with PHP

I need to convert some pdf files into HTML. I downloaded pdftohtml for PHP but I don't know how to use it. I am trying to run it with this code:
silvia
  • 49
  • 1
  • 3
  • 8
1
vote
6 answers

In ASP.NET what is the best way to convert a PDF file to HTML?

What my users will do is select a PDF document on their machine, upload it to my website, where I will convert into an HTML document for display on the website. The document will be stored in a database after conversion. What's the best way to…
David Basarab
  • 72,212
  • 42
  • 129
  • 156
1
vote
0 answers

Convert PDF to HTML (Code)

Is there any way to convert a PDF file to HTML code? I found tools for converting PDF to HTML, but they all convert the PDF to an image and insert it into an HTML page. I need to convert the PDF to HTML code in order to generate dynamic pages.
colymore
  • 11,776
  • 13
  • 48
  • 90
1
vote
0 answers

Slideshow from PDF

Does anyone know whether html5slides by Google or the hakim framework also works with multi-page PDFs? I'd like to run a slideshow based on many PDF files, some with one page, others with many pages. Each page should be one slide in the slideshow. Thanks in advance.
110precent
  • 322
  • 4
  • 19
1
vote
2 answers

HiqPdf Rendering / Layout Engine

My company has been using HiqPdf for some time now, and we had a discussion about which rendering engine HiqPdf uses to parse its HTML. We couldn't find it on the HiqPdf website or on Stack Overflow. http://www.hiqpdf.com/ I would suspect WebKit but does…
Shikiju
  • 722
  • 2
  • 9
  • 18
1
vote
0 answers

PDF to HTML and OCR solution for information extraction

I'm looking for a PDF-to-HTML and OCR solution, either as a cloud service or as an SDK. From my searches, I see that there are a bunch of services out there on the internet. I tried some of them and got a rough idea. I'd like to know if any…
1
vote
1 answer

How to import or convert an Adobe EPS/PDF file into Microsoft Expression Design for editing

I use Microsoft Expression Design. One limitation is the lack of import or conversion from .EPS or PDF (Adobe file formats) into Expression Design for editing. I wish Microsoft would support this, but it has been lacking for the last 3 years as of version…
aroos
  • 505
  • 2
  • 5
  • 12
1
vote
1 answer

Generate HTML with images from PDF using Linux poppler-utils pdftohtml

I am currently working with pdftohtml from poppler-utils on CentOS. The concept is simple: the user uploads a PDF file and sees the HTML version of that file. I use the simple command $> pdftohtml source.pdf target.html but it doesn't work! Later…
Avijit Majumder
  • 63
  • 1
  • 2
  • 7
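The excerpt above is cut off, so the actual cause of the failure is unknown. As an illustration only, here is a minimal Python sketch of calling pdftohtml so that any error message is surfaced instead of lost. It assumes pdftohtml from poppler-utils is on the PATH; `build_pdftohtml_command` and `convert` are hypothetical helper names, and the `-c` (complex output) and `-noframes` (no frameset) flags are one possible choice, not necessarily what the asker needs.

```python
import subprocess


def build_pdftohtml_command(pdf_path, html_path):
    """Build the pdftohtml argument list (hypothetical helper)."""
    # -c: complex output that tries to keep the page layout
    # -noframes: write plain HTML instead of a frameset
    return ["pdftohtml", "-c", "-noframes", str(pdf_path), str(html_path)]


def convert(pdf_path, html_path):
    """Run pdftohtml and raise with its stderr text if it fails."""
    result = subprocess.run(
        build_pdftohtml_command(pdf_path, html_path),
        capture_output=True,
        text=True,
    )
    if result.returncode != 0:
        raise RuntimeError(result.stderr.strip() or "pdftohtml failed")
    return html_path
```

With the file names from the question, the call would be `convert("source.pdf", "target.html")`. Capturing stderr is the main point: in an upload workflow, a silent failure is much harder to debug than a raised error.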