Questions tagged [pdf-conversion]

Relating to converting between Portable Document Format and other file formats. Questions asking us to recommend or find a conversion tool or library are off-topic.

This tag is for questions relating to programmatically converting to and from the open standard file format . If a specific conversion is involved, the appropriate tag should also be used: etc.

Conversion solutions may range from complete rasterization (and graphic embedding) to intense . The middle ground generally converts at a high enough level to recognize and use text attributes where possible, falling back to graphic rendering only when necessary.

Questions asking us to recommend or find a tool, library, documentation or other off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam.

266 questions
2
votes
2 answers

PDF generation from HTML having multilingual text using flying-saucer+iText, Only Chinese fonts are working

I am trying to convert a html page into pdf using iText and flying-saucer. coding for the html page is
H S
  • 179
  • 4
  • 12
2
votes
1 answer

How to convert a PDF with a filled form to a JPEG image using ImageMagick and preserving the form data?

I'm trying to convert this PDF to a JPEG image via ImageMagick (v 6.8.7-0): https://dl.dropboxusercontent.com/u/10351891/cd.pdf I didn't find any working solution to preserve the filled data inside the PDF. This is one of the commands tried: convert…
Panda
  • 275
  • 1
  • 4
  • 7
2
votes
1 answer

Docverter loses utf-8 encoding on markdown -> pdf

When I try to convert an utf-8 encoded markdown file to pdf using Docverter (through the API), I just lose non-ASCII characters. Any solution to that? I want to convert .md -> .pdf. Maybe Docverter can help to do .md -> .html, then I could use some…
lajarre
  • 4,910
  • 6
  • 42
  • 69
2
votes
2 answers

Zend PDF write html code to instead text

I have success to test my code with function Zend_Pdf() for easy to understand, this is my code : $pdf = new Zend_Pdf(); $page = new Zend_Pdf_Page(Zend_Pdf_Page::SIZE_A4); $font =…
koe
  • 736
  • 1
  • 12
  • 33
2
votes
1 answer

Special characters are not converted correctly from pdf to text

I am having a set of pdf files that contain central european characters such as č, Ď, Š and so on. I want to convert them to text and I have tried pdftotext and PDFBox through Apache Tika but always some of them are not converted correctly. The…
Yannis P.
  • 2,745
  • 1
  • 24
  • 39
2
votes
2 answers

PDFCreator and Excel Viewer

I'd love to use PDFCreator to create PDF files from excel ones (.xlsx). I cannot use MS Office. I've downloaded PDFCreator and tried to print .docx with command: PDFCreator.exe /PF"myfile.docx" No problem with this one because Wordpad is used to…
luccio
  • 485
  • 1
  • 6
  • 24
2
votes
1 answer

extract barcode in pdf using c#

Is there a way to extract barcode numbers from a pdf document (spanning several pages) in C#. The pdf document is basically obtained from a scanner. If possible, I wouldn't like to use any third party library. Thanks.
blue piranha
  • 3,706
  • 13
  • 57
  • 98
2
votes
1 answer

converting pdf pages to images using js

How to export a PDF document into a series of individual images(jpeg, png or bmp format) using javascript or jquery?
user1909130
2
votes
1 answer

Color of PDF is different when opened in adobe than in preview

I'm currently working on an app that will generate PDF's that were previously being generated by a different (Windows based app). I convert a UIView into a PDF, which works 100%. The problem is that when I email that PDF from the ipad to my PC and…
Christopher Rex
  • 392
  • 3
  • 10
2
votes
1 answer

Converting windows form in C# to PDF using PdfSharp

I am trying to convert the current contents of a C# windows form to a pdf document. I am using the PDFSharp dll to do the conversion, I am not sure of how to capture the windows form and convert it to a PDF. I gather I should use the…
user1155383
  • 139
  • 3
  • 5
  • 9
2
votes
2 answers

ConversionJob in SharePoint2010 to convert docx to pdf takes long time or very slow

I have been using ConversionJob in SharePoint 2010 to convert the docx file to pdf and it takes a long time, around 10 mins. Is that really very slow or is it because of the resources on my server is bit slow? Because it takes 10 minutes in my…
Kiru
  • 3,489
  • 1
  • 25
  • 46
1
vote
2 answers

PDF to DOM conversion using javascript

I've been at it for quite some time and all i could uncover was this pdf2dom parser and probably a reverse engineered version of this. Anyway, here are my questions. For any rendering engine its input should be a stream of data (in my case the pdf…
Ashwin Krishnamurthy
  • 3,750
  • 3
  • 27
  • 49
1
vote
1 answer

When converting a PDF to a lower-resolution PDF file with Ghostscript, the background black colour is applied to the text and icons

When converting a PDF to a lower-resolution PDF file with Ghostscript, the background black colour is applied to the text and icons. Download original PDF file My code: gs -q -dNOPAUSE -dBATCH -dSAFER -sDEVICE=pdfwrite -dPDFSETTINGS=/default…
Rams
  • 53
  • 7
1
vote
0 answers

iText7 (Version 8.0.0) getting "dependency must be added" error when upgraded from .Net 6 to .Net 7

var writer = new PdfWriter(stream); The above line throwing below exception in .Net7, working fine in .Net6 without doing any changes. NotSupportedException: Either itext7.bouncy-castle-adapter or itext7.bouncy-castle-fips-adapter dependency must…
User
  • 804
  • 8
  • 22
1
vote
0 answers

Convert a fillable PDF to CSV file

I have a fillable PDF that contains multiple tables and I want to convert it to a CSV file using Python. When I try to open the PDF with anything other than Adobe Acrobat Reader I get this message. Error message What I have tried so far: import…