Questions tagged [pdf-conversion]

Relating to converting between Portable Document Format and other file formats. Questions asking us to recommend or find a conversion tool or library are off-topic.

This tag is for questions relating to programmatically converting to and from the open standard file format pdf. If a specific conversion is involved, the appropriate tag should also be used: openoffice-writer msword tiff jpeg etc.

Conversion solutions may range from complete rasterization (and graphic embedding) to intense ocr. The middle ground generally converts at a high enough level to recognize and use text attributes where possible, falling back to graphic rendering only when necessary.

Questions asking us to recommend or find a tool, library, documentation or other off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam.

266 questions

votes

1 answer

Annotation in pdfclown

I am trying to put a sticky note at some x,y location. For this i am using the pdfclown annotation class in .net. Below is what is available. using files = org.pdfclown.files; public override bool Run() { files::File file = new…

asp.net .net pdf pdf-conversion pdfclown

asked Dec 20 '12 at 10:43

user1853803

votes

1 answer

how to convert Office files to pdf using Aspose lib in Java

I am new to Aspose. All i know is, its a useful library for document creation,conversion etc. please guide me how can i convert any file(MS office) to pdf file since in my project the document uploaded has to be first converted to pdf then saved in…

java aspose pdf-conversion

asked Nov 28 '12 at 07:47

Zay hf

votes

1 answer

How to extract text from a photocopy saved as a pdf in Python

I found the following code that allows one to extract text from a pdf file. However, this only works for pdf's where you can copy the text directly from highlighting it. I'm curious if there's some way to extract text from a document where you can't…

python nlp pdf-conversion

asked Dec 23 '22 at 04:05

syntax_of_vectors

votes

4 answers

Is there way to remove only BAD characters from a string in Python/pandas?

I am trying to read a PDF using Camelot library and store it to a dataframe. The resulting dataframe has garbled/bad characters in string fields. Eg: 123Rise â€“ Tower & Troe's Mechâ€“ I want to remove ONLY the Garbled characters and keep everything…

python pandas string dataframe pdf-conversion

asked Jul 07 '22 at 07:09

maddy_novice

votes

0 answers

Command was not found in your classpath error

I use wkhtmltopdf in my Java app and when try to send HTTP request, I get "wkhtmltopdf command was not found in your classpath. Verify its installation or initialize wrapper configurations with correct" error. There are several fix for Linux e.g.…

java windows pdf environment-variables pdf-conversion

asked Jun 21 '21 at 12:18

user16183695

votes

0 answers

Pdf result is wrong format

I need to convert a docx in `pdf. This is the starter docx: The document docx has something like a table border and other table ( in the docx at the end , belove the two fields, there are a lot of rows with black border). Another problem is that…

java pdf java-8 apache-poi pdf-conversion

asked Feb 15 '21 at 09:19

JoeB

votes

0 answers

Kodak's KCMS replaced with Little-CMS (LCMS) in openJDK - Performance drop in PDF conversion (slow)

The Kodak color management system (KCMS) has been replaced in openJDK with LittleCMS (LCMS) starting from Java 10. In Java8 and 9 it was still available but not the default anymore. It could be activated with: // force…

java image-processing pdf-generation pdfbox pdf-conversion

asked Jan 28 '21 at 22:44

Lonzak

9,334
5
57
88

votes

3 answers

PDF to HTML or similar

I'm building an application to view pdf's through a browser without the need of a plugin on mobile devices. I tried ImageMagick and ghostscript to covert the pages to images but they are far too large and text becomes unclear. I see website offering…

pdf-conversion pdf-viewer pdfview

asked Jun 06 '11 at 13:14

Tw1tCh

votes

0 answers

Apache POI XWPF Word document is Losing Page Number when converted into PDF

I'm generating a word document using Apache POI XWPF. this document includes a footer with Page number but when I convert this word document into PDF, the page number is not displayed: XWPFRun run = paragraph.createRun(); XWPFFooter footer =…

java apache-poi pdf-conversion

asked Mar 30 '20 at 14:21

tiamat

votes

1 answer

How to download a page in Angular 8 with TS in pdf format

this is the function i'm calling for downloading the page printComponent(cmpName) { let printContents = document.getElementById(cmpName).innerHTML; let originalContents = document.body.innerHTML; document.body.innerHTML =…

angular typescript pdf-conversion

asked Feb 18 '20 at 04:02

Ashish Kumar Dubey

votes

2 answers

is there any Library for ePub

I want to write converter that gives me full control over converting PDF to ePUB? What i am looking is that the converter -should not cut the words in the end line. If a text is one paragraph then epub must be one paragraph without any additional…

c# asp.net wpf epub pdf-conversion

asked May 04 '11 at 12:52

Afnan Bashir

7,319
20
76
138

votes

0 answers

expertPDF function GetPdfDocumentObjectFromHtmlString returns error "Conversion error: Webkit Navigation timeout"

I am using expertpdf library to convert an html page in pdf and the function is giving error "WebKit Navigation timeout". public void ConvertHTMLStringToPDF(string compHTML) { var isIE8 = Request.Browser.Browser == "IE" &&…

c# html-to-pdf pdf-conversion

asked Sep 26 '19 at 11:16

Samreen

votes

1 answer

JODConverter: StarOffice.ServiceManager started but its pid could not be found

I've faced with next problem while starting officeManager: A process with acceptString 'socket,host=127.0.0.1,port=2002,tcpNoDelay=1;urp;StarOffice.ServiceManager' started but its pid could not be found @Configuration public class…

java libreoffice pdf-conversion jodconverter

asked Feb 21 '19 at 00:03

Andriy Chekhovych

votes

1 answer

text-align:justify on a generated evoPDF

I am using evoPDF to convert HTML files and it seems that text-align:justify is not behaving properly. The spacing happens in between characters, instead of in between words. What i have used so far without any…

pdf-conversion evopdf

asked May 08 '18 at 15:23

tiondal

votes

1 answer

How to convert Xamarin.Forms XAML UI page to PDF file?

In Xamarin.Forms, I want to convert my xaml page UI (sometimes my page is scrollable when having more content) into the PDF. I have tried the PDFSharp (https://github.com/akgulebubekir/PDFSharp.Xamarin.Forms) open source. But it works only on UWP…

xaml pdf xamarin xamarin.forms pdf-conversion

asked May 04 '18 at 09:59

Selvamz

Prev 1 2

…

17 18 Next