Questions tagged [pdf-conversion]

Relating to converting between Portable Document Format and other file formats. Questions asking us to recommend or find a conversion tool or library are off-topic.

This tag is for questions relating to programmatically converting to and from the open standard file format . If a specific conversion is involved, the appropriate tag should also be used: etc.

Conversion solutions may range from complete rasterization (and graphic embedding) to intense . The middle ground generally converts at a high enough level to recognize and use text attributes where possible, falling back to graphic rendering only when necessary.

Questions asking us to recommend or find a tool, library, documentation or other off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam.

266 questions
3
votes
1 answer

Annotation in pdfclown

I am trying to put a sticky note at some x,y location. For this i am using the pdfclown annotation class in .net. Below is what is available. using files = org.pdfclown.files; public override bool Run() { files::File file = new…
user1853803
  • 649
  • 3
  • 8
  • 27
3
votes
1 answer

how to convert Office files to pdf using Aspose lib in Java

I am new to Aspose. All i know is, its a useful library for document creation,conversion etc. please guide me how can i convert any file(MS office) to pdf file since in my project the document uploaded has to be first converted to pdf then saved in…
Zay hf
  • 139
  • 2
  • 4
  • 12
2
votes
1 answer

How to extract text from a photocopy saved as a pdf in Python

I found the following code that allows one to extract text from a pdf file. However, this only works for pdf's where you can copy the text directly from highlighting it. I'm curious if there's some way to extract text from a document where you can't…
2
votes
4 answers

Is there way to remove only BAD characters from a string in Python/pandas?

I am trying to read a PDF using Camelot library and store it to a dataframe. The resulting dataframe has garbled/bad characters in string fields. Eg: 123Rise – Tower & Troe's Mech– I want to remove ONLY the Garbled characters and keep everything…
2
votes
0 answers

Command was not found in your classpath error

I use wkhtmltopdf in my Java app and when try to send HTTP request, I get "wkhtmltopdf command was not found in your classpath. Verify its installation or initialize wrapper configurations with correct" error. There are several fix for Linux e.g.…
user16183695
2
votes
0 answers

Pdf result is wrong format

I need to convert a docx in `pdf. This is the starter docx: The document docx has something like a table border and other table ( in the docx at the end , belove the two fields, there are a lot of rows with black border). Another problem is that…
JoeB
  • 201
  • 2
  • 4
  • 9
2
votes
0 answers

Kodak's KCMS replaced with Little-CMS (LCMS) in openJDK - Performance drop in PDF conversion (slow)

The Kodak color management system (KCMS) has been replaced in openJDK with LittleCMS (LCMS) starting from Java 10. In Java8 and 9 it was still available but not the default anymore. It could be activated with: // force…
Lonzak
  • 9,334
  • 5
  • 57
  • 88
2
votes
3 answers

PDF to HTML or similar

I'm building an application to view pdf's through a browser without the need of a plugin on mobile devices. I tried ImageMagick and ghostscript to covert the pages to images but they are far too large and text becomes unclear. I see website offering…
Tw1tCh
  • 89
  • 1
  • 12
2
votes
0 answers

Apache POI XWPF Word document is Losing Page Number when converted into PDF

I'm generating a word document using Apache POI XWPF. this document includes a footer with Page number but when I convert this word document into PDF, the page number is not displayed: XWPFRun run = paragraph.createRun(); XWPFFooter footer =…
tiamat
  • 879
  • 2
  • 12
  • 35
2
votes
1 answer

How to download a page in Angular 8 with TS in pdf format

this is the function i'm calling for downloading the page printComponent(cmpName) { let printContents = document.getElementById(cmpName).innerHTML; let originalContents = document.body.innerHTML; document.body.innerHTML =…
2
votes
2 answers

is there any Library for ePub

I want to write converter that gives me full control over converting PDF to ePUB? What i am looking is that the converter -should not cut the words in the end line. If a text is one paragraph then epub must be one paragraph without any additional…
Afnan Bashir
  • 7,319
  • 20
  • 76
  • 138
2
votes
0 answers

expertPDF function GetPdfDocumentObjectFromHtmlString returns error "Conversion error: Webkit Navigation timeout"

I am using expertpdf library to convert an html page in pdf and the function is giving error "WebKit Navigation timeout". public void ConvertHTMLStringToPDF(string compHTML) { var isIE8 = Request.Browser.Browser == "IE" &&…
Samreen
  • 23
  • 6
2
votes
1 answer

JODConverter: StarOffice.ServiceManager started but its pid could not be found

I've faced with next problem while starting officeManager: A process with acceptString 'socket,host=127.0.0.1,port=2002,tcpNoDelay=1;urp;StarOffice.ServiceManager' started but its pid could not be found @Configuration public class…
2
votes
1 answer

text-align:justify on a generated evoPDF

I am using evoPDF to convert HTML files and it seems that text-align:justify is not behaving properly. The spacing happens in between characters, instead of in between words. What i have used so far without any…
tiondal
  • 21
  • 3
2
votes
1 answer

How to convert Xamarin.Forms XAML UI page to PDF file?

In Xamarin.Forms, I want to convert my xaml page UI (sometimes my page is scrollable when having more content) into the PDF. I have tried the PDFSharp (https://github.com/akgulebubekir/PDFSharp.Xamarin.Forms) open source. But it works only on UWP…
Selvamz
  • 362
  • 3
  • 16
1 2
3
17 18