Document conversion is the process of converting a document from one format to another so that it can be read in more applications. Documents can be converted into other source document formats, consumer formats, or structured data.
Questions tagged [document-conversion]
70 questions
0 votes, 3 answers
Are there any free PHP libraries to convert PPT or ODT to PDF?
I'm working on a document manager on a shared hosting account (Linux) to which I don't have command-line access.
When people upload presentations I need to convert them to PDF for people to download. I have read that there is a way to do this by…

Chris Headleand
- 6,003
- 16
- 51
- 69
0 votes, 1 answer
Persian PDF file to Word file Converter
Persian PDF to Word Converter.
I have created an application to convert a PDF file into a Word file. There is a problem with Persian PDF files when I try to convert them. The application converts the PDF into a Word file but the text format is not…

behrooz
- 9
- 5
0 votes, 0 answers
Aspose for .NET | Error while converting Word to PDF for reference number data type numbers
I am getting this error while converting a Word document to PDF. The error is shown instead of the reference number text.
The text has reference type:

Nitinkumar
- 19
- 8
0 votes, 1 answer
TIFF output is not as expected for black-and-white 1200 dpi LZW test file created using Universal Document Converter versions 6.7 and 6.8
Respected Sir/Madam,
I have a question about creating LZW black-and-white 1200 dpi TIFF files using the UDC driver, versions 6.7 and 6.8.
If we disable “Perform High-Quality Smoothing”, the output data is not visible in the output files.
If we enable this option, it is…

Shant
- 11
- 5
0 votes, 1 answer
How to view Google Docs or Word files in a Flex application?
I am evaluating Flex for a mobile and a web application, and one of the things I want to be able to do is show a Google Docs or Microsoft Office file (doc, ppt) within a Flex application.
Google Docs files are published as web pages, so I guess you need a…
user308808
0 votes, 1 answer
Not able to read file in Pypandoc
I am trying to convert a PDF to HTML using Pandoc. I have installed the pandoc binary, added it to the PATH environment variable, and am then using
import pypandoc
import os
os.environ.setdefault('PYPANDOC_PANDOC', 'C://Program…

SUBHRA SANKHA
- 118
- 1
- 2
- 11
0 votes, 0 answers
How to run OpenOffice on a server and perform conversion from my local system
I have a requirement to run OpenOffice on a standalone server and use a Java program for document conversion.
Right now, I have a setup where I start OpenOffice in my Linux environment using the command below
sudo -H …

BlackViper
- 53
- 5
0 votes, 1 answer
c# Word-AddIn convert activeDocument to a virtual PDF and merge them into one PDF document
I am creating multiple virtual documents and then I want to merge them into one PDF without saving them anywhere. All I have found so far are guides that save the document as a PDF somewhere and then read the PDFs back to merge them into one.…

GuterProgrammierer
- 73
- 11
0 votes, 1 answer
Convert PDF to RTF or DOC
Is it possible to convert a PDF document to RTF or DOC using Java?

Madalina
- 1,297
- 6
- 15
- 25
0 votes, 1 answer
Conversion of PDF to EPub
I am creating an application to convert HTML pages to ePub format. I tried converting the file to PDF, since I require a table of contents as the first page of the ePub file. I have used Spire PDF and Spire DOC for this purpose. To convert to ePub,…

Shubashree Ravi
- 261
- 1
- 16
0 votes, 1 answer
Ruby: parse/extract images and objects from docx file
I am trying to open and read a .docx file using Ruby, and extract portions of the text and objects/images and save into another (non .docx) file.
Using Nokogiri, I am able to properly extract text and do my partitioning of the document into the…

Noel Euzebe
- 11
- 2
0 votes, 1 answer
Splitting complex PDF files using Watson Document Conversion Service
We are implementing a question-answering system using Watson Discovery Service (WDS). We require each answer unit to be available as a single document. We have complex PDF files as our corpus. The PDF files contain two-column data, tables, and images. Instead…
0 votes, 1 answer
I need to convert DOC/TXT files to PDF in large batches
We are changing systems and the new system only outputs .DOC or .TXT files for reports. Several of the reports that come out need to be converted to PDF so they are available for our web users on a daily basis. Currently I am testing about 1500 of a…

David M
- 43
- 3
- 10
0 votes, 1 answer
getting "415:Media not supported" Error when passing pdf to IBM watson in Salesforce
I am planning to integrate the IBM Watson Document Conversion service with Salesforce.
From there I am unable to send my PDF file directly to Watson, and I'm getting a "Media Type not supported" error.
I am also getting this error:
{
"code" : 500 ,
…

Umang
- 1
- 2
0 votes, 1 answer
IBM Watson Document Conversion responding with 415 error even though I ingest a PDF?
I have an HTML form that allows users to upload a file. It then uses IBM Watson's Document Conversion API to convert the document into normalized text, which is inserted into a database.
Upon testing, I have received the following…

Daniel La
- 1
- 1