Questions tagged [document-conversion]

Document conversion is the act of converting a document from one format to another so that it can be read in more applications. Documents can be converted into other source document formats, consumer formats, or structured data.

70 questions
0
votes
3 answers

Are there any free PHP libraries to convert PPT or ODT to PDF?

I'm working on a document manager on a shared hosting account (Linux) that I don't have command-line access to. When people upload presentations, I need to convert them to PDF for people to download. I have read that there is a way to do this by…
Chris Headleand
  • 6,003
  • 16
  • 51
  • 69
0
votes
1 answer

Persian PDF file to Word file Converter

Persian PDF to Word Converter. I have created an application to convert a PDF file into a Word file. There is a problem with Persian PDF files when I try to convert them. The application converts the PDF into a Word file but the text format is not…
behrooz
  • 9
  • 5
0
votes
0 answers

Aspose for .NET | Error while converting Word to PDF for reference number data type numbers

I am getting this error while converting a Word document to PDF. The error is shown in place of the reference number text (screenshot omitted). The text has a reference type (screenshot omitted).
0
votes
1 answer

TIFF output is not as expected for black-and-white 1200 dpi LZW test file created using Universal Document Converter versions 6.7 and 6.8

Respected Sir/Madam, I have a question regarding LZW BW 1200 dpi TIFF file creation using “UDC driver 6.7/6.8 version”. If we disable “Perform High-Quality Smoothing”, then the output data are not visible in the output files. If we enable this option, it is…
Shant
  • 11
  • 5
0
votes
1 answer

How to view Google Docs or Word files in a Flex application?

I am evaluating Flex for a mobile and a web application, and one of the things I want to be able to do is show a Google Docs or Microsoft Office file (doc, ppt) within a Flex application. Google Docs files are published as web pages, so I guess you need a…
user308808
0
votes
1 answer

Not able to read file in Pypandoc

I am trying to convert a PDF to HTML using Pandoc. I have installed the pandoc binary, added the environment variable path, and am then using: import pypandoc import os os.environ.setdefault('PYPANDOC_PANDOC', 'C://Program…
SUBHRA SANKHA
  • 118
  • 1
  • 2
  • 11
0
votes
0 answers

How to run OpenOffice on a server and perform conversion from my local system

I have a requirement where I need to use OpenOffice on a standalone server and use a Java program for document conversion. Right now, I have a setup where I have started OpenOffice in my Linux environment by using the command below: sudo -H …
0
votes
1 answer

C# Word add-in: convert activeDocument to a virtual PDF and merge them into one PDF document

I am creating multiple virtual documents and then I want to merge them into one PDF, without saving them anywhere. All I have found so far are guides that save the document as a PDF somewhere and then read the PDF back to merge them into one.…
0
votes
1 answer

Convert PDF to RTF or DOC

Is it possible to convert a PDF document to RTF or DOC using Java?
Madalina
  • 1,297
  • 6
  • 15
  • 25
0
votes
1 answer

Conversion of PDF to EPub

I am creating an application to convert HTML pages to ePub format. I tried converting the file to PDF, since I require a table of contents as the first page of the ePub file. I have used Spire PDF and Spire DOC for this purpose. To convert to ePub,…
Shubashree Ravi
  • 261
  • 1
  • 16
0
votes
1 answer

Ruby: parse/extract images and objects from docx file

I am trying to open and read a .docx file using Ruby, extract portions of the text and objects/images, and save them into another (non-.docx) file. Using Nokogiri, I am able to properly extract text and do my partitioning of the document into the…
0
votes
1 answer

Splitting complex PDF files using Watson Document Conversion Service

We are implementing a question answering system using Watson Discovery Service (WDS). We require each answer unit to be available as a single document. We have complex PDF files as our corpus. The PDF files contain two-column data, tables, and images. Instead…
0
votes
1 answer

I need to convert DOC/TXT files to PDF in large batches

We are changing systems and the new system only outputs .DOC or .TXT files for reports. Several of the reports that come out need to be converted to PDF so they are available for our web users on a daily basis. Currently I am testing about 1500 of a…
0
votes
1 answer

Getting "415:Media not supported" error when passing a PDF to IBM Watson in Salesforce

I am planning to integrate the IBM Watson Document Conversion service with Salesforce. From Salesforce, I am unable to send my PDF file directly to Watson, and I get a "Media Type not supported" error. I am also getting this error: { "code" : 500 , …
Umang
  • 1
  • 2
0
votes
1 answer

IBM Watson Document Conversion responding with 415 error even though I ingest a PDF?

I have an HTML form that allows users to upload a file. It uses IBM Watson's Document Conversion API to convert the document into normalized text, which is then inserted into a database. Upon testing, I have received the following…
Daniel La
  • 1
  • 1