Questions tagged [doc]

Questions about the old Microsoft Word file format and how to use it.

The "doc" format is a proprietary file format used by Microsoft as their main file format for text documents from the 1980s till 2007. With the release of Word 2007 "docx" is the main file format.

All Questions about the older (pre .docx) Wordfiles should use this tag. Questions about .docx files can use this tag, but should use "docx" primarily. Questions about .odf .txt .rdf should not use this tag.

Wikipedia about "doc" http://en.wikipedia.org/wiki/DOC_%28computing%29

Decrypted file format: http://sc.openoffice.org/compdocfileformat.pdf

851 questions
12
votes
2 answers

Convert DOC to PDF from Command Line

Anyone recommend a DOC to PDF converter that can be run from the command line? It seems like an easy requirement, but I have been coming up short on free solutions.
user489041
  • 27,916
  • 55
  • 135
  • 204
11
votes
7 answers

Convert a .doc or .pdf to an image and display a thumbnail in Ruby?

Convert a .doc or .pdf to an image and display a thumbnail in Ruby? Does anyone know how to generate document thumbnails in Ruby (or C, python...)
Nathan
10
votes
7 answers

Convert DOC file to DOCX with Java

I need to use DOCX files (actually the XML contained in them) in a Java software I'm currently developing, but some people in my company still use the DOC format. Do you know if there is a way to convert a DOC file to the DOCX format using Java ? I…
3rgo
  • 3,115
  • 7
  • 31
  • 44
10
votes
5 answers

How do I extract data from a doc/docx file using Python

I know there are similar questions out there, but I couldn't find something that would answer my prayers. What I need is a way to access certain data from MS-Word files and save it in an XML file. Reading up on python-docx did not help, as it only…
Stefan Urziceanu
  • 237
  • 1
  • 3
  • 10
9
votes
4 answers

Converting doc, docx, pdf to HTML using PHP linux

i run a job search site, and i need to convert doc, docx and pdf files into HTML on linux CentOS server running php. People submit these files as resumes. So far, I found PHPDocx to be great at converting docx to html. But I am stuck at doc/pdf.…
sam
  • 317
  • 1
  • 4
  • 8
9
votes
2 answers

How can I use Ghostscript to convert XPS to PDF or XPS to DOC?

Which one is the best third party libary to convert XPS to PDF and XPS to DOC.
Dhyanesh
  • 247
  • 1
  • 4
  • 12
9
votes
2 answers

How to read doc and docx file in java with POI api

I am trying to read doc and docx files. here is the code: static String distination="E:\\ static String docFileName="Requirements.docx"; public static void main(String[] args) throws FileNotFoundException, IOException { // TODO…
Khaled
  • 255
  • 4
  • 16
8
votes
3 answers

Where's the rails release notes or changelog

Rails 3.0.4 just released. But I don't see any changelog documents. So where is the rails 3.0.4 changelog? I need to know if I should upgrade all my apps to 3.0.4 from 3.0.3.
Lewis Z
  • 498
  • 7
  • 16
8
votes
10 answers

How to avoid java.lang.NoClassDefFoundError

I have a code for adding the texts to existing .doc file and it'll save that as another name by using apache POI. The following is the code I have tried so far import java.io.File; import java.io.FileInputStream; import…
Vignesh Vino
  • 1,242
  • 4
  • 25
  • 50
7
votes
2 answers

Reading .doc file in Python using antiword in Windows (also .docx)

I tried reading a .doc file like - with open('file.doc', errors='ignore') as f: text = f.read() It did read that file but with huge junk, I can't remove that junk as I don't know from where it starts and where it ends. I also tried installing…
Mithilesh Tipkari
  • 697
  • 1
  • 10
  • 16
7
votes
2 answers

How to convert html to doc in php

I need to convert a html file to doc. I am using html2pdf for pdf conversion. Is there is any same kind of library for html2doc? (PS must be free/open source) EDIT After Mark Eirich comment.. Here are two screenshots. Word document is not proper…
Mohit Jain
  • 43,139
  • 57
  • 169
  • 274
7
votes
1 answer

PHPWord bold certain words on a line

I was wondering if there was a way to bold certain words on a line. For example if I wanted every third word on a line bold, how would I do it. I am currently using addText but that requires the whole line to be bold or not bold. Any responses would…
user2579723
  • 213
  • 2
  • 3
  • 5
7
votes
4 answers

Reading doc and docx files using C# without having MS Office installed on server

I'm working on a project (asp.net, c#, vb 2010, .net 4) and I need to read both DOC and DOCX files, that I've previosly uploaded (I've done uploading part). Tricky part is that I don't have MS Office installed on server and that I can't use it. …
user1999722
  • 161
  • 1
  • 2
  • 3
6
votes
2 answers

Convert HTML & CSS to DOC(X)?

Is there some utility that could be called via command line to produce a doc(x) file? The source file would be HTML and CSS. I am trying to generate Word documents on the fly with PHP. I am only aware of phpdocx library, which is very low level and…
Tower
  • 98,741
  • 129
  • 357
  • 507
6
votes
2 answers

Is there any way to inline a const inside a doc comment (rendered by cargo doc)?

With "default" constructors, it can be useful to document what the… defaults are. If this is textually defined in the doc and separately defined as a literal or a static / const, the two can get out of sync: impl Foo { /// Creates a [Foo] with a…
Masklinn
  • 34,759
  • 3
  • 38
  • 57
1 2
3
56 57