Questions tagged [doc]

Questions about the old Microsoft Word file format and how to use it.

The "doc" format is a proprietary binary file format that Microsoft Word used as its default format for text documents from the 1980s until 2007. Since the release of Word 2007, "docx" has been the default format.

All questions about the older (pre-.docx) Word files should use this tag. Questions about .docx files can use this tag, but should primarily use "docx". Questions about .odf, .txt, or .rdf files should not use this tag.
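To decide which of the two tags applies to a given file, the container type can be read from the first bytes of the file. The following is a minimal sketch, not an official detection method: the function name `sniff_container` is hypothetical, and the check only identifies the container. Legacy .doc files are stored in an OLE2 compound file (as are .xls and .ppt), while .docx files are ZIP archives (as are .xlsx and many other formats).

```python
OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")  # OLE2 compound file signature (legacy .doc)
ZIP_MAGIC = b"PK\x03\x04"                       # ZIP local file header (.docx)


def sniff_container(path):
    """Return 'ole2', 'zip', or 'unknown' based on the file's first bytes.

    Hypothetical helper: 'ole2' suggests a legacy .doc-style file and
    'zip' suggests a .docx-style file, but neither result proves it.
    """
    with open(path, "rb") as handle:
        head = handle.read(8)
    if head.startswith(OLE2_MAGIC):
        return "ole2"
    if head.startswith(ZIP_MAGIC):
        return "zip"
    return "unknown"
```

A result of 'ole2' points toward this tag, and 'zip' toward "docx". A file with a misleading extension is classified by its content, not its name.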

Wikipedia article on "doc": http://en.wikipedia.org/wiki/DOC_%28computing%29

Description of the compound document file format: http://sc.openoffice.org/compdocfileformat.pdf

851 questions
-1
votes
1 answer

How do I change font size in a docx file with Python

I am trying to create a Finance Log, but I cannot change the font size of one piece of text without the font size of all the text changing. I want "Finance Log" and "Created On..." to be in 48 and "Log Begins:" to be in 24. Tstyle =…
Lyra Orwell
  • 1,048
  • 4
  • 17
  • 46
-1
votes
1 answer

Prints out docstrings of all the attributes to an object but need a decorator to be legible

Here is some rough code that prints out the docstring of each attribute of an object, one after the other. I need to decorate a function inside a class so that I can distinguish between the different attribute docstrings. If you…
a2warik
  • 143
  • 1
  • 6
-1
votes
1 answer

Is it possible to extract a doc/docx file page-wise in PHP?

I have a web form where the user needs to upload a document file (doc or docx). The file contains formatted data on each of multiple pages. I want to extract that data, but I also need to know the page number. That means I want content of…
-1
votes
1 answer

Unix Shell script to remove duplicate files with the same filename but different file extension

I am brand new to Unix shell scripting. I have basic knowledge of commands and am trying to write my first script. The problem: I have an archive folder with a large number of PDF and DOC files. Many of them have both a PDF and a DOC version, e.g. hello.doc and…
PeterS
  • 3
  • 2
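The question asks for a shell script. As an illustration of the underlying idea only, here is a minimal sketch in Python instead. It assumes, without confirmation from the truncated question, that the PDF is the version to keep and the DOC the one to remove; the function names and the `keep_ext`/`drop_ext` parameters are hypothetical. Files are only listed unless `dry_run=False` is passed.

```python
from pathlib import Path


def find_redundant(folder, keep_ext=".pdf", drop_ext=".doc"):
    """Return files ending in drop_ext whose base name also exists with keep_ext.

    Assumption: the keep_ext copy is the one worth keeping.
    """
    files = [p for p in Path(folder).iterdir() if p.is_file()]
    kept_stems = {p.stem for p in files if p.suffix.lower() == keep_ext}
    return sorted(
        p for p in files
        if p.suffix.lower() == drop_ext and p.stem in kept_stems
    )


def remove_redundant(folder, dry_run=True, **kwargs):
    """Delete (or, by default, only report) the redundant files."""
    doomed = find_redundant(folder, **kwargs)
    for path in doomed:
        if dry_run:
            print(f"would remove {path}")
        else:
            path.unlink()
    return doomed
```

Calling `remove_redundant("archive")` prints the candidates without touching anything, which makes it easy to review the list before running it again with `dry_run=False`. The folder name `archive` is a placeholder.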
-1
votes
2 answers

Getting text from doc and docx

I'm using a computer with Windows 7 and Python 3.3 installed on it. At my organization we have thousands of documents which are not organized. I want to create a program that opens doc/docx files, searches the text for certain keywords and then…
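For the docx half of this problem, the text can be reached with the standard library alone, because a .docx file is a ZIP archive whose main body is stored in `word/document.xml`. The following is a minimal sketch under that assumption; the function names are hypothetical, it ignores headers, footers, footnotes, tabs and line breaks, and it does not handle the binary .doc format at all, which needs a separate tool or a conversion step.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace of WordprocessingML elements such as w:p (paragraph) and w:t (text).
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_paragraphs(path):
    """Return the text of each paragraph in the main body of a .docx file."""
    with zipfile.ZipFile(path) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # A paragraph's visible text is spread over one or more w:t elements.
        paragraphs.append("".join(node.text or "" for node in para.iter(W_NS + "t")))
    return paragraphs


def find_keywords(path, keywords):
    """Return the set of keywords that occur (case-insensitively) in the document."""
    text = "\n".join(docx_paragraphs(path)).lower()
    return {word for word in keywords if word.lower() in text}
```

Joining the `w:t` elements per paragraph matters because Word often splits a single word or phrase across several runs, so searching each run separately would miss matches.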
-1
votes
1 answer

How to update an MS Word file at node level (XML format) using Apache POI

Is it possible to get/set the native XML nodes directly using Apache POI, the ones we can see if we change the '.docx' extension of the MS Word format to '.xml'? I have a use case where I need to update such attributes of some XML nodes (for both doc and…
gaurav arora
  • 195
  • 3
  • 11
-1
votes
1 answer

Classification using Mallet and MaxEntropy

I want to preprocess docs (WSDL files) using Mallet in Eclipse. I want to generate feature vectors and perform classification using Mallet and MaxEntropy. I am new to Mallet; can anyone guide me in this regard? Thanks
sid
  • 9
  • 5
-1
votes
1 answer

How do content publishers maintain a similar UX on different devices?

How are developers able to handle publishing their stuff on devices with different screen sizes (and even different aspect ratios)? For instance, I own an old 10" and a newer 6" Kindle. For some ebooks, the placement of pictures and the text…
alexT
  • 49
  • 5
-1
votes
2 answers

PHP-exported DOC file does not display correctly in Microsoft Word

I am new to PHP. I have written a script for converting HTML to DOC. The file opens fine in an editor other than Microsoft Word, but when it is opened in Microsoft Word, some special characters are not recognized, including…
-1
votes
1 answer

Alternative to Phplivedocx which can change images dynamically in template

For my web application I use phplivedocx for generating docx/pdf reports. But now I need to insert images, which users will upload to my website, into the docx template. On the phplivedocx forums I found that this software doesn't support this…
dsplatonov
  • 745
  • 1
  • 8
  • 16
-1
votes
1 answer

Extracting headings followed by text

I want to write code that extracts the paragraph after each heading. For example, consider the following input document: Black-Box Testing The technique of testing without having any knowledge of the interior workings of the application is called…
-1
votes
1 answer

Programmatically add footnote (image) when printing

Completely new to this and don't even know where to start. Specifically, the problem is to programmatically add a picture (JPEG) as a footnote when printing PDF/DOC/DOCX files on Windows/Mac. Most people use MS Word (on both Windows and…
Aiden
  • 93
  • 1
  • 2
  • 8
-1
votes
1 answer

Apache pdfbox .doc to .pdf conversion

I'm trying to convert .doc to .pdf, but I got this exception and I don't know how to fix it. java.io.IOException: Missing root object specification in trailer at …
adi.neag
  • 633
  • 2
  • 12
  • 27
-1
votes
1 answer

Retrieval of table data in .doc file to .csv programmatically using C#

I am trying to retrieve data from tables in .doc files into .csv files programmatically using C#. Due to a few constraints, I cannot install Microsoft Office on the client machine. Is there any other way to accomplish this, as the Microsoft PIA won't work without…
DEVEL232
  • 21
  • 2
-1
votes
1 answer

Is it possible to attach two Word files to each other?

Just as in MS Excel one csv file may contain more than one sheet, I want to keep all the theory in one Word file and the programs in another Word file, but attach both files to each other as a single DOC…
Arpit Shah
  • 71
  • 7