0

is there a simple way of reading a microsoft word document in java without using Apache POI package?

Or what are other alternative to achieving the same without using Apache POI package?

StackTrace
  • 9,190
  • 36
  • 114
  • 202
  • What do you mean by "reading"? Word documents from recent versions of MS Office are zip-compatible packages containing XML documents corresponding to the text-based content of the document - you can read those using Java's ZIP and XML libraries ... – alexbrn Jun 17 '11 at 09:05
  • http://stackoverflow.com/questions/7731948/java-library-for-reading-word-documents –  Mar 23 '17 at 14:18
  • Here is a good link: [http://stackoverflow.com/questions/6383473/read-microsoft-word-document-in-java](http://stackoverflow.com/questions/6383473/read-microsoft-word-document-in-java) –  Mar 23 '17 at 14:19

4 Answers4

1

Using Java to Crack Office 2007 might help.

KMån
  • 9,896
  • 2
  • 31
  • 41
0

There is a commercial alternative which is pretty easy-to-use but commercial - Aspose

Igor Konoplyanko
  • 9,176
  • 6
  • 57
  • 100
0

java docx ? http://www.javadocx.com/ it's not much for reading, but for generating dynamic output.

Cosmin Cosmin
  • 1,526
  • 1
  • 16
  • 34
0

Sort of. If it is a WordML file, you can unzip it and then manually extract the data from the XML files inside, but trust me, it is not a pleasant process. When we tried to do it at my company we had hoped to be able to leverage some of the OOo libraries to clean up the documents and, well, it didn't work too well.

cwallenpoole
  • 79,954
  • 26
  • 128
  • 166