I need to use docx and xml files for translation prozess, not all of the translation tools can read xml, but docx, thats why i, xml because it can be better assigned to each other, i want to convert plain text from docx to xml and backwards(from xml to docx), with what can i begin , do you know if there is a way to do it programmatically using java? is it with dom4j possible?
Asked
Active
Viewed 5,099 times
1
-
DOCX *is* XML - a zip-compressed package of XML files – Panagiotis Kanavos Feb 08 '17 at 12:41
1 Answers
1
You can have a look at docx4j The site says:
docx4j is a Java library for creating and manipulating Microsoft Open XML (Word docx, Powerpoint pptx, and Excel xlsx) files.
Or, Apache POI. The site says:
The Apache POI Project's mission is to create and maintain Java APIs for manipulating various file formats based upon the Office Open XML standards (OOXML) and Microsoft's OLE 2 Compound Document format (OLE2). In short, you can read and write MS Excel files using Java.
One of the two should suffice. Since you are only concerned with docx
, go for the first. If there are multiple formats, go for the second.

An SO User
- 24,612
- 35
- 133
- 221
-
1Anytime. Always make it a habit to search StackOverflow for questions before posting. Most of the questions have already been answered. Saves you time and effort. =) – An SO User Dec 04 '13 at 12:52