Word2Forrest - Apache POI Word Document Example

Q

What is the Word2Forrest example in the Apache POI package? How do I run the Word2Forrest example?

✍: FYIcenter.com

A

The Word2Forrest example in the Apache POI package is a Word-to-XML conversion program. It reads a Word document in *.doc format and generates an XML file containing the text from that document.

The Word2Forrest example uses the Horrible Word Processor Format (HWPF) API, which supports Word documents in *.doc format only.

Here is the command to run Word2Forrest to convert a Word document in *.doc format (compatible with Microsoft Word 97) to an XML file:

C:\fyicenter>c:\local\jdk-1.8.0\bin\java 
   -cp .;C:\local\poi-3.15\poi-examples-3.15.jar;
         C:\local\poi-3.15\poi-3.15.jar;
         C:\local\poi-3.15\poi-scratchpad-3.15.jar 
   org.apache.poi.hwpf.Word2Forrest simple.doc

Note that simple.doc was created by opening simple.docx in Word and saving it in Word 97 (*.doc) format.
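
Because HWPF handles only the binary *.doc format, it can help to check a file before passing it to Word2Forrest. The following is a minimal sketch that uses only the Java standard library and is not part of Apache POI; the class name DocFormatCheck and its methods detectHeader and detectFile are hypothetical names chosen for illustration. It relies on two well-known file signatures: binary *.doc files are stored as OLE2 compound files, while *.docx files are ZIP packages. An OLE2 signature only identifies the container, since *.xls and *.ppt files use it too, so treat the result as a quick sanity check rather than proof that a file is a Word document.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper, not part of Apache POI.
public class DocFormatCheck {

    // Signature of an OLE2 compound file (container used by binary *.doc).
    private static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // Signature of a ZIP local file header (container used by *.docx).
    private static final byte[] ZIP_MAGIC = { 0x50, 0x4B, 0x03, 0x04 };

    // Classifies the leading bytes of a file as "OLE2", "ZIP" or "UNKNOWN".
    static String detectHeader(byte[] header) {
        if (startsWith(header, OLE2_MAGIC)) {
            return "OLE2";
        }
        if (startsWith(header, ZIP_MAGIC)) {
            return "ZIP";
        }
        return "UNKNOWN";
    }

    // Reads up to 8 bytes from the file and classifies them.
    static String detectFile(Path file) throws IOException {
        byte[] header = new byte[8];
        int total = 0;
        try (InputStream in = Files.newInputStream(file)) {
            // A single read() may return fewer bytes than requested, so loop.
            while (total < header.length) {
                int n = in.read(header, total, header.length - total);
                if (n < 0) {
                    break; // end of file reached before 8 bytes
                }
                total += n;
            }
        }
        return detectHeader(Arrays.copyOf(header, total));
    }

    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("Usage: java DocFormatCheck <file>");
            return;
        }
        System.out.println(args[0] + ": " + detectFile(Paths.get(args[0])));
    }
}
```

Run against the files from this example, the check should report OLE2 for simple.doc and ZIP for simple.docx; only a file reported as OLE2 is a candidate for Word2Forrest.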

If you open the output file, text.xml, in the Notepad++ editor, you will see an XML file containing the text from simple.doc:
Apache POI Word Document Example - Word2Forrest

 


2017-02-22