DOC2TXT or atleast DOC2RTF

Hi,

I have to read the MS-WORD .doc file into a String and perform text operations on it. I might even need to save this string to a file. Basically i want to convert .doc file into .txt file.

To do this I found 2 products, viz:

1. POI by apache

and

2. text-mining0.4 by www.textmining.org

i am currently using the second one. but i m not really satisfied with the same. its kind of outdated and does not support Fast-Saved files(a ms-word internal format).

does anyone know of a better and a more efficient tool written in java?

I had to do the same with RTF files too. I have done it easily by the RTFEditorKit.

so, for me a DOC-2-RTF converter will also be a great help.

any suggestions?

kapilChhabra

[778 byte] By [kapilChhabra] at [2007-9-30 20:15:19]
# 1
Unfortunately u dont have any direct methods in Java to convert .doc to .txt Even I started trying it but i failed when i started trying reading .doc file, bcos it contains info regarding font size, color etc..according to my knowledge POI would bebetter one
sivaji_sun at 2007-7-7 1:00:34 > top of Java-index,Administration Tools,Sun Connection...
# 2
Thanks.I have tried POI. But i believe testmining is better than this. At least it is faster.I donot know if Fast-Saved files are supported by POI.
kapilChhabra at 2007-7-7 1:00:34 > top of Java-index,Administration Tools,Sun Connection...