Extract PDF Text

Hi, just a quick question: does anyone here know whether the iText library is capable of text extraction? I believe it can extract pages for insertion into new documents, but I am not so sure about extracting the text only...

Additionally, I was wondering whether the Multivalent library offers this function as well?

Thanks very much,

R B

P.S. I am aware of the PDFBox and JPedal libraries :-)

[469 byte] By [R.W.Butlera] at [2007-10-2 9:13:43]
# 1
Yes. You should be able to extract the actual text sections with iText. Take a look at their PdfCopy example. - Saish
Saisha at 2007-7-16 23:20:47
# 2
Sorry if I sounded surer about that than I am. I *think* it's possible. :^) - Saish
Saisha at 2007-7-16 23:20:47
# 3

Thanks for the help, but it seems PdfCopy is for copying pages of one PDF document directly into another. Having just read the following, I am now pretty convinced that iText cannot extract text:

" iText can't convert a text in PDF to some other 'readable' document format such as RTF, WORD or even plain text "

at:

http://itextdocs.lowagie.com/tutorial/general/copystamp/

However, I am still interested to know about Multivalent,

Thanks,

Ross

R.W.Butlera at 2007-7-16 23:20:47
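
To illustrate the difference discussed above between copying pages and extracting text: a page copy can move a page's content stream across as-is, whereas text extraction has to interpret the text-showing operators inside that stream. The following is only a rough sketch of that idea, not an iText, Multivalent or PDFBox API. The class name NaiveTjScan and the method extractShownText are hypothetical, and the code assumes an already-decompressed content stream that uses literal strings with the Tj operator and a simple encoding. Real documents usually compress their streams and rely on font encodings, which is why a dedicated library is normally needed.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: a naive scan for "(...) Tj" in an uncompressed content stream.
public class NaiveTjScan {

    /** Returns the literal strings shown by "(...) Tj" operators, in order of appearance. */
    public static List<String> extractShownText(String contentStream) {
        List<String> shown = new ArrayList<>();
        int n = contentStream.length();
        int i = 0;
        while (i < n) {
            if (contentStream.charAt(i) != '(') {
                i++;
                continue;
            }
            // Read a literal string, honouring nested parentheses and backslash escapes.
            StringBuilder text = new StringBuilder();
            int depth = 1;
            i++;
            while (i < n && depth > 0) {
                char c = contentStream.charAt(i);
                if (c == '\\' && i + 1 < n) {
                    // Simplification: keep the escaped character literally
                    // (covers \( \) and \\ but does not decode \n or octal codes).
                    text.append(contentStream.charAt(i + 1));
                    i += 2;
                    continue;
                }
                if (c == '(') {
                    depth++;
                } else if (c == ')') {
                    depth--;
                    if (depth == 0) {
                        i++;
                        break;
                    }
                }
                text.append(c);
                i++;
            }
            // Only keep the string if the next token is the Tj (show text) operator.
            int j = i;
            while (j < n && Character.isWhitespace(contentStream.charAt(j))) {
                j++;
            }
            boolean isTj = depth == 0
                    && contentStream.startsWith("Tj", j)
                    && (j + 2 == n || Character.isWhitespace(contentStream.charAt(j + 2)));
            if (isTj) {
                shown.add(text.toString());
                i = j + 2;
            }
        }
        return shown;
    }

    public static void main(String[] args) {
        // A tiny hand-written content stream, used here purely as sample input.
        String stream = "BT /F1 12 Tf 72 712 Td (Hello, PDF) Tj ET";
        System.out.println(extractShownText(stream));
    }
}
```

Strings shown through other operators (for example the TJ array form) are deliberately ignored here, which is one more reason to treat this as an illustration rather than a usable extractor.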