PDFBox
As of 2010-07-12, this project may now be found at http://pdfbox.apache.org.
Description
PDFBox is a Java PDF library that provides access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. It ships with a utility that extracts the text of a PDF document into a text file.
User Reviews
- PDFBox works great.
- I'm interested in text extraction from PDF files.