Note as of 2013-09-13: I'm moving this project over to GitHub due to this:
Feel free to rejoin the more updated versions on
This is a wrapper written in Java that recursively walks a directory structure and calls an OCR engine on each PDF it finds, provided the engine has not already been called for that PDF. It works well with the ABBYY OCR Engine for Linux.
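The idea described above can be sketched roughly as follows. This is not the project's actual code, only a minimal illustration under stated assumptions: the class name `PdfOcrWalker`, the `.ocrdone` marker-file convention for remembering which PDFs were already processed, and the way the OCR command line is passed in are all hypothetical. The wrapper itself may track processed files differently.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;
import java.util.stream.Stream;

class PdfOcrWalker {

    // Assumption: "report.pdf" counts as processed once "report.pdf.ocrdone" exists next to it.
    static final String MARKER_SUFFIX = ".ocrdone";

    static Path markerFor(Path pdf) {
        return pdf.resolveSibling(pdf.getFileName().toString() + MARKER_SUFFIX);
    }

    /** Recursively collects all PDFs under root that have no marker file yet. */
    static List<Path> findPending(Path root) throws IOException {
        try (Stream<Path> paths = Files.walk(root)) {
            return paths
                    .filter(Files::isRegularFile)
                    .filter(p -> p.getFileName().toString()
                            .toLowerCase(Locale.ROOT).endsWith(".pdf"))
                    .filter(p -> !Files.exists(markerFor(p)))
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    /**
     * Runs the given OCR command once per pending PDF, appending the PDF path
     * as the last argument, and writes a marker file when the command exits with 0.
     * Returns the number of PDFs processed successfully.
     */
    static int processAll(Path root, List<String> ocrCommand)
            throws IOException, InterruptedException {
        int processed = 0;
        for (Path pdf : findPending(root)) {
            List<String> cmd = new ArrayList<>(ocrCommand);
            cmd.add(pdf.toString());
            Process process = new ProcessBuilder(cmd).inheritIO().start();
            if (process.waitFor() == 0) {
                Files.createFile(markerFor(pdf));
                processed++;
            }
        }
        return processed;
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 2) {
            System.out.println("usage: PdfOcrWalker <rootDir> <ocrCommand> [ocrArgs...]");
            return;
        }
        // args[0] is the directory to scan; everything after it is the OCR command line.
        List<String> ocrCommand = Arrays.asList(args).subList(1, args.length);
        int count = processAll(Paths.get(args[0]), ocrCommand);
        System.out.println("Processed " + count + " PDF(s)");
    }
}
```

Because the marker is written only after the OCR command exits successfully, a run that is interrupted or fails for some file simply retries that file on the next run. The OCR command and its arguments are deliberately left to the caller, since the exact command line depends on the engine installed.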
Very easy to use pdfocrwrapper