This crawler helps index binary documents such as PDF, Open Office, and MS Office files. It crawls a local file system (or a mounted drive), indexing new files, updating existing ones, and removing old ones. It can also crawl a remote file system over SSH/FTP, and it provides a REST interface that lets you “upload” your binary documents to Elasticsearch.
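The local crawling behavior (index new files, update existing ones, remove old ones) can be pictured as a comparison between two scans of the same directory. The following is a minimal Python sketch of that idea, not the crawler's actual implementation: `FileState`, `scan`, and `diff_snapshots` are hypothetical names, and treating a change in size or modification time as an "update" is an assumption made for illustration.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class FileState:
    """Hypothetical per-file fingerprint used to detect changes."""
    size: int
    mtime_ns: int


def scan(root):
    """Return {relative path: FileState} for every regular file under root."""
    root = Path(root)
    snapshot = {}
    for path in root.rglob("*"):
        if path.is_file():
            stat = path.stat()
            key = path.relative_to(root).as_posix()
            snapshot[key] = FileState(stat.st_size, stat.st_mtime_ns)
    return snapshot


def diff_snapshots(previous, current):
    """Split paths into (added, updated, removed) between two scans.

    Assumption: a file counts as updated when its size or mtime differs.
    """
    added = sorted(current.keys() - previous.keys())
    removed = sorted(previous.keys() - current.keys())
    updated = sorted(
        p for p in current.keys() & previous.keys() if current[p] != previous[p]
    )
    return added, updated, removed
```

In a sketch like this, paths in `added` and `updated` would be sent for indexing, and paths in `removed` would be deleted from the index. The snapshot from one run would be kept and passed as `previous` on the next run.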
This is a port of the very good Grails plugin, ckeditor. The JSP taglib is very easy to use, and the distribution package is in web archive (WAR) format. You can just drop the WAR file into a servlet container and test-drive it from there. For a tutorial, please visit the project website.