Command-line toolset for extracting text from files
multi-encoding strings(1) replacement with language identification
File type detector library
Locate32 finds files and directories based on file names.
An open source search engine with RESTFul API and crawlers
A Fast Duplicate File Detector with graph based semi-automatic cleaner
Trovi is a text search tool for PDF files