Data Ninja
A document clustering system with search & report generation features
A university project: document clustering software for an audit client, with additional search and report generation features. The main clustering task takes a directory of documents as input and outputs an Excel spreadsheet of clusters, where each cluster groups documents that are similar to each other.
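The description above does not say which similarity measure or clustering algorithm the project uses. As a rough illustration only, the sketch below assumes TF-IDF weighting, cosine similarity, and a simple greedy grouping with a hypothetical `threshold` parameter. The function names are made up for this example. Documents are passed in as a dictionary of name to text, so reading the directory and writing the Excel spreadsheet are left out.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Map each document name to a sparse TF-IDF vector (term -> weight)."""
    counts = {name: Counter(tokenize(text)) for name, text in docs.items()}
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for c in counts.values() for term in c)
    n = len(docs)
    return {
        name: {
            term: tf * (math.log((1 + n) / (1 + doc_freq[term])) + 1)
            for term, tf in c.items()
        }
        for name, c in counts.items()
    }


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_documents(docs, threshold=0.3):
    """Greedy grouping (an assumption, not the project's stated method).

    Each document joins the first cluster whose seed document (its first
    member) is at least `threshold` similar; otherwise it starts a new cluster.
    """
    vectors = tfidf_vectors(docs)
    clusters = []
    for name in sorted(docs):
        for cluster in clusters:
            if cosine(vectors[name], vectors[cluster[0]]) >= threshold:
                cluster.append(name)
                break
        else:
            clusters.append([name])
    return clusters


if __name__ == "__main__":
    sample = {
        "a.txt": "invoice payment overdue invoice",
        "b.txt": "payment invoice overdue reminder",
        "c.txt": "holiday schedule office closed",
    }
    for number, members in enumerate(cluster_documents(sample), start=1):
        print(number, members)
```

Each inner list returned by `cluster_documents` would correspond to one cluster in the output spreadsheet. A greedy pass is order-dependent, so a real system would more likely use an established algorithm such as k-means or hierarchical clustering.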
The search feature takes user-supplied search terms and a directory of documents as input and outputs an Excel spreadsheet listing all documents containing the search...
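How terms are matched is not stated here. The following minimal sketch assumes case-insensitive, whole-word matching and reports a document if it contains at least one term. The function name `search_documents` is hypothetical, and documents are again passed in as a dictionary of name to text rather than read from a directory.

```python
import re


def search_documents(docs, terms):
    """Return {document name: [matched terms]} for documents with at least one match.

    Matching is case-insensitive and on whole words (an assumption).
    """
    hits = {}
    for name, text in docs.items():
        words = set(re.findall(r"[a-z0-9]+", text.lower()))
        matched = [term for term in terms if term.lower() in words]
        if matched:
            hits[name] = matched
    return hits


if __name__ == "__main__":
    sample = {
        "a.txt": "Invoice payment overdue",
        "b.txt": "Holiday schedule for the office",
    }
    print(search_documents(sample, ["invoice", "holiday", "budget"]))
```

Returning the matched terms per document keeps both readings open: a caller can list every document with any hit, or filter down to documents that matched all terms.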