Desktop search application
DocFetcher is an Open Source desktop search application: It allows you to search the contents of files on your computer. — You can think of it as Google for your local files. The application runs on Windows, Linux and Mac OS X.
An open source search engine with RESTFul API and crawlers
OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on Windows and Linux/Unix/BSD.
Common Resource Grep
CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you will find binary downloads and discussion (https://sourceforge.net/p/crgrep/discussion/) . The actual development and issue tracking can be found here: https://bitbucket.org/cryanfuse/crgrep
Find, select, delete empty folders. Use the ignore list and others.
Empty Directories Finder expires empty directories and allows you to conveniently select the required directories before deleting.
Datafari is an advanced open source enterprise search solution!
Fork of Search Monkey project
Power searching without the pain. Perform powerful desktop searches without having to index your system using regular expressions. Graphical equivalent to grep. Fork have new features like displaying context of files and "Open With" menu in Linux with KDE.
FastFreeTextSearchFiles (fastfreetextsea) is a external, internal hard disk, USB flash pen, file indexer for Fast Free Text Search.
Jake is a console based app written in python and qt4. Plugins will let you do almost anything, for example, search in google, translate, view images, talk with it (aka AI bot). Also, skining system will let you choose how should jake look.
Nucular Archiving System for creating full text indices for fielded data. Python API, web, and command line interfaces. Fast. Very light weight. Concurrent read/writes with no possible locking issues. No server process. Proximity. Facets. Funny name.
Semantic annotation of archaeology reports with respect to CIDOC-CRM
The semantic annotation system OPTIMA is the result of Andreas Vlachidis PhD work, (supervised by Prof. Douglas Tudhope, University of Glamorgan, UK). OPTIMA performs the NLP tasks of Named Entity Recognition, Relation Extraction, Negation Detection and Word Sense Disambiguation using hand-crafted rules and SKOS terminological resources (English Heritage Thesauri and Glossaries). The resulted semantic annotations are associated with classes of the (ISO 21127:2006) CIDOC Conceptual Reference Model (CRM) and its archaeological extension, CRM-EH. OPTIMA is also targeted at the detection and recognition of contextual relations between CRM entities. Such relations are modeled with respect to the CRM-EH archaeology extension. The pipeline targets the CIDOC-CRM entities; E19.Physical_Object, E53.Place, E49.Time_Appellation and E57.Material and the CRM-EH entities; EHE1001.Context_Event, EHE1002.Production_Event, EHE1004.Deposition_Event and P45.consists_of material property
Lucene/Solr based search engine and workflow system
Important: This project has been moved to https://github.com/statsbiblioteket/summa/ Lucens (and Solr) based search engine with very flexible setup and workflow system. It supports incremental updates, hierarchical faceting and index lookup with low memory overhead. Note: Although Summa is open source, the focus is on features used at Statsbiblioteket. No explicit resources has been allocated for support of external users.
Successor: <a href="http://syncarus.net">syncarus.net</a>
SynchroMike is a two-way synchronisation program which allows the user to synchronize two directories. It comes with a handy user interface which displays differences between both storage locations as a tree. Update: This page is deprecated, find the new program at <a href="http://syncarus.net">syncarus.net</a>
Find files within other ZIP JAR WAR EAR files. Search recursively for file names or search strings. This Java based utility can save you time when you wonder in which jar or ear file a particular class is defined.
OSSSE - (Open Source Software Search Engine). Webcrawler, parsers for HTML and other Documents. Powerful full-text search, hit highlighting, faceted search, dynamic clustering & much more.
Useful console application to search for largest files on file systems
sfbf (Search For Big Files) is a useful console application to search for largest files on file systems. It brings features such as : - Fast file search to locate largest files - Configurable search depth - Various output formats (txt, json, xml, html, csv) - Regular expression matching This project is licensed under the WTFPL (Do What The Fuck You Want To Public License, Version 2) Copyright © 2017 spypunk email@example.com This work is free. You can redistribute it and/or modify it under the terms of the Do What The Fuck You Want To Public License, Version 2, as published by Sam Hocevar. See the COPYING file for more details.
UI frontend for sfbf, a useful application to search for largest files
UI frontend for sfbf, a useful application to search for largest files on file systems It brings features such as : - Fast file search to locate largest files - Configurable search depth - Various export formats (txt, json, xml, html) - Regular expression matching This project is licensed under the WTFPL (Do What The Fuck You Want To Public License, Version 2) Copyright © 2017 spypunk firstname.lastname@example.org This work is free. You can redistribute it and/or modify it under the terms of the Do What The Fuck You Want To Public License, Version 2, as published by Sam Hocevar. See the COPYING file for more details.