find duplicate files on your harddisk very fast by comparing only needed files
Finddupe is a VERY FAST command line C program to catalog very large archives, identifying duplicate files even when offline. It has many features. You can easily grep a catalog to find what you have, and locate where it is.
find extension tool : list all extensions of a directory
gDiscoverer is a search tool, that is providing a inexact search, that means, that you could type in "lunex" and get "Linux" as result.
Provides remote searching to Google Desktop Search
This is a simple Java front-end for the UNIX grep. It lets you search file contents also if you are not familiar with the command line or the regular expression syntax.
iTCP/IP is a Portable TCP/IP Stack for Embedded System write by C language, run platform is real time OS(like-linux, ucos,etc).
iTagged is a Java Swing application that allows the user to create tags intuitively for the files stored on his/ her machine locally. This is a very effective tool to organize and book mark things that we deem necessary.
The final goal of this software is to ease to go back to the workflow from the final output. For example, you can find the original word file by right-clicking the final output PDF file.
A configurable knowledge management framework. It works out of the box, but it's meant mainly as a framework to build complex information retrieval and analysis systems. The 3 major components: Crawler, Analyzer and Indexer can also be used separately.
This is a Python cli command line utility that allows you to search for Java classes/files and packages in jar/ear/war 's on your system under a specific directory / path All docs are on the wiki: http://javaclassfind.wiki.sourceforge.net/
Kato is an approach to bring the work done on software agents out of academia and into the public arena. Developers can create agents as easily as they can Drupal modules.
The Fast Index Library is an open source C++ template library which is used to build full text indexes based on Boolean, vector, extended Boolean or probabilistic models.
Searches for a file within another file
Searches for a file within another file using the MIME database. Works similarly to the *nix command 'file', but instead of searching only the header, it advances byte by byte looking for the second file type. This program uses the libmagic library.
The script allows you to get the full name of the file per or python module installed in the system and perform the action above it
OSSSE - (Open Source Software Search Engine). Webcrawler, parsers for HTML and other Documents. Powerful full-text search, hit highlighting, faceted search, dynamic clustering & much more.
PileWorks provides the organizational structure for coordinating several different projects which approach some aspect of Pile Technology. PileWorks defines a set of interfaces and implements some basic infrastructure for Pile engines and agents.
A dirt simple python tool using wx.Python and the librets library to look at a RETS server's complete metadata.
Index the web pages you visit with the Recoll text search tool
Please note that this extension is superceded by a new one based on the WebExtensions API for Firefox 57 and later: https://addons.mozilla.org/en-US/firefox/addon/recoll-we/ The source for the new version is here: https://opensourceprojects.eu/p/recollwe/code/ This Firefox extension allows you to include the web pages that you visit in Firefox in the index built by the Recoll text search tool. The extension has been reviewed and should now be directly installed from the Mozilla addons catalog: https://addons.mozilla.org/en-US/firefox/addon/recoll-indexer-1/ The package on the Mozilla site is the up-to-date version of this project, and replaces the sourceforge downloads, which are kept only as history. The code repository remains on sourceforge.
Python script for doing mass extended-regular-expression based replacements on a stdin stream.
rlocate is an implementation of the "locate" command that is always up-to-date
Simple fast and lightweight identical/duplicate files searching tool
Simple fast and lightweight identical/duplicate files searching tool with graphical interface (GTK+ 3).
A simple command line regex search [and replace] written in Python. Searches individual files or whole directories, with the option to search recursively into subdirectories.
Power searching without the pain. Perform powerful desktop searches without having to index your system using regular expressions. Graphical equivalent to grep.
PHP script to log search engine spider visits to your homepage. Find out out when and where search engine bots are crawling your site. Features: email reports and/or log file reports for 32 spiders, monitor php and html files REQUIREMENTS:PHP4 or later