OOSearch is a full text search program for OpenOffice.org files. It opens all OpenOffice.org files in a specified directory or volume and searches for a given keyword. The found files can be opened via OOSearch directly.
Semantic annotation of archaeology reports with respect to CIDOC-CRM
The semantic annotation system OPTIMA is the result of Andreas Vlachidis's PhD work (supervised by Prof. Douglas Tudhope, University of Glamorgan, UK). OPTIMA performs the NLP tasks of Named Entity Recognition, Relation Extraction, Negation Detection and Word Sense Disambiguation using hand-crafted rules and SKOS terminological resources (English Heritage Thesauri and Glossaries). The resulting semantic annotations are associated with classes of the CIDOC Conceptual Reference Model (CRM, ISO 21127:2006) and its archaeological extension, CRM-EH. OPTIMA also detects and recognizes contextual relations between CRM entities; these relations are modeled with respect to CRM-EH. The pipeline targets the CIDOC-CRM entities E19.Physical_Object, E53.Place, E49.Time_Appellation and E57.Material, the CRM-EH entities EHE1001.Context_Event, EHE1002.Production_Event and EHE1004.Deposition_Event, and the P45.consists_of material property.
PDFcat - Portable Document (PDF) Catalog Manager
Personalized Search Engine for Your Files
MySearchEngine (Personalized Search Engine) is a Java program for searching files and folders in an OS file system. It differs from general OS file search engines in that it personalizes the indexing setup: users can choose which directories to index or remove from an existing index. It can also suggest queries, much like Google's "Did you mean" feature. The customized indexing and query suggestion improve search speed and make the user experience more comfortable. eLibrary can also extract text content from files of many widely used file types, such as pdf, doc, ppt, and mp3, to improve the index quality.
File Search and Launch Utility Program for Windows
A small and fast file searching utility program. It was originally created for a service department to find and display service manuals. It is written in plain C using the Win32 API and consists of a single executable file, so it is easy to install and uninstall. It searches recursively from the start folder, including subfolders, and matches the search term against the file path name, ignoring spaces (" ") and hyphens ("-") in both the file path name and the search term. It presents a list of the full file paths it has found. Double-clicking one of the paths launches the file with its associated program.
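The matching rule described above can be sketched in a few lines. This is only an illustration in Python, not the program's C/Win32 code: the function names `normalize` and `search` are hypothetical, and case-insensitive comparison is an assumption that the description does not state.

```python
import os

def normalize(text):
    """Drop spaces and hyphens; lower-casing is an assumption, not from the description."""
    return text.lower().replace(" ", "").replace("-", "")

def search(start_folder, term):
    """Return full paths under start_folder whose normalized path contains the normalized term."""
    needle = normalize(term)
    matches = []
    # os.walk visits the start folder and every subfolder recursively.
    for folder, _subfolders, files in os.walk(start_folder):
        for name in files:
            full_path = os.path.join(folder, name)
            if needle in normalize(full_path):
                matches.append(full_path)
    return sorted(matches)
```

Because both sides are normalized the same way, a term such as "service-manual x100" would match a file named "Service Manual-X100.pdf".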
Narrows the search results produced by popular Internet search engines by letting you add extra filtering conditions, such as words that must be present or words that must be excluded.
Spotlight Remover is an AppleScript application that runs through the application Terminal (most likely already on the computer) to move your annoying Spotlight feature to a backup folder in ~/Documents.
This is the "Eclipse of Web Browsers", a secure social web browsing and multi-user messaging system. You will need to run your own version of mysql. Development is active and we are seeking project leaders. Please email suprasphere___at___gmail.com.
Index the files on your USB drive in full-text mode. Store this powerful indexer directly on your USB key or USB HDD, choose a folder on the drive, and index it to run powerful searches on your USB key.
simple BNF parser makes xml markup of matches
bnf2xml is a simple BNF parser that takes text as input, searches it according to a BNF query file, and outputs the text marked up with XML labels that show context. bnf2xml is as simple to use as any text-processing binary, e.g. awk(1) or grep(1). It does not require a C API because it outputs simple XML labeling. The README is visible on the file download page.
EXAMPLE:
    $ echo "hi" | bnf2xml patternfile
    <word><alph>h</alph><alph>i</alph></word>
or
    <gas>hydrogen iodide</gas>
The patternfile says how to find the needle in the haystack and what to show, i.e.:
    <alph> ::= a | b | c | d ...
    <word> ::= <alph>+
bnf2xml is a top-down recursive parser. Unlike bottom-up parsers like gcc(1), or some top-down parsers, bnf2xml is completely unambiguous and resolves ALL conflicts. It is slower on average for parsing C, or than sed(1) for simple searches, but far easier than using flex/C to create a parser. Caveat: I do not suggest it is worthwhile to make a new gcc(1) using bnf2xml. bnf2xml is an nth BETA release, but there have been no complaints yet.
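To illustrate the idea of top-down recursive matching with XML labeling, here is a minimal Python sketch. It is not bnf2xml's implementation: it does not read a pattern file, the `GRAMMAR` table and the functions `parse` and `markup` are hypothetical, and it makes no attempt to reproduce how bnf2xml resolves conflicts. The grammar is hard-coded to mirror the two example rules above.

```python
import string

# Hypothetical in-memory grammar mirroring the example pattern file:
#   <alph> ::= a | b | c | d ...
#   <word> ::= <alph>+
GRAMMAR = {
    "alph": ("choice", list(string.ascii_lowercase)),
    "word": ("plus", "alph"),
}

def parse(rule, text, pos):
    """Try to match `rule` at text[pos:]; return (xml, new_pos) or None."""
    kind, arg = GRAMMAR[rule]
    if kind == "choice":
        # Take the first literal alternative that matches at this position.
        for literal in arg:
            if text.startswith(literal, pos):
                return f"<{rule}>{literal}</{rule}>", pos + len(literal)
        return None
    if kind == "plus":
        # One or more repetitions of the sub-rule, matched recursively.
        parts = []
        while True:
            result = parse(arg, text, pos)
            if result is None:
                break
            xml, pos = result
            parts.append(xml)
        if not parts:
            return None
        return f"<{rule}>{''.join(parts)}</{rule}>", pos
    raise ValueError(f"unknown rule kind: {kind}")

def markup(text, start="word"):
    """Label every match of `start` in text; copy unmatched characters through unchanged."""
    out = []
    pos = 0
    while pos < len(text):
        result = parse(start, text, pos)
        if result is None:
            out.append(text[pos])
            pos += 1
        else:
            xml, pos = result
            out.append(xml)
    return "".join(out)

print(markup("hi"))  # <word><alph>h</alph><alph>i</alph></word>
```

Each rule is tried from the top (`word`) down to its parts (`alph`), which is what makes the parser top-down; the sketch copies unmatched text through without XML escaping.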
find files with a path length that is longer than ...
Sometimes (for example, when an AD backup fails) you need to know which paths are longer than 248 characters and which full file names are longer than 260 characters (these numbers are examples, taken from the Active Directory backup limits). This small application finds these paths, shows them, and, when you double-click one, takes you to the file in Windows Explorer.
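The check itself is simple to sketch. The following Python snippet is an illustration only, not the application's code: `find_long_paths` and its parameters are hypothetical names, and the default limits are the example numbers from the description.

```python
import os

MAX_DIR_LEN = 248   # example limit for folder paths, from the description
MAX_FILE_LEN = 260  # example limit for full file names, from the description

def find_long_paths(root, max_dir_len=MAX_DIR_LEN, max_file_len=MAX_FILE_LEN):
    """Return folders and files under root whose full path exceeds the given limits."""
    long_paths = []
    for folder, _subfolders, files in os.walk(root):
        # A folder is reported when its own path is too long.
        if len(folder) > max_dir_len:
            long_paths.append(folder)
        # A file is reported when folder plus file name is too long.
        for name in files:
            full_path = os.path.join(folder, name)
            if len(full_path) > max_file_len:
                long_paths.append(full_path)
    return long_paths
```

Passing the limits as parameters keeps the sketch usable with other thresholds, since the description notes that 248 and 260 are only examples.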
Windows tool for quickly launching apps, directories and stored web sites.
Runit is a small, portable Windows program that makes it more comfortable to run apps, files, directories and web sites. Runit lets you quickly search by term for any file, defined web site or directory, and then quickly launch it.
Seek is an application that allows you to quickly search through files on your computer. It is similar to a desktop search application, however it is much faster than existing desktop search applications, and only searches filenames, not within files.
Portable source code librarian
Snippetsource is a simple yet powerful repository to store code snippets or any other text content. SQLite is used as the database backend which makes fast indexed lookups possible.
Tautomaton is a C++11 template library for deterministic (DFA) and non-deterministic (NFA) finite automata. It supports regular expressions and efficient input matching of multiple regexps simultaneously. The library comes with a somewhat grep-like command-line tool for showcasing these features.
Find duplicate files
Two goals: find duplicated files on a computer, and find files which haven't been backed up yet.
Search for duplicate files fast and with manual filters. Portable application/no need for installation
It's an open-source search engine
4buntu is a set of scripts to install a collection of digital forensic tools on top of a Linux system. The tools provide a complete forensic workstation to investigate different systems such as Windows, Linux and Mac OS X.
AASE (Anarchy Advantage Search Engine) is a search engine that reorders search results based on the habits of its users. By measuring user activity on the search engine result page, the search engine constantly improves the search results.
Enable your academic documents on your hard-drive to be searched using an automated solution with limited user-intervention. All this is done in a non-intrusive manner, ensuring your files are not moved.
A searcher and indexer for easily and quickly locating relevant information in a large collection of research papers. A Java backend with a web-based frontend. Based on the Lucene indexer and searcher.
Management of ad campaigns. Initially it will manage Google AdWords campaigns. This will be extended to Yahoo and other AdWords-like campaigns.
Agent based Regional Crawler strategy implementation - gathers users' common needs and interests in a certain domain. It crawls based on these interests, instead of crawling the web without any predefined order.