Fast SMB Search is a search engine for local SMB-based networks (e.g., Windows networks). Its key feature is the ability to quickly search for a file in a large network. It also supports FTP search, so the project name is not strictly accurate.
A fusion of several open-source libraries and a web application to parse and filter RSS feeds, as well as generate RSS feeds based on user-defined search terms.
A short Python script designed to wget lists of websites and concatenate them into "summaries" for offline viewing.
This is a CMS built in PHP.
Frassle is an interpersonal content management and blog system. It is a complete solution for a self-organizing web community, with blog publishing, RSS aggregators, a search engine, social bookmarking, and advanced organizational features.
A search engine script built on the Sphinx full-text search engine. The script connects to the Sphinx API to return search results from MySQL.
Web-based full-text document search consisting of a database-supported indexer that handles multiple file types and languages, and a retrieval front-end with a search-engine-like interface that includes relevancy ranking and Ajax-based document preview.
An open-source, Friendster-like social networking portal and news site written in PHP. Post and read news plus browse through contacts like you would in Friendster, Orkut, Tribe.net or Ringo with the knowledge that your personal information is safe.
Google Desktop Search (GDS) plugin for indexing mbox files (currently only Opera mbs files).
Galilei is a Copernic clone for the GNOME desktop, a GUI internet meta-search application.
Ganesha: An easy-to-use, graphical RDF/XML editor.
An online implementation of a scoring system based on challenges that anyone can conquer across thousands of different sites via a public open API and a centralized database.
A toolkit for crawling information from web pages by combining different kinds of "actions". Actions are simple operations such as navigating to a specified URL or extracting text from the HTML. A graphical user interface is also available.
geoLucene is an extension of Lucene that makes it possible to effectively index and search documents that contain locational information (longitude/latitude). It uses an R-tree as a spatial index. See http://www.gossamer-threads.com/lists/lucene/java-dev/53378.
Giraf is a PHP-based project for searching and posting to Fluidinfo.
Glue 2 is a Semantic Web Service discovery engine fully compatible with the WSMO meta-model and the WSML language that aims at solving polarization problems by using mediators.
GoldSeeker is a small application for extracting formatted data. It can parse information from a text, HTML, or other file and export it to a database.
An agriculture tracking system for monitoring the levels of grain in elevator bins. Grain Tracker also gives extensive information on the activity of these bins.
Gumshoe Desktop Search indexes local files of various formats on a Windows desktop and provides a search GUI. The project is developed in Java. It builds on other open-source projects including Lucene, Luke, SWT, TagSoup, Jakarta POI, and others.
Software that counts votes and generates statistical reports for forum.hkgolden.com.
XPath HTML parser
HXPath is a command-line tool for extracting data from HTML documents. HXPath can select subtrees, like the standard xpath tool, but is also able to read contents and attributes and output them in a bash-friendly format. HTML Tidy and HTTP/HTTPS GET are built in too.
The Halizo Project is working on an application similar to FreshMeat.net that lets software be placed into categories, indexed, and searched so that others can access it.
Research and Development Group for the creation of a Documentalist's Tool (Herramienta de Documentalista). HDD is a program developed in Velazquez Visual for managing information from a documentalist's point of view.
HooDoo is designed to provide most of the same functionality as Google, but available to everyone for use on their own websites.
This project renders a given web page (HTML and CSS) and returns a GD image. It could be used to generate thumbnails of websites.