The Rainbow project is an open source initiative to build a comprehensive content management system using Microsoft's ASP.NET and C# technologies. It has ASP.NET 1.1 and ASP.NET 2.0 code bases.
An open source search engine with RESTFul API and crawlers
OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on Windows and Linux/Unix/BSD.
OAI-SOAP will provide a test bed for the application of SOAP/Web service/UDDI forms of OAI protocol.
Dias is a standalone, recursive directory indexing server. It is based on Apache Lucene and supports currently all Postscript-like, HTML-like and Text-like file formats. It is small, multi-threaded and easy to use. Works with FTP,SMB,WebDAV,eMail...
OSSSE - (Open Source Software Search Engine). Webcrawler, parsers for HTML and other Documents. Powerful full-text search, hit highlighting, faceted search, dynamic clustering & much more.
Written in PHP and designed to maintain a personal database of bookmarks, Linkerdoodle is a simple link organizer.
HttpFinder is web content searching tool. It enables look for text content that matches given regular expression in html pages/scripts etc. All navigation is performed with use of other regexp which describes links to visit.
HooDoo is designed to provide most of the same functionality of Google, but available to all for their websites
SlinkE is a highly elastic distributed cloud computing environment. All source code is included in all of the products. Our goal in making it open source is to allow others to contribute to the project.
My Community Portal is a all in one internet portal that offers, forum, groups, chat, your own e-mail, search engine, internet directory, your own home page, poll's, dating services, buddy list, MP3 and file sharing, and many more.
You can 'wear' any clothes on the internet 'virtually' on your body (image). And this application stores the URL where you can buy the clothes, you can share your information like your good looking shot or your ratingson the clothe over internet.
Unlock Google's potential. Use this application to find infomation that is more relevant to your search... This application allows enhanced searching on Google without the need for long modifiers etc..
Quran Search Engine API
Alfanous (The Lantern - الفانوس ) is an Arabic search engine API provide the simple and advanced search in the Holy Quran , more features and many interfaces...
=DOES NOT WORK ANYMORE AS DSA HAS PUT CAPTCHA= DSA Practical Driving Test Monitor helps you find any available practical driving test slot within specified date range. Runs on Linux/Mac/Windows and automates your manual task of finding the test slot.
Coherence is an advanced Content Management System build on top of Zope. Coherence has site-, user- and filemanagement. Some of the special features are a WYSIWYG page-editor with a drag and drop interface, versioncontrol, workflow and linkmanagement.
Simple application for downloading pictures from Zerochan.net
Simple java application for downloading high-quality pictures from Zerochan.net. You can find images by size or a tag. It's simple. And flat. All you need to do: download .jar file and run it with Oracle JVM (or any another JVM supporting image decoding)
Torrent Search is a cross-platform application, allowing to search for torrent files on different websites. Supported websites are integrated through plugins, which allows to easily extend the number of websites supported.
An application used to search various web-based genealogy sites simultaneously and review and analyse the data gathered.
Caused by new releases and/or activities of similiar tools like swish++ and swish-e this project has been closed.
Hier geht es um einen Webbrowser zum Kooperativen Surfen im Web. Dazu kann man sich an einem Server anmelden und Gruppen bilden zum Surfen. Alle Mitglieder einer solchen Gruppe sehen dann, wo die anderen gerade unterwegs sind und können sich gegenseitig ü
BullFrog is a search engine ranking program, written as a Mozilla Firefox extension. Simply enter one or more URLs and their corresponding keywords or key phrases, and BullFrog will see what position the URLs appear in Google.
Web Textual eXtraction Tools C++ Parallel web crawler, noun phrase idenification, Multi-lingual Part of Speech Tagging, Tarjan's Algorithm, Co-RelationShip Mappings...
A Perl administration interface for ht://Dig, an open source content indexing and searching system. Includes web-based GUI.
webStraktor is a programmable World Wide Web data extraction client. Its purpose is to scrape HTML based content via the HTTP protocol and extract relevant information. webStraktor features a scripting language to facilitate the collection, the extraction and the storage of information available on the web, including images. The scripting language uses elements of the Regular Expression and xPath syntax. The webStraktor scripting language has a small instruction set and its syntax is easy to master. The standard webStraktor output format is XML based, either in ASCII, UTF-8 or ISO-8859-1 (Latin1) code pages. webStraktor relies on the Apache HttpClient for retrieving content via the HTTP protocol. It adheres to the Robots Exclusion Protocol and it can be configured to operate in an anonymous way by connecting to the predominant types of web proxy servers. webStraktor extends the functionality of web crawlers, spiders or bots by integrating scraping and crawling capabilities.