The ht://Dig system is a complete indexing and searching system for a domain or intranet. This system is not meant to replace the need for powerful internet-wide search systems like Lycos, Infoseek, Google and AltaVista.
Law Leecher is a multi-threaded web crawling tool which extracts laws from the EU law database PreLex (http://ec.europa.eu/prelex/). It's written in Ruby.
Quran Search Engine API
Alfanous (The Lantern - الفانوس) is an Arabic search engine API that provides simple and advanced search in the Holy Quran, with more features and many interfaces...
MultiSearch is a simple and fast search engine able to concatenate and organize multiple results. It is easy for end users to customize.
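As a rough illustration of what concatenating and organizing multiple result sets can look like, here is a minimal Python sketch. It is not MultiSearch's own code: the function name `merge_results` and the idea of representing each result as a URL string are assumptions made for the example.

```python
def merge_results(*result_lists):
    """Concatenate several result lists, dropping duplicate URLs.

    Hypothetical helper: each argument is a list of URL strings from
    one search source. The first occurrence of a URL wins, so earlier
    sources keep their ordering priority.
    """
    seen = set()
    merged = []
    for results in result_lists:
        for url in results:
            if url not in seen:
                seen.add(url)
                merged.append(url)
    return merged


if __name__ == "__main__":
    source_a = ["http://example.org/1", "http://example.org/2"]
    source_b = ["http://example.org/2", "http://example.org/3"]
    for url in merge_results(source_a, source_b):
        print(url)
```

Keeping the first occurrence is only one possible policy; a real engine might instead rank by how many sources returned each result.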
Auto Index WAP: an advanced download portal (multi-language)
Djamolwap 13v: advanced auto index with web admin panel, multi-language support, and themes.

New updates:
- Multi-language website: 1) English 2) Urdu 3) Gujarati 4) Russian
- Users/visitors can manually change the website language
- Multi-language plugin on/off
- Added function in admin panel
- Automatic tag setting for all MP3 files added

Official website: http://ai.djamol.com
Live demo of features and installation: http://youtube.com/phpindia

Requirements:
- Web server (cPanel OS or other OS)
- PHP 5.0 or greater
- MySQL 5.0 or greater
- The mod_rewrite Apache module (.htaccess)

(Note: if mod_rewrite is not available, use DjAmolWap version 10.4, which works without .htaccess.)
Standalone version of TemaTres: controlled vocabulary server
This package lets you deploy a local version of TemaTres and experiment with its features and the capabilities of other associated tools. It includes: a version of the Learning Resource Exchange (LRE) thesaurus in Spanish, English, and French; TemaTres Visual Vocabulary configured to draw on the Spanish version of the Learning Resource Exchange (LRE) thesaurus; and a blank TemaTres installation.
A DAML+OIL ontology editor with constraint propagation functionality to ensure that constraints applied to properties and restrictions are correctly propagated through an ontology, and datatype management functionality for manipulating custom datatypes.
ApexKB is a collaborative search engine that is powered by social bookmarking. It uses peer and social-networking principles to tag and link distributed content, media, or data to make valuable information more easily discoverable.
WACS is a tool for building Adult Web Sites; equally suitable for managing a private collection or building a commercial site. It has many powerful features including dynamic filtering, model catalogs, automatic download, and a powerful search engine.
Group-CCS development: components, templates, tools, accessories, tutorials, modules, translations, documentation, code, scripts, and anything else that can improve the work of those who use the powerful development tool CCS (CodeCharge Studio).
WallPaper (alias crawlpaper) is a desktop changer (NOT a screensaver) which includes a web crawler for picture download, an audio stream ripper, an audio player, a mini MP3 tag editor, etc. It also includes support for .zip and .rar files.
Web Search by the people, for the people
YaCy is a free search engine that anyone can use to search the internet (www and ftp) or to create a search portal for others (internet or intranet). The scale of YaCy is limited only by the number of users, and it can index billions of web pages. In p2p mode it is fully decentralized: all users of the search engine network are equal, and it is not possible for anyone to censor the content of the distributed index.
JobHunter aims to automatically find job information from some big sites such as chinahr, 51job, zhaopin and so on. JobHunter searches for the email address of each job item and automatically sends an application email to it.
A function-testing, performance-measuring, site-mirroring, web spider that is widely portable and capable of using scenarios to process a wide range of web transactions, including ssl and forms.
Methanol is a scriptable multi-purpose web crawling system with an extensible configuration system and speed-optimized architectural design. Methabot is the web crawler of Methanol.
OpenWebSpider is an Open Source multi-threaded Web Spider (robot, crawler) and search engine with a lot of interesting features!
A Java implementation of a flexible and extensible web spider engine. Optional modules allow functionality to be added (searching for dead links, testing the performance and scalability of a site, creating a sitemap, etc.).
A search tool that fetches NZB files and opens them directly in your newsreader by default. I made it because it's annoying to constantly search for and open the files on the web, and there's no free app doing that right now (that I know of).
A redistribution of a stripped-down version of the Zend Framework for use with the contributed Search Lucene API Drupal module.
pyTube is a Python-based command-line YouTube search tool. You can search for videos and display them in your default web browser. Requires Python 2.5 and gdata.
The censorship tools are a collection of bash scripts for a) comparing DNS server answers to get the blacklist from the censored server, b) downloading censored URLs and c) other tasks such as opening all censored pages of a blacklist in a browser.
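The DNS comparison in step a) can be sketched roughly as follows. This is a Python illustration of the general idea, not the project's bash scripts: the function name `find_blocked`, the dictionary layout (domain mapped to a set of IP addresses), and the rule "no shared address means likely blocked" are all assumptions for the example, and the lookups themselves are left out so the sketch runs offline.

```python
def find_blocked(reference, censored):
    """Return domains whose censored answers share no IP with the reference.

    Hypothetical inputs: both arguments map a domain name to the set of
    IP addresses returned by one resolver. A domain missing from the
    censored answers is treated as having an empty answer set.
    """
    blocked = []
    for domain, ref_ips in reference.items():
        cens_ips = censored.get(domain, set())
        # No overlap at all suggests the censored server redirects
        # or suppresses the name.
        if not (set(ref_ips) & set(cens_ips)):
            blocked.append(domain)
    return sorted(blocked)


if __name__ == "__main__":
    reference = {"a.example": {"192.0.2.1"}, "b.example": {"192.0.2.2"}}
    censored = {"a.example": {"192.0.2.1"}, "b.example": {"203.0.113.9"}}
    print(find_blocked(reference, censored))
```

Differing answers can also come from load balancing or CDNs, so a list produced this way is only a set of candidates to check by hand.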
YouSeer is an open source search engine framework built on top of other open source components. It is part of the general SeerSuite framework. YouSeer uses Heritrix as its crawler and Solr as its indexing system.
A search engine for .ppt files (open source). Created by: X-Cisadane (Dwi). Greetz to: XCode, Muslim Hackers, Cisadane Hacker, Dunia Santai Forum, UG-HotZone 567.
High performance distributed in-memory key/value store
Infinispan is an open source, Java-based data grid platform. ***IMPORTANT*** Starting with Infinispan 5.0.0.FINAL, Infinispan releases are no longer hosted on SourceForge. They can now be found at www.jboss.org/infinispan/downloads
A simple-to-set-up web scraper written in Java. It uses modified regEx to quickly write complex patterns to parse data out of a website. It contains a GUI tool for testing your configuration scripts and is fully automated through the command line.