Free Manga Downloader
The Free Manga Downloader (FMD) is an open source application written in Object-Pascal for managing and downloading manga from various websites such as AnimeA, Batoto, MangaFox, MangaStream, ...
CLucene is a C++ port of Lucene: the high-performance, full-featured text search engine written in Java. CLucene is faster than lucene as it is written in C++.
An open source search engine with RESTFul API and crawlers
OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on Windows and Linux/Unix/BSD.
PHPCrawl is a high configurable webcrawler/webspider-library written in PHP. It supports filters, limiters, cookie-handling, robots.txt-handling, multiprocessing and much more.
Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
Quran Search Engine API
Alfanous (The Lantern - الفانوس ) is an Arabic search engine API provide the simple and advanced search in the Holy Quran , more features and many interfaces...
The stuff here has no documentation and some of it may never be completed. This is my playground, use at your own risk.
Forum Downloader is a program that allows you to download forums and saves them locally for offline viewing and searching. It can also save linked images, images linked using thumbnails, attachments, or other files linked in posts.
The ht://Dig system is a complete indexing and searching system for a domain or intranet. This system is not meant to replace the need for powerful internet-wide search systems like Lycos, Infoseek, Google and AltaVista.
Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (file systems, web sites, mail boxes, ...) and the file formats (documents, images, ...) occurring in these systems.
Fusker is a tool to create entire image galleries from an single specially constructed URL.
A function-testing, performance-measuring, site-mirroring, web spider that is widely portable and capable of using scenarios to process a wide range of web transactions, including ssl and forms.
Search the web for videos, audios, eBooks, torrents and much more
What is WebCrunch? WebCrunch is intended to provide a very powerful web server indexing and search service allowing you to find a file among millions of files located on public servers around the internet. The search engine is powered by a database that holds information about all the files web servers have. The information about the files is gathered by an intelligent web crawler that runs every 2 to 4 days. It keeps the database clean and up-to-date with the previous contents and new entries for each web server address submitted by members.
A torrent search engine plugin for the Azureus/Vuze bittorrent platform.
OpenEphyra is an open framework for question answering (QA). It retrieves answers to natural language questions from the Web and other sources. Visit http://www.ephyra.info/ for more details and information on joining this open research initiative.
A php application that provides a web-based graphical interface similar to apache directory listing. Functions:copy, move, delete, rename files, etc. For more detail, please go to the official site.
MultiSearch is a simple and fast search engine able to concatenate and organize multiple results. Easy to customize for the end users.
So scr_ipfm is a script written in php, used to graphically show amount of data downloaded by users in local network. To do that, it uses logs generated by ipfm program (ipfm is available at the address: http://robert.cheramy.net/ipfm/).
Downloads pictures from Google Image Search, DeviantArt.com, and 4Chan.com directly to your hard drive based on the criteria you select.
Oxyus is an open source search engine written in 100% Java, aimed to provide a search button to your website in an easy way. Oxyus uses Apache Lucene for indexing, Quartz for scheduling and other interesting software products.
Classifier4J is a java library that provides an API for automatic classification of text. The default (and only current) implementation of this API is a Bayesian classifier. This library can be used for multiple purposes - as a spam filter or a blog cl
The Netjuke is a Web-Based Audio Streaming Jukebox powered by PHP 4, a database and all the MP3, Ogg Vorbis and other format files that constitute your digital music collection. Supports images, language packs, multi-level security, random playlists, etc
A little piece of software that downloads pictures from zerochan.net depending on tags and number of pages.
This was a terrible idea and is equally terribly implemented.
NLucene is the .NET implementation of the Lucene high-performance, full-featured text search engine written in Java. See http://jakarta.apache.org/lucene for more info on Lucene.