Tarantula is a Java Web crawler. Tarantula is Multithreading, Scalable, High Performance, Extensible and Polite and can be used to crawl and index any Web or Enterprise domain and is configurable through a XML configuration file.
A network asset management written in PHP & MySQL. Maintains a list of servers that can be x-ref by multiple items. Features: locations,manufacturers,vendors(contact names & phone numbers), Device log ,List of network ports,Software manager,File manager
Soogle is a web search service proxying between Google Desktop Search(GDS) and the internet users. While GDS lets you search your own computer, Soogle is able to have your intranet users search the shared emails, files, media and chats remotely.
AIS - Associative Indexing Service, an application for storing bookmarks, memos, indexing of big (lifetime) archives for fast future access to the data by (personalized) keywords. In other words - it is an extension of human associative memory :)
Spencer is a Java-based, web-hosted filesystem indexing application. It indexes files on network shares, reads inside MSOffice, Open/StarOffice, PDF and zip files and provides a web interface to the index with search functions to find the file you want.
Roosster.org is a personal "on-demand" search engine. This means, it indexes only items/entries/files/URLs you explicitly tell it to index and provides a full-text-search over indexed items. Goto http://roosster.org/dev for all details.
Reads RSS feeds' full html page, scrapes and summarizes just the article content, stripped of ads, etc. Converts to speech (ogg/mp3) and creates a podcast of all of the summaries. Works with slashdot, weather, cnn, newsforge, groklaw, pirillo and more!
KSearch website search engine, written in Perl, is fully customizable with unlimited page search. Can use DBM or flat-file database. Search results output produce XHTML 1.0 Strict doc types making HTML and CSS easily match your existing website.
Este es un basico Script que permite mostrar todas las imagenes dentro de un directorio. Crea imagenes en miniatura. Requiere la libreria GD2 para mostrar imagenes en miniatura.
PornSeer, a smart porn detector, precisely locates breasts, vulvas and other pornographic features in images/videos. It generates mosaic patterns on illicit contents of porn images/video, provides indexes of pornographic contents for image/video database
jCV is a powerful multilingual Web application designed for creating, searching and printing resumes. jCV is 100% developed in Java using "best-of-breed" Open Source J2EE frameworks (SOFIA) and reporting tools (JasperReports, iReport).
PHPSpider is a PHP base class for creating custom spiders to mirror sites, check links, index content, scrape content, and more limited only by the user's imagination. It includes mirroring and link checking scripts based off of the Spider base class.
UindexWeb Search engine is an open source web spider, main program is in Delphi7. Lucene.Net is the default full text index engine. The latest version can be retrieved from http://www.opencpu.com/.
Jetbox CMS is seriously tested on usability & has a professional intuitive interface. Its role based, with workflow and module orientated. All content is fully separated form layout. It uses php & mysql.
This plug-in for Google Desktop is a simple web spider (Könguló is Icelandic for spider) that crawls websites you specify, e.g. intranet websites, and dumps them into Google Desktop. You must install Google Desktop prior to installing the plug-in.
Naig (Not Another Image Gallery) is a very easy to use php based image gallery. Just upload the images you want to share. thumbnails and smaller versions are created on access.
OpenDataBag is object database with web interface. Full text search over whole database, live reports, secure and stable.
Open Source Intelligence Automation.
SpiderFoot is an open source intelligence automation tool. Its goal is to automate the process of gathering intelligence about a given target, which may be an IP address, domain name, hostname or network subnet. SpiderFoot can be used offensively, i.e. as part of a black-box penetration test to gather information about the target or defensively to identify what information your organisation is freely providing for attackers to use against you.
Written in python, Reverse Phone Lookup is a simple program that when given a phone number, will search the white pages and display the information returned (First Name, Last Name, Address, City, State, and Zip code). Note: This program no longer works.
PRO-Search is a crawler of FTP servers, SMB shares, HTTP, dc++ networks, ... with powerful web search and navigation interface
SNT is a search engine for SMB and FTP shares with crawler running on Win32. Web interface is provided for searching files and browsing shares contents. Also provided shared films list with users rates and comments.
WebTrack is an PHP based search engine for your MovieTrack (www.movietrack.net) or AMC (www.antp.be) system. It gives you a way to present your movie list on the web. Full skin support and it's very simple to use. You have to try it to understand it...
The LEADERS toolkit is a generic toolset that enables the creation of an online environment which integrates EAD finding aids and EAC authority records with TEI transcripts and digitised images of archival material suitable to a wide variety of archives.
The aim of this project is to develop client software for PubSub.com's XMPP-based JEP-60 publish-subscribe system.
Develop a java API (JAR library, with an example web GUI) for content management. Simple but powerful, based on Apache Lucene project, it would be embeded on projects requiring content management.