Search engine and data mining applications and ClueWeb datasets.
The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
The stuff here has no documentation and some of it may never be completed. This is my playground, use at your own risk.
Imgur Gallery Downloader
Users can now search Imgur for any phrase and ImgurDL/Loadur will automatically search for matching images. ImgurDL/Loadur will download the images while displaying the progress to the user.
The LEADERS toolkit is a generic toolset that enables the creation of an online environment which integrates EAD finding aids and EAC authority records with TEI transcripts and digitised images of archival material suitable to a wide variety of archives.
The Rainbow project is an open source initiative to build a comprehensive content management system using Microsoft's ASP.NET and C# technologies. It has ASP.NET 1.1 and ASP.NET 2.0 code bases.
CaC is a application to easily download and convert Videos from Videosites like YouTube, Google Video etc. It´s written in Lazarus / FreePascal and availible for Linux, Windows and Mac OS X Systems.
Written in PHP and designed to maintain a personal database of bookmarks, Linkerdoodle is a simple link organizer.
"Open Source Book Collector" The main purpose of this utility is to provide a complete solution for organizing and accessing data of eBooks.
A simple php script that retrieves weather information from wunderground.com quickly and easily. Enter a city and state, then submit and a wunderground forecast banner image will load on the page. Not affiliated with wunderground.com
The goal of bookman is to implement a network based service for managing and distributing bookmarks transparently from a central server to any bookman-enabled client software (curently focussing on Mozilla, IE and Opera).
(Project is participated in the Zend PHP5 Contest. Project information will be released after the event, Oct 11, 2004)
Open-site PHP code.
contentix - open source content management system contentix is a cms and a framework to develop any personalized browser based application. It use xml to store data in media nutral way and xsl to generate output. Check the demowebsite from downloads.
pyChelsea is a python based, personal, visited, web page indexer, seach engine and interface for the browser/platform of your choice. If you remember a page based on a phrase, pyChelsea is for you.
This project provides additional tools, plugins, and documents to make your life better if you are a member of the virtual world There.com.
An application used to search various web-based genealogy sites simultaneously and review and analyse the data gathered.
Cheshire3 is a fast Z39.50, SRW, XML search engine, written in Python for extensability and using C libraries for speed. Next generation of the Cheshire system (http://cheshire.berkeley.edu) and designed around a distributable, object oriented model.
A hypertext-browser written in Java which filters links (emails, docs or pics for e.g.) out of .html-documents and paints them on screen in hierarchical order. Users get a quick overview of how a website is put together.
Coherence is an advanced Content Management System build on top of Zope. Coherence has site-, user- and filemanagement. Some of the special features are a WYSIWYG page-editor with a drag and drop interface, versioncontrol, workflow and linkmanagement.
HooDoo is designed to provide most of the same functionality of Google, but available to all for their websites
This project was started by myself and a few friends a while ago to solve out problems with other more well know CMS's. the problem was that the others didnt have the functions we required so we started our own.
Cicerone is a multi-platform, multi-server, multi-database, web-based corporate information system like no other. Completely web-driven and accessible through any 4.x web browser, Cicerone allows your company to create and maintain information on the fly
Open Source Application for databasing your Music Collection(s). iChoons will utilize other open source products such as MySQL, Apache Webserver and PHP as well as Python / wxPython and SQL Lite. We will also be including tools written in Python for Win3
Analysis and interactive visualization of a web-based community. Supports different focuses on the given social network to present community groups to the user. Also specific information of each member is provided.
RIG is a web-based JPEG image album viewer, especially useful for digital camera albums; provides automatic image resizing, preview & thumbnail caching, user authentication; composed of a PHP web interface and a C++ thumbnail engine.