This is an ***old archive*** of tools developed for facilitating the use of Creative Commons licenses and metadata. --- For the most up to date representation of any of the projects listed here, please see: http://creativecommons.org/project/Developer.
webExtractor is a Java application that is used for extracting specific content from web based HTML, XML, CSV, and free form text. The extracted data can be used for data gathering and mining purposes.
webStraktor is a programmable World Wide Web data extraction client. Its purpose is to scrape HTML based content via the HTTP protocol and extract relevant information. webStraktor features a scripting language to facilitate the collection, the extraction and the storage of information available on the web, including images. The scripting language uses elements of the Regular Expression and xPath syntax. The webStraktor scripting language has a small instruction set and its syntax is easy to master. The standard webStraktor output format is XML based, either in ASCII, UTF-8 or ISO-8859-1 (Latin1) code pages. webStraktor relies on the Apache HttpClient for retrieving content via the HTTP protocol. It adheres to the Robots Exclusion Protocol and it can be configured to operate in an anonymous way by connecting to the predominant types of web proxy servers. webStraktor extends the functionality of web crawlers, spiders or bots by integrating scraping and crawling capabilities.
Linklist with mysql and PHP
HttpFinder is web content searching tool. It enables look for text content that matches given regular expression in html pages/scripts etc. All navigation is performed with use of other regexp which describes links to visit.
Kobold's file searchengine is a cgi script for Homepages, programed in Aptilis, a easy to learn scripting language. The main search script, an example html file is included plus a script for indexing your files. No Database required, only a HTTP Server.
Mp3 JudeBox Server Interface
My Community Portal is a all in one internet portal that offers, forum, groups, chat, your own e-mail, search engine, internet directory, your own home page, poll's, dating services, buddy list, MP3 and file sharing, and many more.
PHP World Portal is being developed as the framework for JLS Web Development's site. After each module is completed it will be released as open source for the public. The core framework will be released by 1/23/04.
SearchIRC Deskbar. Provides multiple methods to access SearchIRC from your desktop and/or browser. Deskbar is NO LONGER SUPPORTED. Mozilla search plug-in should still work, however.
A hypertext-browser written in Java which filters links (emails, docs or pics for e.g.) out of .html-documents and paints them on screen in hierarchical order. Users get a quick overview of how a website is put together.
WinningBid Pro! is an advanced auction monitoring/bidding tool. Build to be able to support a variety of auction sites, WinningBid allows you to add an auction by simply dragging the link from your web browser, and to set up a bid!