Sarcomere is distributed information retrieval software based on the Grid Information Retrieval (GIR) proposed standard.
A robust website scraping framework that uses XML, XPath, RegEx and scripting to consume, parse, normalize and traverse HTML based on a set of seed URLs. Scrape.NET is built using C#, TidyForNet (the p-invoke only version) and HTML Tidy.
A collection of Dokuwiki plugins that will enable the user to spatially enable and use the wiki, currently we have: openlayersmap (a map), geotag (ways of geotagging a page)
Thenali is a content management system software project aimed to support the publication and maintenance of educational counselling and career counselling information website.
Simple, but powerful and extensible TV shows' torrents and subtitles auto-downloader (grabber) written in Python.
What would you do if you need to watch a static page for changes, which does not provide RSS Feeds? This application can check the changes in a set of website and send notification email.
Vitalina's newsreader aims to fill in the gap of a lighweight, easy to use mash up service for feeds. Vitalina's newsreader provides an alert funcionality for saved queries and the possibility to comment on feeds to share your point of view with others.
A program to fetch definitions for words from an Internet source, designed for use with monotonous school vocabulary assignments. Has both a command line version and a graphical front-end.
This project aims to develop a web-based search engine of distributed file system, such as NT's network share. You may use it to provide a search interface for your owned ftp server, or the network shares.
A big "How to build your own search engine", accompanied by the code of the search engine itself. All of this in French.
You can analyze a, img, h1, h2 tags in your site.
XGreen Picture Gallery is a .Net Component which allows developers to add picture gallery component from toolbox. Now with Drag and Drop! The Project is still at development. -It will not run without JS file download it too..
XML abstraction interface for Lucene and reference implementation
"girtools" is an implementation of Grid Information Retrieval (GIR). GIR is an emerging open standard for IR on the grid designed to allow dynamic, secure creation and searching of distributed information systems.
Open Source Application for databasing your Music Collection(s). iChoons will utilize other open source products such as MySQL, Apache Webserver and PHP as well as Python / wxPython and SQL Lite. We will also be including tools written in Python for Win3
A comprehensive jQuery plugin that provides various text highlight capabilities.
navTango - Local is a link and document management application.
"navTango - Local" is a web based application that lets you manage documents and links on your PC. navTango come with a search engine to index documents that live in its repository. The search engine with index PDF, HTML, Word, Powerpoint, Text, Excel and many other types of documents. navTango - Local works with IE, Firefox, Opera, Safari, and Chrome. This is an alpha version so you are on the bleeding edge. Use at your own risk.
p-get, a porn web spider for the efficient download of internet video pornography. p-get downloads from TGP style galleries. Although written in Java, it is a command line tool in the unix tradition.
palbum is a perl script which turns a directory (or directory hierarchy) full of images into a nice image gallery. It generates thumbnails and index.html files, and requires no configuration. It uses the ubiquitous netpbm library for image processing.
This UNIX shell utility reads a the HTML of a webpage and creates an output-stream of it.