TransOnto is a knowledge transformation and migration system for transferring knowledge between semantic representations. TransOnto includes the tron library and the SemPP POWDER processor.
Ex-Crawler is divided into 3 subprojects (crawler daemon, distributed GUI client, and web search engine) which together provide a flexible and powerful search engine supporting distributed computing. More information: http://ex-crawler.sourceforge.net
WebWatcher - a Web-page Update Monitor. This program will help you keep an eye on interesting Web pages. You register a list of URLs you want to monitor, and WebWatcher checks for changes whenever you ask it to, or at given intervals. WebWatcher bases
The purpose of this project is to implement a generic search engine for object-oriented domain models.
This project is based on published work by the author and is intended to become the author's degree thesis project.
This tool helps you reduce the time it takes to get your site to the top of search engines such as Google, Yahoo, and Bing. You provide the URL of your site and of its beta version. The beta version must be available on your local desktop computer, for example.
NewsRack is a tool/service that attempts to automate news monitoring. Based on user-specified definitions and rules, NewsRack will enable automated downloading, classification, filing, and long-term archiving of news.
GHIRL is the Graph-based Heterogeneous Information Representation Language: a Java library for representing, querying, and navigating graph- or network-based data structures.
JavaPub is a one-click-install BibTeX publications portal based on a simple Java codebase. It features a drag-and-drop uploader module for BibTeX files and a module that generates the HTML index and entry pages for publication listings.
A search tool that fetches NZB files and opens them directly in your default newsreader.
I made it because it's annoying to constantly search for and open the files on the web, and there's no free app doing that right now (that I know of).
Egothor is a high-performance, full-featured text search engine written entirely in Java. It is a technology suitable for nearly any application that requires full-text search.
MuSE-CIR is a Multigram-based Search Engine and Collaborative Information Retrieval system. Written in Java/JSP, it supports any JDBC-connectable database (thoroughly tested only with OracleXE, and somewhat with MySQL), with JSP on Apache Tomcat 5.5.
Website Searcher is a PHP application based on Zend Framework that uses Zend Lucene technology to index and search a web site. It doesn't use a DBMS for the search index, only files on disk. You can index and search your own site or any other site on the Web.
A simple-to-set-up web scraper written in Java. It uses a modified regex syntax that lets you quickly write complex patterns to parse data out of a website. It contains a GUI tool for testing your configuration scripts and can be fully automated through the command line.
Desk.Now is a cross-platform Java client for the WhereIsNow WebService, which lets you find out where the latest version of a document is with just two clicks.
With DoCASU, Alfresco users have a simple, easy-to-use solution to access, search, and manage documents. DoCASU is a Rich Internet Application (RIA) based on Alfresco Web Scripts and ExtJS. Full details: http://code.optaros.com/trac/docasu
Webstats Solr is an attempt to make Apache access logs easier to data-mine. By adding a powerful search engine (Solr) as a backend and using JavaScript, HTML, and maybe PHP, I hope to make AWStats outdated.
http://easyfinderweb.blogspot.com/ Easy finder is a Java tool that finds all the links you are looking for across search engines at once. Say you want to get all the PDF documents about Java threads that search engines can find: here is an easy option for you...