Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors.
Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
ISO - Customized version of dcm4chee 2.17.3 for MySQL.
1. Add JBoss Application Server 4.2.3.GA for JDK 6.
2. Cleanup for Windows and deprecated files.
3. Off CONSOLE records - http://forums.dcm4che.org/jiveforums/thread.jspa?messageID=4787
Index biological data (genbank sheets, Uniprot...) in a Solr indexer, with index shard support and provides a query interface. Project goal is to create a virtual image with indexer and web interface to query and visualize biological data.
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.
Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Spider that recollects data from MySpace Social Network.
At now, it is only designed to extract information from native american people because it is used for a social science study in the UNAM (Universidad Nacional Autónoma de México).
A HTML scraper that uses machine learning frameworks to extract labelled fields from raw HTML. The project also involves the development of a tool to display the semi structured data generated by the scraper component.
iVia is an Internet subject portal or virtual library system. As a hybrid expert and machine built collection creation and management system, resources can be crawled and metadata and selected full-text can be automatically generated/extracted.
A threaded Web graph (Power law random graph) generator written in Python. It can generate a synthetic Web graph of about one million nodes in a few minutes on a desktop machine. It implements a threaded variant of the RMAT algorithm.
vbullmin is a data miner bot for vBulletin boards. vbullmin can get all Forums, Topics, Post and Users from a vBulletin. It can be export this values with phpbb2 database schema. It's a sample for Machine Learning. It's using patterns for getting data.
Open Source Semantic Web Search Engine Software: If two machines anywhere on the web can agree on the same definition of a digital service or digital good, then machine to machine transactions can use this lingua franca to transact on the users behalf.
OpenSiteSearch is the new Open Source version of OCLC's original java-based web application for building Z39.50 portals (i.e. virtual union catalogues). This project is specifically aimed at the library community.
VDC has been superseded by DVN: https://sourceforge.net/projects/dvn/ ---- The Virtual Data Center project is building an operational, open-source, digital library to enable the sharing of quantitative research data, and the development of distribute
AVD is a continuation of the swim project. The goal is to create a suitable SQL server from swim's not-installed DB, and to maintain the swim client. AVD will be used as a gBootRoot method.
Elvis Digital Library - e-Library with semantics - is a virtual library system based on J2EE platform, XML database and what is most important semantics. It is a complete solution for storing, presenting and SEARCHING. It is based i.a. on the RDF/DublinC
Buzzsearch is a Perl and MySQL based SMB/FTP search engine that originated at Georgia Tech. It should run on any UNIX machine with Samba, however I have only tested it on Linux.
ePub4U - a simple document publishing solution. Includes Table of Contents & Metadata storage based upon document properties. Front end can serve files via UNC, Mapped Drive or Virtual Directories. Back end stores metadata in database. Maintain docume
DocLib is a Web-based Document Management System implemented
in ASP .NET technology to facilitate documents finding in Full Text Search,
Directory Browsering, Search by document summary,by Catagory/Attribute ,
and by Virtual Directory.
The OpenBorges project intends to provide an humble place to experiment, and debate, about what can be an open, distributed, adaptive and collaborative, semantic virtual library. Inspirations are: As we May Think, Library of Babel, and Weaving the web
Mp3base is a web based mp3 database and jukebox. It uses mysql to store songs and mpg123, xmms or shout (icecast) for playing. The player can be run on a remote machine. Multiple users, playlists and voting are supported.
A web content management system with special emphasis on multimedia content. Designed as part of the TITAN grant at Manhattan College. Special thanks to Mike Mucciardi (Project team leader), Matt Joyce (Me), Vlad Panov (Design Layout), and Ananda Das (
The Medlane project is an attempt to create a set of tools that will enable librarians to move from the standard MARC (MAchine Readable Cataloging) format to a new library/museum XML format. This move will ensure traditional library/museum data remains