LucidDB is a DBMS optimized for business intelligence. Besides architectural innovations such as column-store, it supports many advanced features from SQL:2003, including SQL/MED and user-defined transformations written in Java.
WQuery is a domain-specific query language designed to process WordNet-like lexical databases. It may be used as a standalone application or as an API to a lexical database in Java-based systems.
MoMS (Model Management System) manages statistical models, a little like a database management system. Instead of tables, it has models that can be updated and queried.
Web application for making user-friendly queries against a large XML database.
Tools to XML-ize large bodies of semi-formal texts (like floras).
Computer-assisted specimen identification.
Uses natural language processing and 2D/3D image analysis and generation.
This is intended as an open-source, multi-platform link diagramming tool for intelligence analysis. The goal is to be able to store objects using any database abstraction or in XML. This project needs much wheel reinvention, so assistance is welcome.
SIRIUS Ramsetcube 2011 grew out of the old Ramsetcube project of 2008. It is a powerful, easy-to-use cube designer that helps create and build OLAP schemas. The models can be exported to XML and the data format can be customized for the great OLAP
Ontology MApping FRAmework Toolkit lets users create semantic relations between two ontologies (source and target) and apply those relations to translate source ontology instances into target ontology instances.
MAPONTO is an interactive tool for discovering semantic mappings between database schemas and ontologies, as well as between different database schemas. The tool analyzes the semantics of the input schemas and ontologies to generate semantic mappings.
Snowflake is a CASE tool directed towards data warehousing projects. Through an easy-to-use graphical interface, it allows the construction of a conceptual design of the data warehouse or data mart, using a simple multidimensional notation.
Mandarax is a pure Java implementation of a rule engine. It supports multiple types of facts and rules based on reflection, databases, EJB, etc., and supports XML standards (RuleML 0.8). It provides a J2EE-compliant inference engine using backward chaining.
Wikipedia Concept Association Map (WCAM) is a new approach for textual knowledge representation and understanding. All concepts and associations are stored in a graph database for better performance and easy distribution.
GridSQL is an open source, shared-nothing clustered database system for data warehousing, designed for PostgreSQL. GridSQL includes intelligence to parallelize queries across multiple servers to achieve faster query response times.
The Pentaho Personalizer is based on the "dead" project PentahoLooker. The Pentaho Personalizer is sponsored by Lizacom and has a commercially financed core of developers to ensure that the project does not "die" from lack of time or interest.
A group of subprojects for data cleaning, mainly as a step in a data mining project. Visit www.datacleaningopensource.com to review our current applications or if you want to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
POESIA = Public Opensource Environment for a Safer Internet Access
An open-source Internet content filter (multimodal, multilingual) aimed at protecting young people (in schools...); partly funded by the European Commission.
DB2RDF is a software tool that converts data from the relational data model to a semantic data model (in the form of RDF and RDFS). It provides a SPARQL endpoint, so the converted data can be queried with the SPARQL query language.
A Flex/Java-based system for presenting statistics over the net, built on a Mondrian backend. It can handle large amounts of data; the administration facilities are not worth shouting about.
COLDIC is an integrated lexicographic platform for the creation and management of electronic lexica. It generates itself automatically from a DTD with the database schema and provides human interfaces (query and insert tools) and machine interfaces (web services).
The JPalo Client, API, and WebClient are tools to visualize and model data of a Palo or XMLA database. Their main purpose is Business Intelligence Reporting and Planning. With the API, programmers can easily model their specific needs.
An adaptive mini database that learns algorithmic operations for data-intensive algebraic computational methods within a pervasive computing graph. It also provides generic structures and algorithms as a data fabric.