<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to Related Work</title><link>https://sourceforge.net/p/odcleanstore/wiki/Related%2520Work/</link><description>Recent changes to Related Work</description><atom:link href="https://sourceforge.net/p/odcleanstore/wiki/Related%20Work/feed" rel="self"/><language>en</language><lastBuildDate>Wed, 14 Nov 2012 14:44:59 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/odcleanstore/wiki/Related%20Work/feed" rel="self" type="application/rss+xml"/><item><title>WikiPage Related Work modified by Tomas Knap</title><link>https://sourceforge.net/p/odcleanstore/wiki/Related%2520Work/</link><description>&lt;pre&gt;--- v4
+++ v5
@@ -48,6 +48,8 @@
 
 Sieve offers functionality similar to our Conflict Resolution component; however the purpose of Sieve in LDIF is different - it aggregates data while being stored to the clean database (unlike Conflict Resolution used at query time). This may be suitable when the desired data are known in advance but is not sufficient for open Web environments, where every consumer has different requirements on the aggregated data. Furthermore, ODCleanStore provides quality for each result statement where Sieve computes quality only for whole named graph.
 
+** [Karma](http://www.isi.edu/integration/karma/) **
+
 **Integration systems in relational databases**
 
 The problem of integration of heterogeneous data (solved in ODCleanStore for RDF data) is solved by several systems for relational databases, e.g. [Aurora](http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.50.7261) or [Fusionplex](http://dl.acm.org/citation.cfm?id=1221048).
&lt;/pre&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tomas Knap</dc:creator><pubDate>Wed, 14 Nov 2012 14:44:59 -0000</pubDate><guid>https://sourceforge.net2bae6975dcbf2199ada2d63fc0db307deaf8ce45</guid></item><item><title>WikiPage Related Work modified by Tomas Knap</title><link>https://sourceforge.net/p/odcleanstore/wiki/Related%2520Work/</link><description>&lt;pre&gt;--- v3 
+++ v4 
@@ -31,6 +31,7 @@
 
 LDIF components encompass the whole process from data import and processing to integration and quality assessment. We use some of LDIF componets internally in ODCleanStore (Silk). The main difference is that LDIF is a framework other applications can built on, while ODCleanStore is a ready-to-use solution that can be easily deployed and managed via a web interface. Differences in quality assessment and data aggregation with Sieve, a part of the LDIF framework, are described below.
 
+Provenance in LDIF - see Figure 2 of [LOD2 Deliverable 4.3.2](http://static.lod2.eu/Deliverables/deliverable-4.3.2.pdf)
 
 
 ----
&lt;/pre&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tomas Knap</dc:creator><pubDate>Sat, 28 Apr 2012 12:38:32 -0000</pubDate><guid>https://sourceforge.net52401fd3be4211c7e630903fa368f2f28aacad49</guid></item><item><title>WikiPage Related Work modified by Jan Michelfeit</title><link>https://sourceforge.net/p/odcleanstore/wiki/Related%2520Work/</link><description>&lt;pre&gt;--- v2 
+++ v3 
@@ -38,7 +38,7 @@
 Data aggregation and quality
 ---------------
 
-&lt;!--ODCleanStore enables aggregation of data from various sources with quality estimation, and provenance tracking with the Conflict Resolution component.--&gt;
+&lt;!--ODCleanStore enables aggregation of data from various sources with quality estimation, and provenance tracking with the Conflict Resolution component. --&gt;
 
 
 **[Sieve](http://sieve.wbsg.de)**
&lt;/pre&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jan Michelfeit</dc:creator><pubDate>Tue, 24 Apr 2012 13:32:01 -0000</pubDate><guid>https://sourceforge.net5d2df0f13624bd8668473490245430c54ccd699e</guid></item><item><title>WikiPage Related Work modified by Jan Michelfeit</title><link>https://sourceforge.net/p/odcleanstore/wiki/Related%2520Work/</link><description>&lt;pre&gt;--- v1 
+++ v2 
@@ -1,15 +1,53 @@
 Related Work
 ============
 
-The purpose of ODCleanStore is to provide means for storing, cleaning, linking, and scoring incoming RDF data and provide aggregated and integrated views on the data to Linked Data consumers. Our focus is on the data processing and queries over cleaned data. Nevertheless, the extraction process that feeds data to ODCleanStore is also important - a related project [Strigil](https://sourceforge.net/p/strigil/home/Home/) implements a web scraper and document extractor that produces RDF data and integrates with ODCleanStore as a store for the produced data.
-
+ODCleanStore provides means for storing, cleaning, linking, and scoring incoming [RDF data](http://en.wikipedia.org/wiki/Resource_Description_Framework) data and provide aggregated and integrated views on the data to Linked Data consumers. In addition, we support trustworthiness of the data with quality assessment and provenance tracking. Our goal is to create a data store that would be easy to deploy and ready for use inside of the enterprise / organization.
+
+Our focus is on the data processing and queries over cleaned data. Nevertheless, the extraction process that feeds data to ODCleanStore is also important - a related project [Strigil](http://sourceforge.net/p/strigil/home/Home/) implements a web scraper and document extractor that produces RDF data and integrates with ODCleanStore as a store for the produced data.
+
+----
+
 Data Extraction
 ---------------
-
+**[Strigil](http://sourceforge.net/p/strigil/home/Home/)**
+
+Strigil implements a web scraper and document extractor that produces RDF data and integrates with ODCleanStore as the producer of data.
+
+**[Linked Data Manager](http://www.semantic-web.at/linked-data-manager)**
+
+Linked Data Manager (LDM) is a Java based Linked (Open) Data Management Suite to schedule and monitor required Extract - Transform - Load jobs for web-based Linked Open Data portals  as well as for sustainable Data Management and Data Integration usage.
+
+LDM data processing pipeline is similar to the data processing pipeline in ODCleanStore. LDM is a counterpart of ODCleanStore in that it provides facilities for managing the extraction process but doesn't provide any permanent storage or direct access to the data. Thus an LDM Loader could be used to send data to ODCleanStore and access it from here. Cooperation with LDM is currently being considered.
+
+----
 
 Data Processing
 ---------------
 
-Data access and aggregation
----------------
+**[Linked Data Integration Framework (LDIF)](http://www4.wiwiss.fu-berlin.de/bizer/ldif/)**
+
+LDIF is an open-source Linked Data Integration Framework that can be used by Linked Data applications to translate Web data and normalize URI while keeping track of data provenance.
+The framework consists of a Scheduler, Data Import and an Integration component with a set of pluggable modules. 
+
+LDIF components encompass the whole process from data import and processing to integration and quality assessment. We use some of LDIF componets internally in ODCleanStore (Silk). The main difference is that LDIF is a framework other applications can built on, while ODCleanStore is a ready-to-use solution that can be easily deployed and managed via a web interface. Differences in quality assessment and data aggregation with Sieve, a part of the LDIF framework, are described below.
+
+
+
+----
+
+Data aggregation and quality
+---------------
+
+&lt;!--ODCleanStore enables aggregation of data from various sources with quality estimation, and provenance tracking with the Conflict Resolution component.--&gt;
+
+
+**[Sieve](http://sieve.wbsg.de)**
+
+Sieve adds quality assessment and data fusion capabilities to the LDIF architecture. It uses metadata about named graphs in order to assess data quality, agnostic to provenance vocabulary and quality models. Sieve uses customizable scoring functions to output data quality descriptors. Based on these quality descriptors (and/or optionally other descriptors ), Sieve can use configurable FusionFunctions to clean the data according to task-specific requirements. 
+
+Sieve offers functionality similar to our Conflict Resolution component; however the purpose of Sieve in LDIF is different - it aggregates data while being stored to the clean database (unlike Conflict Resolution used at query time). This may be suitable when the desired data are known in advance but is not sufficient for open Web environments, where every consumer has different requirements on the aggregated data. Furthermore, ODCleanStore provides quality for each result statement where Sieve computes quality only for whole named graph.
+
+**Integration systems in relational databases**
+
+The problem of integration of heterogeneous data (solved in ODCleanStore for RDF data) is solved by several systems for relational databases, e.g. [Aurora](http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.50.7261) or [Fusionplex](http://dl.acm.org/citation.cfm?id=1221048).
 
&lt;/pre&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jan Michelfeit</dc:creator><pubDate>Tue, 24 Apr 2012 13:31:44 -0000</pubDate><guid>https://sourceforge.netec8335e38864de43727704a17e0c920d1b56b3dc</guid></item><item><title>WikiPage Related Work modified by Jan Michelfeit</title><link>https://sourceforge.net/p/odcleanstore/wiki/Related%2520Work/</link><description>Related Work
============

The purpose of ODCleanStore is to provide means for storing, cleaning, linking, and scoring incoming RDF data and provide aggregated and integrated views on the data to Linked Data consumers. Our focus is on the data processing and queries over cleaned data. Nevertheless, the extraction process that feeds data to ODCleanStore is also important - a related project [Strigil](https://sourceforge.net/p/strigil/home/Home/) implements a web scraper and document extractor that produces RDF data and integrates with ODCleanStore as a store for the produced data.

Data Extraction
---------------


Data Processing
---------------

Data access and aggregation
---------------

</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jan Michelfeit</dc:creator><pubDate>Tue, 24 Apr 2012 12:26:21 -0000</pubDate><guid>https://sourceforge.netb1c85af08bf9ea965557e54d9ae4d6cfc0b8f17d</guid></item></channel></rss>