Home / data
Name Modified Size InfoDownloads / Week
Parent folder
linked_data_catalog.rdf 2013-02-05 53.5 kB
Totals: 1 Item   53.5 kB 0
Introduction
	This s a linked data project. The aim is to create a linked data set catalog in linked data standards it self. 
To know more about linked data ans semantic web use these links

	Linked data:
		http://en.wikipedia.org/wiki/Linked_data
		http://linkeddata.org
		
	Semantic Web:
		http://en.wikipedia.org/wiki/Semantic_Web

	The linked data sets can be found here http://linkeddata.org/data-sets. This is where the starting point for this project. Convertion of all the datasets listed here to the voiD  catalog format.The data sets are constantly being added to the linkeddata so we need to find and update newer data to the graph.
	
Software needed
	This is very simple project and a text editor and an internet connection is all that is needed. 

Adding data sets to the catalog
	VoID vocabulary is used to make the catalog, which is originally designed for single dataset provider,but here used it for describing multiple data sets.It uses generic vocabularies lke FOAF and DC as well.

Tutorial
Adding an entry to the catalog 


Choose a dataset and add it to the catalog. To start with, chose CrunchBase entry listed in linkeddata.org.Go to VoID editor and add the following details in it


Dataset URI:  http://cb.semsol.org/

Dataset Homepage URI: http://cb.semsol.org/

Dataset Name:  Crunchbase

Dataset Description:  RDFized Crunchbase entries

Example Resource:  http://cb.semsol.org/company/yahoo

Dataset Topic : business, database

Vocabulary URI: http://www.w3.org/1999/02/22-rdf-syntax-ns# , http://www.w3.org/2003/01/geo/wgs84_pos#,  http://cb.semsol.org/ns#

Publisher : http://semsol.com

SPARQL endpoint : http://cb.semsol.org/sparql

Now the dataset entry for cruchbase dataset is ready in VoID vocabulary, in the right side textarea. It is in Turtle notation.

			@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
			@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
			@prefix foaf: <http://xmlns.com/foaf/0.1/> .
			@prefix dcterms: <http://purl.org/dc/terms/> .
			@prefix void: <http://rdfs.org/ns/void#> .
			@prefix : <#> .
			## your dataset
			<http://cb.semsol.org/> rdf:type void:Dataset ;
			foaf:homepage <http://cb.semsol.org/> ;
			dcterms:title “Crunchbase” ;
			dcterms:description “RDFized Crunchbase entries” ;
			dcterms:publisher <http://semsol.com> ;
			void:sparqlEndpoint <http://cb.semsol.org/sparql> ;
			void:vocabulary <http://www.w3.org/1999/02/22-rdf-syntax-ns#> ;
			void:vocabulary <http://www.w3.org/2003/01/geo/wgs84_pos#> ;
			void:vocabulary <http://cb.semsol.org/ns#> ;
			void:exampleResource <http://cb.semsol.org/company/yahoo> ;
			dcterms:subject <http://dbpedia.org/resource/Database> ;
			dcterms:subject <http://dbpedia.org/resource/Business> .
			@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .@prefix foaf: <http://xmlns.com/foaf/0.1/> .@prefix dcterms: <http://purl.org/dc/terms/> .@prefix void: <http://rdfs.org/ns/void#> .@prefix : <#> .
			## your dataset<http://cb.semsol.org/> rdf:type void:Dataset ; foaf:homepage <http://cb.semsol.org/> ; dcterms:title “Crunchbase” ; dcterms:description “RDFized Crunchbase entries” ; dcterms:publisher <http://semsol.com> ; void:sparqlEndpoint <http://cb.semsol.org/sparql> ; void:vocabulary <http://www.w3.org/1999/02/22-rdf-syntax-ns#> ; void:vocabulary <http://www.w3.org/2003/01/geo/wgs84_pos#> ; void:vocabulary <http://cb.semsol.org/ns#> ; void:exampleResource <http://cb.semsol.org/company/yahoo> ; dcterms:subject <http://dbpedia.org/resource/Database> ; dcterms:subject <http://dbpedia.org/resource/Business> .

Convert in to RDF/XML file(serialize) – copy the dataset details in turtle notation and go to RDF/XML converter tool(http://www.rdfabout.com/demo/validator/). And paste the turtle notated content there and select input format as N-Triples/Turtle, and click validate. The result is given below.


<?xml version=”1.0??>
<rdf:RDF xmlns:foaf=”http://xmlns.com/foaf/0.1/” xmlns:void=”http://rdfs.org/ns/void#”
xmlns:rdf=”http://www.w3.org/1999/02/22-rdf-syntax-ns#”
xmlns:dcterms=”http://purl.org/dc/terms/”>
<void:Dataset rdf:about=”http://cb.semsol.org/”>
<foaf:homepage rdf:resource=”http://cb.semsol.org/” />
<dcterms:title>Crunchbase</dcterms:title>
<dcterms:description>RDFized Crunchbase entries</dcterms:description>
<dcterms:publisher rdf:resource=”http://semsol.com” />
<void:sparqlEndpoint rdf:resource=”http://cb.semsol.org/sparql” />
<void:vocabulary rdf:resource=”http://www.w3.org/1999/02/22-rdf-syntax-ns#” />
<void:vocabulary rdf:resource=”http://www.w3.org/2003/01/geo/wgs84_pos#” />
<void:vocabulary rdf:resource=”http://cb.semsol.org/ns#” />
<void:exampleResource rdf:resource=”http://cb.semsol.org/company/yahoo” />
<dcterms:subject rdf:resource=”http://dbpedia.org/resource/Database” />
<dcterms:subject rdf:resource=”http://dbpedia.org/resource/Business” />
</void:Dataset>
</rdf:RDF>

Now repeat the above process and create entries for all the datasets listed in the Linkeddata.org.
Combine the RDF entries in to a single file, no need to copy all the tags but starting from <void:Dataset .. >only
something like

<?xml version=”1.0??>
<rdf:RDF xmlns:foaf=”http://xmlns.com/foaf/0.1/” xmlns:void=”http://rdfs.org/ns/void#”
xmlns:rdf=”http://www.w3.org/1999/02/22-rdf-syntax-ns#”
xmlns:dcterms=”http://purl.org/dc/terms/”>
<void:Dataset rdf:about=”http://cb.semsol.org/”>
…
…
</void:Dataset>
<void:Dataset rdf:about=”http://dbpedia.org/”>
….
….
<void:Dataset>
<void:Dataset rdf:about=”http://www.geonames.org/”>
…
…
<void:Dataset>
</rdf:RDF>



Source: readme.txt, updated 2010-10-02