<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to Home</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>Recent changes to Home</description><atom:link href="https://sourceforge.net/p/strigil/home/Home/feed" rel="self"/><language>en</language><lastBuildDate>Thu, 21 Nov 2013 09:35:33 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/strigil/home/Home/feed" rel="self" type="application/rss+xml"/><item><title>Home modified by Jakub Starka</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v36
+++ v37
@@ -27,7 +27,7 @@

 # Download

-You can download a complete [Strigil package](https://drive.google.com/file/d/0B6zMwJVoIsoQaWR3WV92TW80N28/edit?usp=sharing) or get the last sources from [svn](https://sourceforge.net/p/strigil/code/HEAD/tree/src/).
+You can download a complete [Strigil package](https://drive.google.com/file/d/0B6zMwJVoIsoQaWR3WV92TW80N28/edit?usp=sharing) or get the most recent sources from [svn](https://sourceforge.net/p/strigil/code/HEAD/tree/src/).

 -----------------

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jakub Starka</dc:creator><pubDate>Thu, 21 Nov 2013 09:35:33 -0000</pubDate><guid>https://sourceforge.net9d74e5ef25db2fab0573053964f7f3376fbefc80</guid></item><item><title>Home modified by Jakub Starka</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v35
+++ v36
@@ -27,7 +27,7 @@

 # Download

-You can download complete [Strigil package](https://drive.google.com/file/d/0B6zMwJVoIsoQaWR3WV92TW80N28/edit?usp=sharing) or get the last sources from [svn](https://sourceforge.net/p/strigil/code/HEAD/tree/src/).
+You can download a complete [Strigil package](https://drive.google.com/file/d/0B6zMwJVoIsoQaWR3WV92TW80N28/edit?usp=sharing) or get the last sources from [svn](https://sourceforge.net/p/strigil/code/HEAD/tree/src/).

 -----------------

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jakub Starka</dc:creator><pubDate>Thu, 21 Nov 2013 09:34:34 -0000</pubDate><guid>https://sourceforge.net530190195a316c8ec84f167623015af3192ec563</guid></item><item><title>Home modified by Jakub Starka</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v34
+++ v35
@@ -27,7 +27,7 @@

 # Download

-You can get the last version of Strigil from [svn](https://sourceforge.net/p/strigil/code/HEAD/tree/src/).
+You can download complete [Strigil package](https://drive.google.com/file/d/0B6zMwJVoIsoQaWR3WV92TW80N28/edit?usp=sharing) or get the last sources from [svn](https://sourceforge.net/p/strigil/code/HEAD/tree/src/).

 -----------------

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jakub Starka</dc:creator><pubDate>Thu, 21 Nov 2013 09:33:29 -0000</pubDate><guid>https://sourceforge.net632bd15a2b9a3b464a3712a84e40d473f1ed7e73</guid></item><item><title>Home modified by Jakub Starka</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v33
+++ v34
@@ -28,6 +28,8 @@
 # Download

 You can get the last version of Strigil from [svn](https://sourceforge.net/p/strigil/code/HEAD/tree/src/).
+
+-----------------

 # Documentation

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jakub Starka</dc:creator><pubDate>Thu, 21 Nov 2013 09:20:48 -0000</pubDate><guid>https://sourceforge.net1d7fafb15ddf1bc01823e5da6ed1ba9ea55bd6e9</guid></item><item><title>Home modified by Jakub Starka</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v32
+++ v33
@@ -24,6 +24,10 @@
 * Martin Major

 -----------------
+
+# Download
+
+You can get the last version of Strigil from [svn](https://sourceforge.net/p/strigil/code/HEAD/tree/src/).

 # Documentation

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jakub Starka</dc:creator><pubDate>Thu, 21 Nov 2013 09:18:56 -0000</pubDate><guid>https://sourceforge.netc85b05593a88c1f1ecc9ff976fdf8ac2e60e28bd</guid></item><item><title>Home modified by Jakub Starka</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v31
+++ v32
@@ -31,6 +31,7 @@
 * [User Manual](https://drive.google.com/file/d/0B4On-lGb38CgTGdYV3RmS1hPdTg/edit?usp=sharing)
 * [Developers Guide](https://drive.google.com/file/d/0B4On-lGb38Cga2JxaW5OU3NBbjQ/edit?usp=sharing)
 * [Scraping Script Documentation](https://drive.google.com/file/d/0B4On-lGb38CgWlAyZDhGbDV2TFk/edit?usp=sharing)
+* [Scraping Script schema](https://sourceforge.net/p/strigil/code/HEAD/tree/doc/Scripts/Schema_27_11_2012/Schema_27-11-2012.xsd)

 --------------

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jakub Starka</dc:creator><pubDate>Thu, 21 Nov 2013 09:10:51 -0000</pubDate><guid>https://sourceforge.net79b6c39a0a0bad426a0ea366d61a0ffd0fe80561</guid></item><item><title>Home modified by Jakub Starka</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v30
+++ v31
@@ -1,10 +1,11 @@
 # Introduction

-*Strigil* project was started on [Faculty of Mathematic and Physics](http://www.mff.cuni.cz) on [Charles University in Prague](http://www.cuni.cz) with a purpose to create an easily extendable and usable web scraping tool.
+*Strigil* project was started on [Faculty of Mathematic and Physics](http://www.mff.cuni.cz) on [Charles University in Prague](http://www.cuni.cz) with a purpose to create an easily extendable and usable web scraping tool, that enables one to retrieve a data from textual or weak-structured documents, e.g. [**HTML**](http://cs.wikipedia.org/wiki/HyperText_Markup_Language), spreadsheet documents, etc. 

-It is able to scrape [**HTML**](http://cs.wikipedia.org/wiki/HyperText_Markup_Language) and spreadsheet documents. The tool is managed from a simple web UI, which controls the whole system.
+Additionally, we propose a scraping language inspired by the XSL transformations designed to extract data from di
+fferent kinds of documents. This scraping language is designed to work with an ontology to map scraped data directly to classes and attributes.

-The output of the scraper are RDF data which can be then inserted to a database. The whole project is based on [**JAVA**](http://www.java.com/en/), thus it is easily portable to all widely used platforms.
+The output of the scraper are RDF triple which can be then inserted to a triple store. The tool is based on [**JAVA**](http://www.java.com/en/), thus it is easily portable to all widely used platforms.

 # Table of Contents

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jakub Starka</dc:creator><pubDate>Thu, 21 Nov 2013 09:07:06 -0000</pubDate><guid>https://sourceforge.net643eb0f3743ee3a570778509b69653474e879cf8</guid></item><item><title>Home modified by Jakub Starka</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v29
+++ v30
@@ -10,6 +10,8 @@

 [TOC]

+----------------
+
 # Team

 Project leader is Mgr. Jakub Stárka, Ph.D. Team members:
@@ -19,6 +21,8 @@
 * Rastislav Kadleček
 * Jonáš Klimeš
 * Martin Major
+
+-----------------

 # Documentation

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jakub Starka</dc:creator><pubDate>Wed, 20 Nov 2013 23:28:08 -0000</pubDate><guid>https://sourceforge.net3b3a200f3fa744afc1d8760035529e104028b9bb</guid></item><item><title>Home modified by Jakub Starka</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v28
+++ v29
@@ -26,6 +26,8 @@
 * [User Manual](https://drive.google.com/file/d/0B4On-lGb38CgTGdYV3RmS1hPdTg/edit?usp=sharing)
 * [Developers Guide](https://drive.google.com/file/d/0B4On-lGb38Cga2JxaW5OU3NBbjQ/edit?usp=sharing)
 * [Scraping Script Documentation](https://drive.google.com/file/d/0B4On-lGb38CgWlAyZDhGbDV2TFk/edit?usp=sharing)
+
+--------------

 # Framework Architecture
 ## Main Requirements
@@ -82,6 +84,8 @@

 The deployment model shows again that every component which was mentioned can run on a different machine and can be replicated. This way a high level of scalability can be provided. The throughput for one source can be increased by adding new Proxy servers to the system. The download performance can be increased by adding more Downloaders to the system. And when all of this is not enough even the number of Download Managers can be increased.

+--------------
+
 # Publications

 * Stárka, J., Holubová, I., Nečaský, M.: **Strigil: A Framework for Data Extraction in Semi-Structured Web Documents**, [15th International Conference on Information Integration and Web-based Applications &amp; Services](http://www.iiwas.org/conferences/iiwas2013/), Vienna, Austria, 2013.
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jakub Starka</dc:creator><pubDate>Wed, 20 Nov 2013 23:27:34 -0000</pubDate><guid>https://sourceforge.net8ad3270d4a753a3a67efe025244e3192c49aff82</guid></item><item><title>Home modified by Jakub Starka</title><link>https://sourceforge.net/p/strigil/home/Home/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v27
+++ v28
@@ -84,4 +84,4 @@

 # Publications

-Stárka, J., Holubová, I., Nečaský, M.: **Strigil: A Framework for Data Extraction in Semi-Structured Web Documents**, [15th International Conference on Information Integration and Web-based Applications &amp; Services](http://www.iiwas.org/conferences/iiwas2013/), Vienna, Austria, 2013.
+* Stárka, J., Holubová, I., Nečaský, M.: **Strigil: A Framework for Data Extraction in Semi-Structured Web Documents**, [15th International Conference on Information Integration and Web-based Applications &amp; Services](http://www.iiwas.org/conferences/iiwas2013/), Vienna, Austria, 2013.
&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jakub Starka</dc:creator><pubDate>Wed, 20 Nov 2013 23:25:35 -0000</pubDate><guid>https://sourceforge.net2acf1fc7d4b6072909cab5486cba36c9a0726c36</guid></item></channel></rss>