<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to scraper_element</title><link>https://sourceforge.net/p/openrokn/home/scraper_element/</link><description>Recent changes to scraper_element</description><atom:link href="https://sourceforge.net/p/openrokn/home/scraper_element/feed" rel="self"/><language>en</language><lastBuildDate>Fri, 09 Nov 2012 19:29:21 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/openrokn/home/scraper_element/feed" rel="self" type="application/rss+xml"/><item><title>WikiPage scraper_element modified by kavulix</title><link>https://sourceforge.net/p/openrokn/home/scraper_element/</link><description>&lt;pre&gt;--- v2
+++ v3
@@ -1,10 +1,10 @@
-&lt;b&gt;Element name:&lt;/b&gt; scraper
+&lt;u&gt;Element name:&lt;/u&gt; &lt;b&gt;scraper&lt;/b&gt;
 
 The scraper element is supported by all items that display a springboard screen (audio, video, slideshow).  You can use the scraper element to fill in missing content on the springboard screen before it's displayed.  This is accomplished by specifying the url of a html page and then using regex with parentheses to match a specific portion of the html page.  That value is then extracted and added to either an attribute on the parent element or it is used to create a new sibling element.  In the example below the url is extracted from the src attribute of a video element on a html page.  It is then added to the parent element (i.e., the video item) as the url attribute.  In order to use scrapers you must set the enablescrapers attribute on the parent item.
 
 ***
 
-&lt;b&gt;Example:&lt;/b&gt;
+&lt;u&gt;Example&lt;/u&gt;
 
     &lt;item type="video" enablescrapers="true"&gt;
        &lt;scraper type="parent" ename="item" aname="url" url="http://yoursite.com/page.htm" regex="&amp;lt;video\s.*?src=\x22(.+?)\x22" limit="1"/&gt;
@@ -12,44 +12,44 @@
 
 ***
 
-&lt;strong&gt;Supported Attributes&lt;/strong&gt;
+&lt;b&gt;Supported Attributes&lt;/b&gt;
 
-&lt;b&gt;Name:&lt;/b&gt; type
-&lt;b&gt;Type:&lt;/b&gt; string
-&lt;b&gt;Supported values:&lt;/b&gt;
+&lt;u&gt;Name:&lt;/u&gt; &lt;b&gt;type&lt;/b&gt;
+&lt;u&gt;Type:&lt;/u&gt; &lt;i&gt;string&lt;/i&gt;
+&lt;u&gt;Supported values&lt;/u&gt;
 parent|sibling
 
-&lt;b&gt;Name:&lt;/b&gt; ename
-&lt;b&gt;Type:&lt;/b&gt; string
-&lt;b&gt;Supported values:&lt;/b&gt;
+&lt;u&gt;Name:&lt;/u&gt; &lt;b&gt;ename&lt;/b&gt;
+&lt;u&gt;Type:&lt;/u&gt; &lt;i&gt;string&lt;/i&gt;
+&lt;u&gt;Supported values&lt;/u&gt;
 the element name
-&lt;b&gt;Notes:&lt;/b&gt;
+&lt;u&gt;Notes&lt;/u&gt;
 If the type attribute is set to parent then the ename attribute will always be item.
 
-&lt;b&gt;Name:&lt;/b&gt; aname
-&lt;b&gt;Type:&lt;/b&gt; string
-&lt;b&gt;Supported values:&lt;/b&gt;
+&lt;u&gt;Name:&lt;/u&gt; &lt;b&gt;aname&lt;/b&gt;
+&lt;u&gt;Type:&lt;/u&gt; &lt;i&gt;string&lt;/i&gt;
+&lt;u&gt;Supported values&lt;/u&gt;
 the attribute name
 
-&lt;b&gt;Name:&lt;/b&gt; url
-&lt;b&gt;Type:&lt;/b&gt; string
-&lt;b&gt;Supported values:&lt;/b&gt;
+&lt;u&gt;Name:&lt;/u&gt; &lt;b&gt;url&lt;/b&gt;
+&lt;u&gt;Type:&lt;/u&gt; &lt;i&gt;string&lt;/i&gt;
+&lt;u&gt;Supported values&lt;/u&gt;
 any url pointing to a html or text document
 
-&lt;b&gt;Name:&lt;/b&gt; regex
-&lt;b&gt;Type:&lt;/b&gt; string
-&lt;b&gt;Supported values:&lt;/b&gt;
+&lt;u&gt;Name:&lt;/u&gt; &lt;b&gt;regex&lt;/b&gt;
+&lt;u&gt;Type:&lt;/u&gt; &lt;i&gt;string&lt;/i&gt;
+&lt;u&gt;Supported values&lt;/u&gt;
 a regular expression that includes parentheses
 
-&lt;b&gt;Name:&lt;/b&gt; limit
-&lt;b&gt;Type:&lt;/b&gt; integer
-&lt;b&gt;Supported values:&lt;/b&gt;
+&lt;u&gt;Name:&lt;/u&gt; &lt;b&gt;limit&lt;/b&gt;
+&lt;u&gt;Type:&lt;/u&gt; &lt;i&gt;integer&lt;/i&gt;
+&lt;u&gt;Supported values&lt;/u&gt;
 any integer
-&lt;b&gt;Notes:&lt;/b&gt;
+&lt;u&gt;Notes&lt;/u&gt;
 This attribute value will limit the number of matches returned by the regex. If you are scraping image urls to create a slideshow and you only want to display the first 5 image urls found then you would set this attribute to 5.
 
 ***
 
-&lt;strong&gt;Supported Child Nodes&lt;/strong&gt;
+&lt;b&gt;Supported Child Nodes&lt;/b&gt;
 
 NONE
&lt;/pre&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">kavulix</dc:creator><pubDate>Fri, 09 Nov 2012 19:29:21 -0000</pubDate><guid>https://sourceforge.netd9cc1286bdfc564c055c0076348046308e7742a1</guid></item><item><title>WikiPage scraper_element modified by kavulix</title><link>https://sourceforge.net/p/openrokn/home/scraper_element/</link><description>&lt;pre&gt;--- v1
+++ v2
@@ -7,7 +7,7 @@
 &lt;b&gt;Example:&lt;/b&gt;
 
     &lt;item type="video" enablescrapers="true"&gt;
-       &lt;scraper type="parent" ename="item" aname="url" url="http://yoursite.com/page.htm" regex="&amp;lt;video\s.*?src=\x22(.+?)\x22"/&gt;
+       &lt;scraper type="parent" ename="item" aname="url" url="http://yoursite.com/page.htm" regex="&amp;lt;video\s.*?src=\x22(.+?)\x22" limit="1"/&gt;
     &lt;/item&gt;
 
 ***
@@ -41,6 +41,13 @@
 &lt;b&gt;Supported values:&lt;/b&gt;
 a regular expression that includes parentheses
 
+&lt;b&gt;Name:&lt;/b&gt; limit
+&lt;b&gt;Type:&lt;/b&gt; integer
+&lt;b&gt;Supported values:&lt;/b&gt;
+any integer
+&lt;b&gt;Notes:&lt;/b&gt;
+This attribute value will limit the number of matches returned by the regex. If you are scraping image urls to create a slideshow and you only want to display the first 5 image urls found then you would set this attribute to 5.
+
 ***
 
 &lt;strong&gt;Supported Child Nodes&lt;/strong&gt;
&lt;/pre&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">kavulix</dc:creator><pubDate>Fri, 09 Nov 2012 05:55:05 -0000</pubDate><guid>https://sourceforge.netdbfeeee496905d9b71b0a9ca12df97cc354db551</guid></item><item><title>WikiPage scraper_element modified by kavulix</title><link>https://sourceforge.net/p/openrokn/home/scraper_element/</link><description>&lt;b&gt;Element name:&lt;/b&gt; scraper

The scraper element is supported by all items that display a springboard screen (audio, video, slideshow).  You can use the scraper element to fill in missing content on the springboard screen before it's displayed.  This is accomplished by specifying the url of a html page and then using regex with parentheses to match a specific portion of the html page.  That value is then extracted and added to either an attribute on the parent element or it is used to create a new sibling element.  In the example below the url is extracted from the src attribute of a video element on a html page.  It is then added to the parent element (i.e., the video item) as the url attribute.  In order to use scrapers you must set the enablescrapers attribute on the parent item.

***

&lt;b&gt;Example:&lt;/b&gt;

    &lt;item type="video" enablescrapers="true"&gt;
       &lt;scraper type="parent" ename="item" aname="url" url="http://yoursite.com/page.htm" regex="&amp;lt;video\s.*?src=\x22(.+?)\x22"/&gt;
    &lt;/item&gt;

***

&lt;strong&gt;Supported Attributes&lt;/strong&gt;

&lt;b&gt;Name:&lt;/b&gt; type
&lt;b&gt;Type:&lt;/b&gt; string
&lt;b&gt;Supported values:&lt;/b&gt;
parent|sibling

&lt;b&gt;Name:&lt;/b&gt; ename
&lt;b&gt;Type:&lt;/b&gt; string
&lt;b&gt;Supported values:&lt;/b&gt;
the element name
&lt;b&gt;Notes:&lt;/b&gt;
If the type attribute is set to parent then the ename attribute will always be item.

&lt;b&gt;Name:&lt;/b&gt; aname
&lt;b&gt;Type:&lt;/b&gt; string
&lt;b&gt;Supported values:&lt;/b&gt;
the attribute name

&lt;b&gt;Name:&lt;/b&gt; url
&lt;b&gt;Type:&lt;/b&gt; string
&lt;b&gt;Supported values:&lt;/b&gt;
any url pointing to a html or text document

&lt;b&gt;Name:&lt;/b&gt; regex
&lt;b&gt;Type:&lt;/b&gt; string
&lt;b&gt;Supported values:&lt;/b&gt;
a regular expression that includes parentheses

***

&lt;strong&gt;Supported Child Nodes&lt;/strong&gt;

NONE</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">kavulix</dc:creator><pubDate>Fri, 09 Nov 2012 02:16:17 -0000</pubDate><guid>https://sourceforge.net9f8bc2bc29126946177a970cce17fbb6fc6727cc</guid></item></channel></rss>