<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to ReleasesValidator</title><link>https://sourceforge.net/p/nntpit/wiki/ReleasesValidator/</link><description>Recent changes to ReleasesValidator</description><atom:link href="https://sourceforge.net/p/nntpit/wiki/ReleasesValidator/feed" rel="self"/><language>en</language><lastBuildDate>Mon, 08 Apr 2013 22:14:00 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/nntpit/wiki/ReleasesValidator/feed" rel="self" type="application/rss+xml"/><item><title>Discussion for ReleasesValidator page</title><link>https://sourceforge.net/p/nntpit/wiki/ReleasesValidator/</link><description>&lt;div class="markdown_content"&gt;&lt;p&gt;This settings do not match the current code:&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;ValidOutputPath&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;    &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;valid&lt;/span&gt; &lt;span class="n"&gt;output&lt;/span&gt; &lt;span class="n"&gt;path&lt;/span&gt;
&lt;span class="n"&gt;InValidOutputPath&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;  &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;invalid&lt;/span&gt; &lt;span class="n"&gt;output&lt;/span&gt; &lt;span class="n"&gt;path&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;


&lt;p&gt;Instead use:&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;OutputPathValid&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;    &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;valid&lt;/span&gt; &lt;span class="n"&gt;output&lt;/span&gt; &lt;span class="n"&gt;path&lt;/span&gt;
&lt;span class="n"&gt;OutputPathNotValid&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;invalid&lt;/span&gt; &lt;span class="n"&gt;output&lt;/span&gt; &lt;span class="n"&gt;path&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;

&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Lee Thompson</dc:creator><pubDate>Mon, 08 Apr 2013 22:14:00 -0000</pubDate><guid>https://sourceforge.net24f2826fae3e6d8a3b67f5d17b7a393cfe5838d3</guid></item><item><title>WikiPage ReleasesValidator modified by Shaun</title><link>https://sourceforge.net/p/nntpit/wiki/ReleasesValidator/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v4
+++ v5
@@ -16,7 +16,7 @@

 In the case of RAR password checking just the first segment (part) of the RAR file is downloaded and analyses. This will incur a download data hit of a few hundred KB of data but it allows exclusion of passworded releases which can be a pain when building indexes.

-To extract RAR file data and check for passworded files inside other RAR files the validator uses the freeware unrar tool available from RarLabs [http://www.rarlab.com/rar_add.htm]. To do deep inspection of RAR files you need to download the unrar tool for your platform from the above page and extract it to the same path as the ReleasesValidator.exe.
+To extract RAR file data and check for passworded files inside other RAR files the validator uses the freeware unrar tool available from RarLabs . To do deep inspection of RAR files you need to download the unrar tool for your platform from the above page and extract it to the same path as the ReleasesValidator.exe.

 Once the release has been validated it is copied to one of two output paths, a directory containing valid releases and a directory containing invalid releases. The invalid releases can be deleted on a daily bases is they are no longer needed.

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Shaun</dc:creator><pubDate>Tue, 05 Mar 2013 12:24:20 -0000</pubDate><guid>https://sourceforge.net208843c88ba87c0bc7624e7b8b24f390ed9877fd</guid></item><item><title>WikiPage ReleasesValidator modified by Shaun</title><link>https://sourceforge.net/p/nntpit/wiki/ReleasesValidator/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v3
+++ v4
@@ -22,7 +22,8 @@

 By default ReleasesValidator.exe tries to load config data from "validator.conf" you can also specify the config file name on the command line

-ReleasesValidator.exe "config file name"
+Usage:
+***ReleasesValidator.exe "config file name"***

 The config file has the following options:

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Shaun</dc:creator><pubDate>Fri, 22 Feb 2013 00:33:02 -0000</pubDate><guid>https://sourceforge.net89774da657d8d5ce4a29720fac67f7946d7c6fbc</guid></item><item><title>WikiPage ReleasesValidator modified by Shaun</title><link>https://sourceforge.net/p/nntpit/wiki/ReleasesValidator/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v2
+++ v3
@@ -16,7 +16,7 @@

 In the case of RAR password checking just the first segment (part) of the RAR file is downloaded and analyses. This will incur a download data hit of a few hundred KB of data but it allows exclusion of passworded releases which can be a pain when building indexes.

-To extract RAR file data and check for passworded files inside other RAR files the validator uses the freeware unrar tool available from RarLabs [http://www.rarlab.com/rar_add.htm]. To do deep inspection of RAR files you need to download the unrar tool for you platform from the above page and extract it to the same path as the ReleasesValidator.exe.
+To extract RAR file data and check for passworded files inside other RAR files the validator uses the freeware unrar tool available from RarLabs [http://www.rarlab.com/rar_add.htm]. To do deep inspection of RAR files you need to download the unrar tool for your platform from the above page and extract it to the same path as the ReleasesValidator.exe.

 Once the release has been validated it is copied to one of two output paths, a directory containing valid releases and a directory containing invalid releases. The invalid releases can be deleted on a daily bases is they are no longer needed.

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Shaun</dc:creator><pubDate>Fri, 22 Feb 2013 00:20:06 -0000</pubDate><guid>https://sourceforge.net27014834dcb005eb90feadeaace1b61681362617</guid></item><item><title>WikiPage ReleasesValidator modified by Shaun</title><link>https://sourceforge.net/p/nntpit/wiki/ReleasesValidator/</link><description>&lt;div class="markdown_content"&gt;&lt;pre&gt;--- v1
+++ v2
@@ -15,6 +15,8 @@
 The first few checks are trivial and only require the data that is already available in the release NZB file. The more comprehensive RAR password testing and ultimately any file content testing needs to actually download parts or whole files to be further processed.

 In the case of RAR password checking just the first segment (part) of the RAR file is downloaded and analyses. This will incur a download data hit of a few hundred KB of data but it allows exclusion of passworded releases which can be a pain when building indexes.
+
+To extract RAR file data and check for passworded files inside other RAR files the validator uses the freeware unrar tool available from RarLabs [http://www.rarlab.com/rar_add.htm]. To do deep inspection of RAR files you need to download the unrar tool for you platform from the above page and extract it to the same path as the ReleasesValidator.exe.

 Once the release has been validated it is copied to one of two output paths, a directory containing valid releases and a directory containing invalid releases. The invalid releases can be deleted on a daily bases is they are no longer needed.

&lt;/pre&gt;
&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Shaun</dc:creator><pubDate>Fri, 22 Feb 2013 00:19:24 -0000</pubDate><guid>https://sourceforge.net32c4dc80773828c805fb6e4b8782d34e136f9c9d</guid></item><item><title>WikiPage ReleasesValidator modified by Shaun</title><link>https://sourceforge.net/p/nntpit/wiki/ReleasesValidator/</link><description>&lt;div class="markdown_content"&gt;&lt;h1 id="releasesvalidator"&gt;ReleasesValidator&lt;/h1&gt;
&lt;p&gt;The job of the validator is to confirm the release is valid, not just from a completeness point of view i.e. all files and parts of files are available as this is done by the extractor. But that the release is useable and fit for use in a system. There are a number of tests that can be run and depending on what your target use is of the releases different tests can be used.&lt;/p&gt;
&lt;p&gt;The current implementation is outlined on the &lt;a class="alink" href="/p/nntpit/wiki/How%20to%20Index%20NNTP%20Usenet%20Groups/"&gt;[How to Index NNTP Usenet Groups]&lt;/a&gt; page. &lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Release must not contain just a single file that is an NZB file&lt;/li&gt;
&lt;li&gt;Check release subject for black listed RegEx matches&lt;/li&gt;
&lt;li&gt;Check RAR files for password&lt;ul&gt;
&lt;li&gt;Download first segment&lt;/li&gt;
&lt;li&gt;Check for password header&lt;/li&gt;
&lt;li&gt;Extract file list and look for names starting with *&lt;/li&gt;
&lt;li&gt;Extract files and check for rars, if any check them for passwords&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The first few checks are trivial and only require the data that is already available in the release NZB file. The more comprehensive RAR password testing and ultimately any file content testing needs to actually download parts or whole files to be further processed.&lt;/p&gt;
&lt;p&gt;In the case of RAR password checking just the first segment (part) of the RAR file is downloaded and analyses. This will incur a download data hit of a few hundred KB of data but it allows exclusion of passworded releases which can be a pain when building indexes.&lt;/p&gt;
&lt;p&gt;Once the release has been validated it is copied to one of two output paths, a directory containing valid releases and a directory containing invalid releases. The invalid releases can be deleted on a daily bases is they are no longer needed.&lt;/p&gt;
&lt;p&gt;By default ReleasesValidator.exe tries to load config data from "validator.conf" you can also specify the config file name on the command line&lt;/p&gt;
&lt;p&gt;ReleasesValidator.exe "config file name"&lt;/p&gt;
&lt;p&gt;The config file has the following options:&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;NntpServer&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;         &lt;span class="n"&gt;your&lt;/span&gt; &lt;span class="n"&gt;news&lt;/span&gt; &lt;span class="n"&gt;server&lt;/span&gt;
&lt;span class="n"&gt;NntpUser&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;           &lt;span class="n"&gt;your&lt;/span&gt; &lt;span class="n"&gt;user&lt;/span&gt; &lt;span class="n"&gt;name&lt;/span&gt;
&lt;span class="n"&gt;NntpPassword&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;       &lt;span class="n"&gt;your&lt;/span&gt; &lt;span class="n"&gt;password&lt;/span&gt;
&lt;span class="n"&gt;InputPath&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;          &lt;span class="n"&gt;path&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;NZB&lt;/span&gt; &lt;span class="n"&gt;files&lt;/span&gt; &lt;span class="n"&gt;to&lt;/span&gt; &lt;span class="n"&gt;be&lt;/span&gt; &lt;span class="n"&gt;tested&lt;/span&gt;
&lt;span class="n"&gt;ValidOutputPath&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;    &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;valid&lt;/span&gt; &lt;span class="n"&gt;output&lt;/span&gt; &lt;span class="n"&gt;path&lt;/span&gt;
&lt;span class="n"&gt;InValidOutputPath&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;  &lt;span class="n"&gt;the&lt;/span&gt; &lt;span class="n"&gt;invalid&lt;/span&gt; &lt;span class="n"&gt;output&lt;/span&gt; &lt;span class="n"&gt;path&lt;/span&gt;
&lt;span class="n"&gt;BlackListedSubject&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt; &lt;span class="n"&gt;RegEx&lt;/span&gt; &lt;span class="n"&gt;with&lt;/span&gt; &lt;span class="n"&gt;black&lt;/span&gt; &lt;span class="n"&gt;listed&lt;/span&gt; &lt;span class="n"&gt;subject&lt;/span&gt; &lt;span class="n"&gt;text&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;


&lt;p&gt;You can repeat the last config option multiple times&lt;/p&gt;
&lt;div class="codehilite"&gt;&lt;pre&gt;&lt;span class="n"&gt;BlackListedSubject&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="n"&gt;RegEx01&lt;/span&gt;
&lt;span class="n"&gt;BlackListedSubject&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="n"&gt;RegEx02&lt;/span&gt;
&lt;span class="n"&gt;BlackListedSubject&lt;/span&gt;&lt;span class="p"&gt;=&lt;/span&gt;&lt;span class="n"&gt;RegEx03&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;


&lt;p&gt;This will load all the black list RegEx in and test each one of a release subject for blacklist matches.&lt;/p&gt;&lt;/div&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Shaun</dc:creator><pubDate>Fri, 22 Feb 2013 00:10:05 -0000</pubDate><guid>https://sourceforge.net49f061e4cd3ed5b685ad2f87349657217b13986f</guid></item></channel></rss>