Hi,
 
Thanks for the reply.
 
It still seems to be trying to index these links with the exclude_urls directive in the main config file and either all the entries on one line or on separate lines separated by a backslash.
 
So no luck yet :-(

 
On 11/29/06, Joe R. Jah <jjah@cloud.ccsf.cc.ca.us> wrote:
On Wed, 29 Nov 2006, Dan Richardson wrote:

> Date: Wed, 29 Nov 2006 12:15:32 +0000
> From: Dan Richardson < dsrmlac@googlemail.com>
> To: htdig-general@lists.sourceforge.net
> Subject: [htdig] exclude_urls problem
>
> Hello,
>
> In my htdig conf file I have:
>
> start_url:              http://www.mydomain.co.uk/
> limit_urls_to:         http://www.mydomain.co.uk/
> exclude_urls:        /usr/local/htdig/conf/excludes

Try this istead:

exclude_urls:        /cgi-bin/ .cgi action=vote&voteid bookmark.html email.html reddit.com del.icio.us www.google.com digg.com ma.gnolia.com www.newsvine.com

All on one line; or lines separated by space-back-slash, " \", at the end
of the line.

> As I understand it both limit_urls and exclude_urls are string patterns, but
> which one takes precedence?

I believe correctly defined exclude_urls takes precedence.

> I have links on my site such as:
> http://reddit.com/submit?url=http://www.mydomain.co.uk/about_us/for_your_site/email.html
>  which contains both exclude_urls and limit_urls strings and htdig seems to
> be trying to index these links, any pointers on how I can definately exclude
> them from an index?
>
> Many thanks in advance,

Regards,

Joe
--
    _/   _/_/_/       _/              ____________    __o
    _/   _/   _/      _/         ______________     _-\<,_
_/  _/   _/_/_/   _/  _/                     ......(_)/ (_)
_/_/ oe _/   _/.  _/_/ ah        jjah@cloud.ccsf.cc.ca.us