#16 exclude by URL pattern

Status: open
Owner: nobody
Labels: None
Priority: 5
Updated: 2012-11-19
Created: 2006-05-08
Creator: Lars Huttar
Private: No

Hi...
GetLeft looks good.
The only limitation that gets in the way of my
requirements is that you can only exclude URLs based on
extension (*.jpg etc.). I'm trying to test a web
application that's implemented using a dynamic
programming framework, so the nature of the page is
reflected not in the extension but in the general
pattern of the URL. So I would like to be able to
exclude e.g. all URLs of the pattern
"http://mamre/mount/foo/*/store" because those are the
ones that regenerate a lot of data and take a long time.

So that would be my request -- allow the user to
exclude URLs from the download by matching them against
a URL pattern.
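
To make the request concrete, here is a minimal sketch of the kind of matching I have in mind. It is written in Python purely for illustration and is not GetLeft code; the names `EXCLUDE_PATTERNS`, `is_excluded` and `filter_urls` are hypothetical. It assumes shell-style wildcards, where `*` matches any run of characters (including `/`) and `?` and `[...]` are also treated as wildcards.

```python
from fnmatch import fnmatchcase

# Hypothetical exclusion list; the pattern is the one from this request.
EXCLUDE_PATTERNS = [
    "http://mamre/mount/foo/*/store",
]


def is_excluded(url, patterns=EXCLUDE_PATTERNS):
    """Return True if the URL matches any shell-style exclusion pattern."""
    # fnmatchcase compares case-sensitively and matches the whole string.
    return any(fnmatchcase(url, pattern) for pattern in patterns)


def filter_urls(urls, patterns=EXCLUDE_PATTERNS):
    """Keep only the URLs that do not match an exclusion pattern."""
    return [url for url in urls if not is_excluded(url, patterns)]
```

With this sketch, `is_excluded("http://mamre/mount/foo/bar/store")` is true, while a URL such as "http://mamre/mount/foo/bar/view" would still be downloaded.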

Discussion

  • ajx
    2008-05-02

    Logged In: YES
    user_id=1432814
    Originator: NO

    Additionally, it would be nice to have an option to include only
    URLs matching a specific pattern, or to restrict downloads to URLs
    containing a specific word, e.g. the URL must include "test":
    http://example.com/test.html -> will be downloaded
    http://example.com/e.html -> won't be downloaded
    http://example.com/test/e.html -> will be downloaded
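
The include rule from the comment above can be sketched the same way. This is an illustrative Python snippet, not part of GetLeft; `is_included` is a hypothetical name, and it assumes a plain case-sensitive substring test against the whole URL.

```python
def is_included(url, required="test"):
    """Return True if the URL contains the required word anywhere."""
    # Plain substring test: the word may appear in the host, path or query.
    return required in url


urls = [
    "http://example.com/test.html",
    "http://example.com/e.html",
    "http://example.com/test/e.html",
]

# Keep only the URLs that satisfy the include rule.
to_download = [url for url in urls if is_included(url)]
```

Here `to_download` keeps the first and third URLs and drops "http://example.com/e.html", matching the three cases listed in the comment.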