From: Edward Elliott <webcleaner@ed...> - 2006-05-13 00:39:44
Note: I'm resending this because it appears to have gotten lost during the
problem sourceforge had with their list server today.
I recently installed webcleaner and went through most of the documentation.
It really looks fantastic, the elegant proxy I've always wanted. However I
have a few unanswered questions:
1) Is there a convenient enable/disable toggle to switch webcleaner between
full filtering mode and a transparent pass-through mode where requests are
served with no filtering? I'm thinking in terms of Privoxy's toggle url.
This may be better handled by the browser, but that's not always an option.
2) Is there a convenient way to load a single url in pass-through mode?
E.g. Privoxy looks for '/PRIVOXY-FORCE' at the beginning of the path, and
Middleman wants bypass.. between the scheme and the host name. Middleman
actually uses a number of elegant "url commands", as shown here:
(scroll down to Url Commands header, can't link directly to the section)
3) Is there a convenient way to show which filters were applied to a
particular url? Privoxy does this with http://[proxy]/show-url-info while
Middleman has the filter.. and profiles.. url commands. If not, how do you
tell which rules are being applied when a page goes haywire? Manually
tracing through the list of filters seems unmanageable.
If the answer to any of these is no (particularly 2 and especially 3), what
would it take to add those features? I'm very comfortable in Python, but I
have no idea where to start in the code base. Any pointers and suggestions
would be greatly appreciated.
Thanks, and keep up the good work.
-----BEGIN PGP SIGNED MESSAGE-----
thanks for your feedback!
Edward Elliott schrieb:
> 1) Is there a convenient enable/disable toggle to switch webcleaner bet=
> full filtering mode and a transparent pass-through mode where requests =
> served with no filtering?
No, there is no configuration shortcut. Currently I am using the
switchproxy Firefox plugin, which is sufficient for me at the moment. I
will put the pass-through feature on the todo list.
> 2) Is there a convenient way to load a single url in pass-through mode?
You can append ".wc-nofilter" to any URL hostname to disable all
filtering for this particular URL. Single filters can also be disabled
Disable all filters of http://hullabulla.org/path/to/page.html:
Disable only blocker filter:
> 3) Is there a convenient way to show which filters were applied to a
> particular url?
Not really. There is a filter debugging mode, but this is only intended
for development/debugging, and outputs to the log file webcleaner.log.
To activate it, edit /etc/webcleaner/logging.conf and set:
Plus, you have to start webcleaner without the -O python flag since all
the debug statements are assert's and therefore disabled when starting
So a really short overview over what filter did what is not there.
> If the answer to any of these is no (particularly 2 and especially 3), =
> would it take to add those features? I'm very comfortable in Python, b=
> have no idea where to start in the code base. Any pointers and suggest=
> would be greatly appreciated.
Perhaps the best place to investigate the filtering is
wc/filter/__init__.py and its applyfilter() function. The documentation
there is not very good, but it is a start ;)
,''`. Bastian Kleineidam
: :' : GnuPG Schl=FCssel
`. `' gpg --keyserver wwwkeys.pgp.net --recv-keys 32EC6F3E
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.3 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
-----END PGP SIGNATURE-----