From: Tom A. <tal...@ta...> - 2004-04-29 10:27:31
Roman Suzi wrote:
> Your answer can be found in the Pyzor source code. The code is easy to read,
> and you will see that before making hash Pyzor removes URLs, spaces,
> and other things.
>
> But I must admit that large amount of spam today uses randomizing
> techniques which Pyzor does not like. So the only solution is to
> use rule-based spam-catcher (like SpamAssassin) + Bayes filter
> (like spamoracle, etc) + some SMTP-level heuristics to catch
> spam.
>
> Probably banning HTML emails will have devastating effect on spam too ;-)
>
> Sincerely yours, Roman Suzi

Thank you for the reply. I wasn't sure how much randomization spam had in it. I did some tests and found that only 0.2% of my spam has my email address in it (in some form). But I could not test the blocks of Bayesian gook at the end. bogofilter is my spam filter of choice.
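
For anyone following along, here is a rough sketch of the idea Roman describes: strip the parts of a message that spammers vary before hashing it. This is not Pyzor's actual code. The function names (normalize, fuzzy_digest), the regular expressions, the choice to drop email addresses, and the use of SHA-1 are all my own assumptions for illustration; the real rules are in the Pyzor source.

```python
import hashlib
import re

# Hypothetical patterns for illustration only; Pyzor's real rules may differ.
URL_RE = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)
EMAIL_RE = re.compile(r"\S+@\S+")
WHITESPACE_RE = re.compile(r"\s+")


def normalize(body: str) -> str:
    """Remove URLs, email addresses, and all whitespace from a message body."""
    text = URL_RE.sub("", body)
    text = EMAIL_RE.sub("", text)
    return WHITESPACE_RE.sub("", text)


def fuzzy_digest(body: str) -> str:
    """Hash the normalized body, so copies that differ only in the
    stripped parts produce the same hex digest."""
    return hashlib.sha1(normalize(body).encode("utf-8")).hexdigest()


if __name__ == "__main__":
    a = "Buy now at http://spam.example/a1 !\nReply to bob@example.com"
    b = "Buy  now at http://spam.example/zz9 !\n\nReply to alice@example.org"
    print(fuzzy_digest(a) == fuzzy_digest(b))
```

Two copies that differ only in the URL, the recipient address, or spacing hash the same under this scheme, which would explain why a per-recipient address in the body does not defeat it. Random word blocks at the end would still change the digest, which is the weakness Roman mentions.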