Update of /cvsroot/tess/homepage
In directory sc8-pr-cvs1:/tmp/cvs-serv1706/homepage
Modified Files:
index.html
Log Message:
More homepage stuff
Index: index.html
===================================================================
RCS file: /cvsroot/tess/homepage/index.html,v
retrieving revision 1.3
retrieving revision 1.4
diff -C2 -d -r1.3 -r1.4
*** index.html 6 Jan 2003 02:14:46 -0000 1.3
--- index.html 7 Jan 2003 04:54:41 -0000 1.4
***************
*** 24,27 ****
--- 24,29 ----
<li><a href="#well">How well does it work?</a>
<li><a href="#use">How do I use it?</a>
+ <li><a href="#what">What's it do?</a>
+ <li><a href="#why">Why another program?</a>
<li><a href="http://sourceforge.net/projects/tess">SourceForge page</a>
</ul>
***************
*** 38,41 ****
--- 40,47 ----
<p>
<a name="use">The program is really rather flexible, and can be used in several different ways. The only real requirements are that you need to have python (I use 2.2) and you have to be able to invoke a python script to deliver mail. That way the script can put the incoming message into one of two boxes: either "regular incoming", or "spam incoming". After that, the options vary a lot depending on how YOU read and handle your mail. What I do is save all messages that are verified spam (I've checked to make sure the messages in the spam box are really spam, or I move spam that was not correctly identified) to a "VerifiedSpam" mailbox. Regular email that I'm done with, I just delete, which moves it to the "Deleted" mailbox. Then, every time I get a new message, the contents of "VerifiedSpam" and "Deleted" are consumed (parsed and deleted) by TeSS and added to the appropriate word database. All of this is handled in the single command in the .forward file.</a>
+ <p>
+ <a name="details">The fine details... Assuming my configuration as described <a href="#use">above</a>: Incoming mail is [mime decoded and then] broken into tokens - each token is (lowercased) a-z, 0-9, "-", and "'". Each token must have at least one letter in it. Each token may be at most 20 characters. These tokens are then looked up as described in <a href="http://www.paulgraham.com/spam.html" target=_blank>Paul Graham's paper</a>. If it is spam (> 80% probable), it is put in the spam box. If it is not, it is put in the Inbox. All messages in the Deleted box are parsed the same way, and each token is added to the "good words" file (some db file according to your platform - dmb, gdbm, whatever python decides). Then that mail file is truncated to 0 length. The same thing happens to the messages in the VerifiedSpam mailbox. That's all there is to it... I check my spam box every once in a while to make sure no good messages got dumped, and then I move them all to the VerifiedSpam box to be consumed. Regular email I delete as usual (which my mail program moves to a Deleted box). Eventually I may decide to dump Spam directly to VerifiedSpam without looking at it.</a>
+ <p>
+ <a name="why">Why did I write this where there are so many other anti-spam programs out ther? There were 2 compelling reasons: I use multiple clients, so I needed a server-based solution; I don't like or want to mess with qmail or procmail, which virtually all other server solutions seem to want to use.</a>
<p>
  <!-- add a spacer -->
|