Thread: [Assorted-commits] SF.net SVN: assorted: [712] mailing-list-filter/trunk/README
From: <yan...@us...> - 2008-05-08 06:03:23
Revision: 712
          http://assorted.svn.sourceforge.net/assorted/?rev=712&view=rev
Author:   yangzhang
Date:     2008-05-07 23:03:28 -0700 (Wed, 07 May 2008)

Log Message:
-----------
added an overview readme

Added Paths:
-----------
    mailing-list-filter/trunk/README

Added: mailing-list-filter/trunk/README
===================================================================
--- mailing-list-filter/trunk/README	(rev 0)
+++ mailing-list-filter/trunk/README	2008-05-08 06:03:28 UTC (rev 712)
@@ -0,0 +1,32 @@
+% Mailing List Filter
+% Yang Zhang
+
+Overview
+--------
+
+I have a Gmail account that I use for subscribing to and posting to mailing
+lists. When dealing with high-volume mailing lists, I am typically only
+interested in those threads that I participated in. This is a simple filter
+for starring and marking unread any messages belonging to such threads.
+
+This is accomplished by looking at the set of messages that were either sent
+from me or explicitly addressed to me. From this "root set" of messages, we
+can use the `Message-ID`, `References`, and `In-Reply-To` headers to determine
+threads, and thus the other messages that we care about.
+
+I have found this to be more accurate than my two original approaches. I used
+to have Gmail filters that starred/marked unread any messages containing my
+name anywhere in the message. This worked OK since my name is not too common,
+but it produced some false positives (not that bad, just unstar messages) and
+some false negatives (much harder to detect).
+
+A second approach is to tag all subjects with some signature string. This
+usually is fine, but it doesn't work when you did not start the thread (and
+thus determine the subject). You can try to change the subject line, but this
+is (1) poor netiquette, (2) unreliable because your reply may not register in
+other mail clients as being part of the same thread (and thus other
+participants may miss your reply), and (3) unreliable because replies might not
+directly reference your post (either intentionally or unintentionally). It
+also fails when others change the subject. Finally, this approach is
+unsatisfactory because it pollutes subject lines, and it essentially replicates
+exactly what Message-ID was intended for.

This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
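
The thread detection described in the README overview can be sketched roughly as follows. This is not the script's actual code, only a minimal illustration of the idea: treat `Message-ID`, `References`, and `In-Reply-To` as links between messages, then collect everything reachable from the root set. The function names (`message_ids`, `is_root`, `interesting_ids`) and the choice of `To`/`Cc` as the meaning of "explicitly addressed to me" are assumptions made for this sketch.

```python
import re
from collections import defaultdict
from email import message_from_string
from email.utils import getaddresses

# Message IDs appear in headers as <local@domain> tokens.
MSGID_RE = re.compile(r"<[^<>\s]+>")


def message_ids(header_value):
    """Extract every <...> message ID from a header value (None gives [])."""
    return MSGID_RE.findall(header_value or "")


def is_root(msg, my_address):
    """Assumed root-set rule: my address appears in From, To, or Cc."""
    fields = []
    for name in ("From", "To", "Cc"):
        fields.extend(msg.get_all(name, []))
    addresses = {addr.lower() for _, addr in getaddresses(fields)}
    return my_address.lower() in addresses


def interesting_ids(messages, my_address):
    """Return the Message-IDs of all messages sharing a thread with a root."""
    links = defaultdict(set)  # undirected graph over message IDs
    roots = set()
    present = set()           # IDs of messages we actually hold
    for msg in messages:
        ids = message_ids(msg.get("Message-ID"))
        if not ids:
            continue
        mid = ids[0]
        present.add(mid)
        refs = message_ids(msg.get("References")) + message_ids(msg.get("In-Reply-To"))
        for ref in refs:
            links[mid].add(ref)
            links[ref].add(mid)
        if is_root(msg, my_address):
            roots.add(mid)
    # Flood-fill outward from the root set along the reference links.
    seen = set(roots)
    stack = list(roots)
    while stack:
        current = stack.pop()
        for neighbor in links[current]:
            if neighbor not in seen:
                seen.add(neighbor)
                stack.append(neighbor)
    # Referenced IDs that are absent from the mailbox are dropped.
    return seen & present
```

Given messages parsed with `email.message_from_string`, calling `interesting_ids(messages, "me@example.com")` returns the IDs to star and mark unread. Messages in threads that never touch the root set are left out.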
From: <yan...@us...> - 2008-05-08 06:54:30
Revision: 714
          http://assorted.svn.sourceforge.net/assorted/?rev=714&view=rev
Author:   yangzhang
Date:     2008-05-07 23:54:08 -0700 (Wed, 07 May 2008)

Log Message:
-----------
added to readme

Modified Paths:
--------------
    mailing-list-filter/trunk/README

Modified: mailing-list-filter/trunk/README
===================================================================
--- mailing-list-filter/trunk/README	2008-05-08 06:05:12 UTC (rev 713)
+++ mailing-list-filter/trunk/README	2008-05-08 06:54:08 UTC (rev 714)
@@ -30,3 +30,8 @@
 also fails when others change the subject. Finally, this approach is
 unsatisfactory because it pollutes subject lines, and it essentially replicates
 exactly what Message-ID was intended for.
+
+This script is not intended to be a replacement for the Gmail filters. I still
+keep those active so that I can get immediate first-pass filtering. I execute
+this script on a daily basis to perform second-pass filtering/unfiltering to
+catch those false negatives that may have been missed.
From: <yan...@us...> - 2008-05-08 08:04:03
Revision: 720
          http://assorted.svn.sourceforge.net/assorted/?rev=720&view=rev
Author:   yangzhang
Date:     2008-05-08 01:04:04 -0700 (Thu, 08 May 2008)

Log Message:
-----------
added todos and setup to readme

Modified Paths:
--------------
    mailing-list-filter/trunk/README

Modified: mailing-list-filter/trunk/README
===================================================================
--- mailing-list-filter/trunk/README	2008-05-08 08:03:44 UTC (rev 719)
+++ mailing-list-filter/trunk/README	2008-05-08 08:04:04 UTC (rev 720)
@@ -1,6 +1,3 @@
-% Mailing List Filter
-% Yang Zhang
-
 Overview
 --------
 
@@ -35,3 +32,32 @@
 keep those active so that I can get immediate first-pass filtering. I execute
 this script on a daily basis to perform second-pass filtering/unfiltering to
 catch those false negatives that may have been missed.
+
+Setup
+-----
+
+Requirements:
+
+- [argparse](http://argparse.python-hosting.com/)
+- [Python Commons](http://assorted.sf.net/python-commons/) 0.4
+- [path](http://www.jorendorff.com/articles/python/path/)
+
+Install the program using the standard `setup.py` program.
+
+Future Work Ideas
+-----------------
+
+- Currently, we assume that the server specification points to a mailbox
+  containing all messages (both sent and received), and a message is determined
+  to have been sent by you by looking at the From: header field. This works
+  well with Gmail. An alternative strategy is to look through two folders, one
+  that's the Inbox and one that's the Sent mailbox, and treat all messages in
+  Sent as having been sent by you. This is presumably how most other IMAP
+  servers work.
+
+- Implement incremental maintenance of local cache.
+
+- Accept custom operations for filtered/unfiltered messages
+  (trashing/untrashing, labeling/unlabeling, etc.).
+
+- Refactor the message fetching/management part out into its own library.
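
The two-folder strategy from the first future-work item might look something like the sketch below. It is a hypothetical illustration, not code from the project: the function name `root_set_two_folders` and its arguments are invented here, and the messages are assumed to have already been fetched and parsed into `email.message.Message` objects, one list per folder.

```python
from email import message_from_string
from email.utils import getaddresses


def root_set_two_folders(inbox_msgs, sent_msgs, my_address):
    """Return root-set Message-IDs using separate Inbox and Sent folders."""
    roots = set()
    # Everything in Sent counts as sent by me; From: is never consulted.
    for msg in sent_msgs:
        mid = msg.get("Message-ID")
        if mid:
            roots.add(mid.strip())
    # In the Inbox, keep only messages explicitly addressed to me
    # (assumed here to mean my address appears in To or Cc).
    me = my_address.lower()
    for msg in inbox_msgs:
        mid = msg.get("Message-ID")
        recipients = getaddresses(msg.get_all("To", []) + msg.get_all("Cc", []))
        if mid and any(addr.lower() == me for _, addr in recipients):
            roots.add(mid.strip())
    return roots
```

One consequence of this design is that mail sent under an alias or a different From: address still lands in the root set, as long as the server files it under Sent.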
From: <yan...@us...> - 2008-05-08 19:06:23
Revision: 729
          http://assorted.svn.sourceforge.net/assorted/?rev=729&view=rev
Author:   yangzhang
Date:     2008-05-08 12:06:11 -0700 (Thu, 08 May 2008)

Log Message:
-----------
added versions to dependencies

Modified Paths:
--------------
    mailing-list-filter/trunk/README

Modified: mailing-list-filter/trunk/README
===================================================================
--- mailing-list-filter/trunk/README	2008-05-08 19:05:50 UTC (rev 728)
+++ mailing-list-filter/trunk/README	2008-05-08 19:06:11 UTC (rev 729)
@@ -38,9 +38,9 @@
 
 Requirements:
 
-- [argparse](http://argparse.python-hosting.com/)
+- [argparse](http://argparse.python-hosting.com/) 0.8.0
 - [Python Commons](http://assorted.sf.net/python-commons/) 0.4
-- [path](http://www.jorendorff.com/articles/python/path/)
+- [path](http://www.jorendorff.com/articles/python/path/) 2.2
 
 Install the program using the standard `setup.py` program.
 