Bogofilter now understands how to parse mbox format files.
Please modify it so it can grok emacs RMAIL files.
These are much like mbox files, but messages are
separated by ^L linefeeds, and From lines are changed
to Mail-from: From lines.
I've been able to register my spam and ham files by
first running sed over my RMAIL files like this:
sed -e 's/^Mail-from: From/From/' RMAIL | bogofilter -M -n
But it would be really nice if I could skip the sed step.
On a related note, the man page needs clarification. It
says that bogofilter can handle many messages, and
breaks them on From_ lines. But then there is the -M
option, which apparently breaks them on "From " lines
(without the underscore or a colon). What's the
difference? When would the default (From_) behavior
be useful? When is the -M option useful instead?