From: Tony G. <to...@of...> - 2007-05-25 04:27:01
Good idea. Run the test and report back on the improvement, if any, in accuracy.

On 5/24/07, martin f krafft <ma...@ma...> wrote:
> Hi,
>
> a colleague showed me his way of invoking crm114, which is
>
>   |formail -kxFrom: -xSubject: | crm …
>
> so effectively he deletes all headers but From and Subject.
>
> This got me thinking: do headers contain valuable information for
> crm114? There is a lot of redundant or constant information in
> there, and spam these days comes via random routes and from random
> senders with random subjects anyway.
>
> Would it not make sense to simply crop all headers other than
> Subject and then to train and classify based on the subject and body
> only?
>
> Cheers,
>
> --
> martin; (greetings from the heart of the sun.)
>   \____ echo mailto: !#^."<*>"|tr "<*> mailto:" net@madduck
>
> spamtraps: mad...@ma...
>
> http://www.vcnet.com/bms/
>
> _______________________________________________
> Crm114-discuss mailing list
> Crm...@li...
> https://lists.sourceforge.net/lists/listinfo/crm114-discuss

--
Tony Godshall
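
For anyone who wants to run the comparison, the header cropping can be sketched in a few lines of Python. This is only a rough stand-in for the quoted formail pipeline, not a replacement for it: the function name `keep_headers` and its `keep` parameter are made up for illustration, and the sketch keeps the field names in its output, which may differ from what the formail invocation above emits. The result would then be piped to crm for both training and classification.

```python
from email import message_from_string


def keep_headers(raw, keep=("From", "Subject")):
    """Return `raw` with only the headers named in `keep`, followed by the body.

    Hypothetical helper for illustration; header names match case-insensitively.
    """
    wanted = {name.lower() for name in keep}
    # The header block ends at the first blank line; the rest is the body,
    # which is passed through untouched.
    head, _, body = raw.partition("\n\n")
    msg = message_from_string(head + "\n\n")
    kept = [
        f"{name}: {value}"
        for name, value in msg.items()
        if name.lower() in wanted
    ]
    return "\n".join(kept) + "\n\n" + body


if __name__ == "__main__":
    sample = (
        "Received: from relay.example.org\n"
        "From: someone@example.org\n"
        "Subject: test message\n"
        "X-Mailer: example\n"
        "\n"
        "Body text.\n"
    )
    print(keep_headers(sample), end="")
```

Calling `keep_headers(raw, keep=("Subject",))` gives the subject-and-body-only variant proposed in the quoted message, so the two setups can be trained and scored on the same corpus.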