Notes: Besides the usual minor bugfixes (thanks!) there are two big new features in this revision: 1) We now test against ( and ship with ) TRE version 0.6.8 . Better, faster, all that. :) 2) A fourth new classifier with very impressive statistics is now available. This is the OSB-Winnow classifier, originally designed by Christian Siefkes. It combines the OSB frontend with a balanced Winnow backend. But it may well be twice as accurate as SBPH Markovian and four times more accurate than Bayesian. Like correlative matching, it does NOT produce a direct probability, but it does produce a pR, and it's integrated into the CLASSIFY statement. You invoke it with the winnow flag:<br> <br> classify < winnow > (file1.cow | file2.cow) /token_regex/<br> and<br> learn < winnow > (file1.cow) /token_regex/<br> learn < winnow refute > (file2.cow) /token_regex/<br> <br> Note that you MUST do two learns on a Winnow .cow files- one "positive" one on the correct class, and a "refute" learn on the incorrect class (actually, it's more complicated than that and I'm still working out the details.) Being experimental, the OSB-Winnow file format is NOT compatible with Markovian, OSB, nor correlator matching, and there's no great functional checking mechanism to verify you haven't mixed up a .cow file with a .css file. Cssutil, cssdiff, and cssmerge think they can handle the new format- but they can't. You can (and should) microgroom with winnow , both on the forward and refute passes.<br> <br> checksums for the tinfoil-coiffed:<br> <br> 0e72f3ade3b0d88b51f71a2fbaf7c55a crm114-20040627-BlameSeifkes.i386.tar.gz<br> 637471a68c29bab77373f9a24c310984 crm114-20040627-BlameSeifkes.src.tar.gz
Copyright © 2010 Geeknet, Inc. All rights reserved. Terms of Use