RE: [Classifier4j-devel] Bayesian Case Study
From: Nick L. <nl...@es...> - 2003-11-14 03:47:09
> > As a general point I'm not sure you are really going to find Bayesian
> > classification a great match for deciding what kind of a document
> > something is, simply because I don't think you can fairly compare the
> > scores documents get in various categories and say if a score is higher
> > in one than the other it is a better match.
> >
> > For instance, if you have two categories (say Tax and Investments), then
> > you can't say that the word "Tax" in a document means that it is not
> > about "Investments".
>
> If this is true, I would then ask you how and why POPFile is using a
> Bayesian algorithm to do exactly this? Have they deviated somehow from a
> true Bayesian calculation?

Hmm.. that is a fair point. I should really do some experimentation.

> The vector stuff sounds really cool too! Can you have that working by next
> week? :)

Yeah, if someone offers to pay :)
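
To make the question about comparing category scores concrete, here is a minimal sketch of multi-category naive Bayes, assuming equal priors and add-one smoothing. It is not taken from Classifier4J or POPFile; the function names `train` and `posteriors` and the toy Tax/Investments documents are made up for illustration. The per-category scores are normalised so they sum to 1, which is one common way to put them on a comparable scale.

```python
import math
from collections import Counter


def train(docs_by_category):
    """Count words per category.

    docs_by_category maps a category name to a list of texts (hypothetical
    input format chosen for this sketch).
    """
    counts = {
        cat: Counter(w for doc in docs for w in doc.lower().split())
        for cat, docs in docs_by_category.items()
    }
    vocab = set().union(*counts.values())
    return counts, vocab


def posteriors(text, counts, vocab):
    """Return P(category | text), assuming equal priors and add-one smoothing."""
    log_scores = {}
    for cat, c in counts.items():
        total = sum(c.values())
        # Sum of log-likelihoods; words never seen in training are skipped.
        log_scores[cat] = sum(
            math.log((c[w] + 1) / (total + len(vocab)))
            for w in text.lower().split()
            if w in vocab
        )
    # Normalise across categories so the scores sum to 1 and can be compared.
    top = max(log_scores.values())
    exp = {cat: math.exp(s - top) for cat, s in log_scores.items()}
    norm = sum(exp.values())
    return {cat: v / norm for cat, v in exp.items()}


# Toy training data, invented for this example.
counts, vocab = train({
    "Tax": ["income tax return deadline", "tax deduction for income"],
    "Investments": ["stock fund returns", "tax on investment fund income"],
})
result = posteriors("tax on fund income", counts, vocab)
```

With this toy data the text still comes out more likely under Investments than under Tax even though it contains the word "tax", because the other words outweigh it.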