RE: [Classifier4j-devel] Bayesian Case Study
From: Matt C. <MCo...@my...> - 2003-11-14 03:04:22
> As a general point I'm not sure you are really going to find Bayesian
> classification a great match for deciding what kind of a document something
> is, simply because I don't think you can fairly compare the scores documents
> get in various categories and say if a score is higher in one than the other
> it is a better match.
>
> For instance, if you have two categories (say Tax and Investments), then you
> can't say that the word "Tax" in a document means that it is not about
> "Investments".

If this is true, I would then ask you how and why POPFile is using a Bayesian
algorithm to do exactly this? Have they deviated somehow from a true Bayesian
calculation?

The vector stuff sounds really cool too! Can you have that working by next
week? :)

Matt
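
To make the question concrete, here is a minimal sketch of the kind of multi-category comparison under discussion: each category gets a naive Bayes log score for the document, and the highest score wins. This is not POPFile's actual code and not the Classifier4J API. The class `NaiveBayesSketch` and its methods are hypothetical names, and the sketch assumes equal category priors, add-one smoothing, and a crude word tokenizer.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; not POPFile's or Classifier4J's implementation.
public class NaiveBayesSketch {
    // category -> (word -> number of times seen in training text)
    private final Map<String, Map<String, Integer>> counts = new HashMap<>();
    // category -> total number of training words
    private final Map<String, Integer> totals = new HashMap<>();
    // every distinct word seen in any category
    private final Set<String> vocabulary = new HashSet<>();

    public void train(String category, String text) {
        Map<String, Integer> wordCounts =
                counts.computeIfAbsent(category, k -> new HashMap<>());
        for (String word : tokenize(text)) {
            wordCounts.merge(word, 1, Integer::sum);
            totals.merge(category, 1, Integer::sum);
            vocabulary.add(word);
        }
    }

    // Sum of log P(word | category) with add-one smoothing.
    // Assumes equal priors for all categories, so the prior term is omitted.
    // Assumes train() has been called at least once.
    public double logScore(String category, String text) {
        Map<String, Integer> wordCounts =
                counts.getOrDefault(category, new HashMap<>());
        int total = totals.getOrDefault(category, 0);
        double score = 0.0;
        for (String word : tokenize(text)) {
            int seen = wordCounts.getOrDefault(word, 0);
            score += Math.log((seen + 1.0) / (total + vocabulary.size()));
        }
        return score;
    }

    // Picks the category with the highest log score, or null if untrained.
    public String classify(String text) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String category : counts.keySet()) {
            double score = logScore(category, text);
            if (score > bestScore) {
                bestScore = score;
                best = category;
            }
        }
        return best;
    }

    // Lowercases and splits on non-word characters, dropping empty tokens.
    private static List<String> tokenize(String text) {
        List<String> words = new ArrayList<>();
        for (String token : text.toLowerCase().split("\\W+")) {
            if (!token.isEmpty()) {
                words.add(token);
            }
        }
        return words;
    }
}
```

In this sketch the word "tax" can appear in the training text of both categories. Its presence only shifts the relative scores and does not by itself rule out "Investments". Whether scores computed this way are fairly comparable across categories is exactly the point in dispute above.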