[Classifier4j-devel] Project Improvements
Status: Beta
Brought to you by:
nicklothian
From: Matt C. <MCo...@my...> - 2003-11-12 23:29:06
|
Alright... now that we're over THAT hump. There was talk in the mailing list archive regarding the addition of a number of features. Can someone bring me up to speed on where we're at in the following areas? Bayesian tokenizer. It was reported that the tokenizer improperly handles a number of strings including possessive pronouns and others. Anybody working on this? HTML togenizer for Bayesian system. Idea was to be able to "ignore" xml in a classification string. This happens to be required for my current project. I've either got to remove HTML from my source documents or get C4J to ignore it. Connection pooling. What ARE we going to do about connection pooling. Documentation. We need some. I would like to help with this. How do we do it? What framework are we using for documentation. Matt Collier RemoteIT mco...@my... 877-4-NEW-LAN |