[Classifier4j-devel] New Stop Words Provider
From: Matt C. <MCo...@my...> - 2003-11-15 18:34:59
Attached is an alternate stop words provider for Classifier4J. I simply copied the whole of DefaultStopWordsProvider.java and renamed it to AlphaStopWordsProvider.java. I am pretty sure this is not the correct way to do it, since there is a comment about overriding the getStopWords method, but I am not sure how to do that right now. I wanted to get this code out for review. Please advise.

The provider reads the stop list from the file "c:/stoplist/english.stop". You will need to download the stop list or create your own. There is a link on the wiki site to the stop list that Nick found: http://www.ishmaelswiki.org/wiki/index.php/TextClassification

There should be a single word on each line of your stop list file.

Matt Collier
RemoteIT
mco...@my...
877-4-NEW-LAN
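
For anyone reading along without the attachment, here is a minimal sketch of the file-reading part, assuming the one-word-per-line format described above. It is not the attached AlphaStopWordsProvider code and does not depend on Classifier4J. The class name FileStopWordList and its methods are hypothetical, and the UTF-8 encoding and case-insensitive matching are assumptions of mine rather than anything stated in the post.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper, not part of Classifier4J: holds a stop list
// that was supplied as one word per line.
final class FileStopWordList {
    private final Set<String> words = new HashSet<>();

    // Builds the list from already-read lines; blank lines are skipped.
    // Assumption: words are matched case-insensitively, so they are
    // stored in lower case.
    FileStopWordList(List<String> lines) {
        for (String line : lines) {
            String word = line.trim().toLowerCase(Locale.ROOT);
            if (!word.isEmpty()) {
                words.add(word);
            }
        }
    }

    // Reads the stop list from a file. Assumption: the file is UTF-8.
    static FileStopWordList load(Path file) throws IOException {
        return new FileStopWordList(Files.readAllLines(file, StandardCharsets.UTF_8));
    }

    // Returns true if the given word is on the stop list.
    boolean isStopWord(String word) {
        return word != null && words.contains(word.trim().toLowerCase(Locale.ROOT));
    }

    // Returns the stop words as a sorted array, the shape a
    // getStopWords-style override would presumably hand back.
    String[] getStopWords() {
        String[] result = words.toArray(new String[0]);
        Arrays.sort(result);
        return result;
    }
}
```

With something like this, FileStopWordList.load(Paths.get("c:/stoplist/english.stop")).getStopWords() would give an array that a subclass could return from an overridden getStopWords, if that is indeed how the default provider is meant to be extended. Taking the path as a parameter instead of hard-coding it would also let the same class work for stop lists in other locations.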