Hi all,

I have read this interesting blog post by Tom Morton on faster model training with the averaged perceptron compared to the default MaxEnt implementation in OpenNLP:

http://www.annotation.org/index.php/blog/25-nlp/50-average-perceptron-algorithm-better-than-average

If I am not mistaken, "MaxEnt" is the NLP community's name for what the statistical learning community calls "Logistic Regression". Furthermore, there exists a very scalable implementation of regularized Logistic Regression and Linear Support Vector Classifiers that has been ported to Java under a BSD license here:

http://www.bwaldvogel.de/liblinear-java/
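
To spell out why I think MaxEnt and (multinomial) Logistic Regression coincide, here is a rough sketch. The notation (f_i, lambda_i, w_c) is my own and is not taken from the OpenNLP or liblinear sources, and I am assuming features that pair one input attribute x_j with one class c. The training algorithm and the regularization may still differ between the two libraries.

```latex
% Conditional MaxEnt model with feature functions f_i and weights \lambda_i:
p(y \mid x) = \frac{\exp\left(\sum_i \lambda_i f_i(x, y)\right)}
                   {\sum_{y'} \exp\left(\sum_i \lambda_i f_i(x, y')\right)}

% Assumption: each feature pairs one input attribute x_j with one class c,
%   f_{j,c}(x, y) = x_j \, \mathbb{1}[y = c], \qquad \lambda_{j,c} = w_{c,j}
% so the inner sum collapses to a per-class linear score:
\sum_{j,c'} \lambda_{j,c'} f_{j,c'}(x, c) = \sum_j w_{c,j} x_j = w_c^\top x

% Substituting gives the softmax form of multinomial logistic regression:
p(y = c \mid x) = \frac{\exp\left(w_c^\top x\right)}
                       {\sum_{c'} \exp\left(w_{c'}^\top x\right)}
```

Under that assumption the two models define the same family of distributions, so a regularized Logistic Regression solver should be able to fit the same kind of model.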

Maybe liblinear would be a good alternative to the MaxEnt and averaged perceptron learners in OpenNLP? Is this part pluggable enough in the current source base? It does not seem to be the case in the latest stable release.