***********************************************************************
TOOL:        Java port of SVMTool++
AUTHOR(s):   Jesus Gimenez, Lluis Marquez, Senen Moya and Brian Bartell
DATE:        1/10/2016
VERSION:     1.1.4 
DESCRIPTION: A general POS tagger generator based on Support Vector Machines.
***********************************************************************

Contents
--------

README.txt       _ this file.
src              _ Java source files.
svmtool-java.jar _ JAR file of the compiled sources.
               
Description
-----------

SVMTool is a simple and effective part-of-speech tagger based on Support Vector Machines, written by Jesus Gimenez, Lluis Marquez, and Senen Moya in 2004. svmtool-java is a direct port of the C++ code (version 1.1.4) to Java.

This port does not include the lexicon data files. To use it, you must also download the original SVMTool distribution, which provides them.

Usage
-----

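  // `file` is a java.io.File; it presumably locates the SVMTool data files mentioned above.
  SVMTagger tagger = new SVMTagger( SVMUtil.carr( file.getCanonicalPath() ) ) ;
  tagger.execute( new Callback() {
    int i = 0 ;
    String sentence[] = { "The", "quick", "brown", "fox", "slept", "deeply" } ;
    // Supplies the next word to tag; null signals that the input is exhausted.
    public Object getNextWord() { return i >= sentence.length ? null : sentence[i++] ; }
    // Receives each word together with the tag assigned to it.
    public void wordTagged( Object appData, String word, String tag ) { System.out.println( word + "\t" + tag ) ; }
    // Receives a sentence that could not be tagged.
    public void sentenceUntagged( Object appData, String sentence ) { System.out.println( sentence + "\t" + "UNKNOWN" ) ; }
  } ) ;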
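
The callback in the snippet above can also be written as a reusable class that collects results instead of printing them. The following is only a sketch of that idea and compiles without svmtool-java: TagCallback is a stand-in interface that mirrors the three methods shown above (it is not the library's own Callback type), CollectingCallback and CallbackDemo are hypothetical names, and drive() is a dummy loop that assigns a placeholder tag "?" to each word. It assumes, as the snippet suggests, that words are pulled one at a time until getNextWord() returns null.

```java
import java.util.ArrayList;
import java.util.List;

/** Stand-in mirroring the three methods in the usage snippet; NOT the real svmtool-java Callback type. */
interface TagCallback {
    Object getNextWord();
    void wordTagged(Object appData, String word, String tag);
    void sentenceUntagged(Object appData, String sentence);
}

/** Feeds one tokenized sentence and collects "word<TAB>tag" lines instead of printing them. */
class CollectingCallback implements TagCallback {
    private final String[] words;
    private int next = 0;
    final List<String> lines = new ArrayList<>();

    CollectingCallback(String[] words) { this.words = words; }

    // Returns the next word, or null once the sentence is exhausted.
    public Object getNextWord() { return next >= words.length ? null : words[next++]; }

    public void wordTagged(Object appData, String word, String tag) { lines.add(word + "\t" + tag); }

    public void sentenceUntagged(Object appData, String sentence) { lines.add(sentence + "\tUNKNOWN"); }
}

class CallbackDemo {
    /** Dummy driver (hypothetical, no real tagging): pulls words until null and reports "?" for each. */
    static void drive(TagCallback cb) {
        Object w;
        while ((w = cb.getNextWord()) != null) {
            cb.wordTagged(null, (String) w, "?");
        }
    }

    public static void main(String[] args) {
        CollectingCallback cb = new CollectingCallback(new String[] { "The", "quick", "brown", "fox" });
        drive(cb);
        cb.lines.forEach(System.out::println);
    }
}
```

With the real library, an equivalent class would implement the library's Callback and be passed to tagger.execute() in place of the anonymous class; the collected lines can then be inspected after the call returns.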


CONTRIBUTING:
=============

The SVMTool library is licensed under the LGPL, which means that it may be linked to and used by commercial software packages. However, the license also requires that any changes or improvements made to the library (and, in this case, to the morphological data) be redistributed under LGPL terms.

Thus, if you improve the software or data, whether by adding new functionality, fixing bugs, or adding analyzers for new languages, you cannot distribute the result under conditions other than those stated in the license (i.e., freely and with no usage restrictions).


REFERENCES:
===========

Please reference this tool in your academic works citing the following paper:

* Jesús Giménez and Lluis Màrquez
  SVMTool: A general POS tagger generator based on Support Vector Machines.
  Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC'04).
  Lisbon, Portugal. 2004.
Source: README.txt, updated 2016-02-11