TML - Text Mining Library for LSA

Download: tml-3.1.zip
Platforms: Windows, Mac, Linux

Description

TML is a Text Mining Library focused on LSA (Latent Semantic Analysis) and tightly integrated with Apache Lucene. It emphasizes ease of use for researchers and developers who want to integrate text mining capabilities into their applications.


Features

  • Document indexing and selection using Apache's Lucene
  • Fast VSM (vector space model) generation: builds the term-document matrix with a choice of several local and global weights
  • Dimensionality reduction using SVD or NMF, for LSA and related techniques
  • Meta-data annotators (Penn Treebank grammar parsing)
  • Operations: Document distances, topic clustering, keyword extraction, and many more!
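
To make the VSM and document-distance features above concrete, here is a minimal sketch in plain Java. It does not use TML's or Lucene's API: the class `VsmSketch` and all of its methods are hypothetical names made up for illustration. The weights are also an assumption, with log term frequency as the local weight and inverse document frequency as the global weight, since the page does not say which schemes TML provides. The SVD/NMF reduction step is left out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.TreeSet;

// Hypothetical illustration only; this is not TML's API.
public class VsmSketch {

    /** Lower-cases and splits on non-letters; a stand-in for a real Lucene analyzer. */
    public static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase().split("[^a-z]+")) {
            if (!t.isEmpty()) tokens.add(t);
        }
        return tokens;
    }

    /** Sorted vocabulary over all documents. */
    public static List<String> vocabulary(List<String> docs) {
        TreeSet<String> vocab = new TreeSet<>();
        for (String d : docs) vocab.addAll(tokenize(d));
        return new ArrayList<>(vocab);
    }

    /**
     * Builds a terms x docs matrix.
     * Local weight (assumed): log(1 + tf). Global weight (assumed): idf = log(N / df).
     */
    public static double[][] termDocMatrix(List<String> docs, List<String> vocab) {
        int n = docs.size();
        double[][] m = new double[vocab.size()][n];
        // Count raw term frequencies per document.
        for (int j = 0; j < n; j++) {
            for (String t : tokenize(docs.get(j))) {
                m[vocab.indexOf(t)][j] += 1.0;
            }
        }
        // Apply the local and global weights row by row.
        for (int i = 0; i < vocab.size(); i++) {
            int df = 0;
            for (int j = 0; j < n; j++) if (m[i][j] > 0) df++;
            double idf = Math.log((double) n / df);
            for (int j = 0; j < n; j++) m[i][j] = Math.log(1.0 + m[i][j]) * idf;
        }
        return m;
    }

    /** Cosine similarity between document columns a and b; 0 if either column is all zeros. */
    public static double cosine(double[][] m, int a, int b) {
        double dot = 0, na = 0, nb = 0;
        for (double[] row : m) {
            dot += row[a] * row[b];
            na += row[a] * row[a];
            nb += row[b] * row[b];
        }
        return (na == 0 || nb == 0) ? 0.0 : dot / Math.sqrt(na * nb);
    }

    public static void main(String[] args) {
        List<String> docs = Arrays.asList(
            "latent semantic analysis of text",
            "text mining with semantic analysis",
            "cooking pasta with tomato sauce");
        double[][] m = termDocMatrix(docs, vocabulary(docs));
        System.out.println("sim(0,1) = " + cosine(m, 0, 1));
        System.out.println("sim(0,2) = " + cosine(m, 0, 2));
    }
}
```

In this toy corpus the first two documents share several terms, so their similarity is positive, while the first and third share none. A full LSA pipeline would factor the weighted matrix (for example with SVD) and compare documents in the reduced space instead.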


User Reviews

  • Posted by Collin 2012-11-11

    Fast and simple.

  • Posted by thotegt 2010-12-28

    It seems to be good, but there are some errors that don't let the program load the library correctly (the Abstract Annotator constructor receives parameters, but PennTreeAnnotator doesn't).

  • Posted by Ming Liu 2009-11-30

    very good library for doing text mining

  • Posted by rafa 2009-11-27

    great

  • Posted by Elliot 2013-02-16

    Nice and simple.


Additional Project Details

Intended Audience

Developers, Science/Research

User Interface

Command-line

Programming Language

Java

Registered

2009-11-11
