Features

  • Document indexing and selection using Apache's Lucene
  • Fast VSM generation with several local and global weights (term - doc matrix)
  • Dimensionality reduction using SVD or NMF for LSA or related.
  • Meta-data annotators (PennTree grammar parsing).
  • Operations: Document distances, topic clustering, keyword extraction, and many more!

Project Activity

See All Activity >

License

Apache License V2.0

Follow TML - Text Mining Library for LSA & CMM

TML - Text Mining Library for LSA & CMM Web Site

Other Useful Business Software
Find Hidden Risks in Windows Task Scheduler Icon
Find Hidden Risks in Windows Task Scheduler

Free diagnostic script reveals configuration issues, error patterns, and security risks. Instant HTML report.

Windows Task Scheduler might be hiding critical failures. Download the free JAMS diagnostic tool to uncover problems before they impact production—get a color-coded risk report with clear remediation steps in minutes.
Download Free Tool
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
3
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5

User Reviews

  • It seems to be good, but there are some errors that dont let the program load correctly the library ( Abstract Annotator constructor receives parameters but PennTreeAnnotator doesnt receive)
  • very good library for doing text mining
  • great
Read more reviews >

Additional Project Details

Intended Audience

Developers, Science/Research

User Interface

Command-line

Programming Language

Java

Database Environment

MySQL

Related Categories

Java Artificial Intelligence Software, Java Linguistics Software, Java Research Software

Registered

2009-11-11