Features

  • Document indexing and selection using Apache's Lucene
  • Fast VSM generation with several local and global weights (term - doc matrix)
  • Dimensionality reduction using SVD or NMF for LSA or related.
  • Meta-data annotators (PennTree grammar parsing).
  • Operations: Document distances, topic clustering, keyword extraction, and many more!

Project Activity

See All Activity >

License

Apache License V2.0

Follow TML - Text Mining Library for LSA & CMM

TML - Text Mining Library for LSA & CMM Web Site

Other Useful Business Software
Forever Free Full-Stack Observability | Grafana Cloud Icon
Forever Free Full-Stack Observability | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Create free account
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
3
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5

User Reviews

  • It seems to be good, but there are some errors that dont let the program load correctly the library ( Abstract Annotator constructor receives parameters but PennTreeAnnotator doesnt receive)
  • very good library for doing text mining
  • great
Read more reviews >

Additional Project Details

Intended Audience

Developers, Science/Research

User Interface

Command-line

Programming Language

Java

Database Environment

MySQL

Related Categories

Java Artificial Intelligence Software, Java Linguistics Software, Java Research Software

Registered

2009-11-11