TexLexAn is an open source text analyser for Linux, able to estimate the readability and reading time, to classify and summarize texts. It has some learning abilities and accepts html, doc, pdf, ppt, odt and txt documents. Written in C and Python.

Features

  • GUI and CLI to Analyze, classify and summarize document
  • Accept: text, html, odt, msdoc, ppt, ps
  • Analyze: syllables/word distr., readability, sentiments
  • Sentiment: Evaluate bipolar sentiments
  • Extract: keywords
  • Classify: linear classifier unigram...n-gram based
  • Summarize: extract relevant sentences and simplify them.
  • Learn: perceptron algorithm
  • Retrieve original docs by searching in archived summaries.
  • Classify & extract sentences from previous summaries
  • Detect: English, French, German, Italian, Spanish languages

Project Samples

Project Activity

See All Activity >

License

GNU General Public License version 2.0 (GPLv2)

Follow Text Analyzer Classifier Summarizer

Text Analyzer Classifier Summarizer Web Site

You Might Also Like
Real Time Accounts Payable Automation. Icon
Real Time Accounts Payable Automation.

Invoice capture and automation seamlessly integrated with your accounting software

Yooz provides the smartest, most powerful, and easiest-to-use cloud-based E-invoicing and Purchase-to-Pay automation solution. It delivers unmatched savings, speed, and security with affordable zero-risk subscriptions to more than 5,000 customers and 300,000 users worldwide.
Rate This Project
Login To Rate This Project

User Reviews

There are no 4 star reviews.

Additional Project Details

Operating Systems

Linux

Intended Audience

End Users/Desktop

User Interface

Gnome, KDE, Command-line

Programming Language

Python, C

Related Categories

Python Business Software, Python Machine Learning Software, C Business Software, C Machine Learning Software

Registered

2009-02-16