TXM is a free and open-source cross-platform Unicode & XML based text/corpus analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in.

DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en

TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP full text search engine (http://cwb.sourceforge.net) and a range of statistical functions (factorial analysis, classification, cooccurrency analysis, etc.) based on R packages (http://www.r-project.org).

Read the scientific background at the Textométrie project web site http://textometrie.ens-lyon.fr/?lang=en.

Read a full description at the TEI Tools wiki http://wiki.tei-c.org/index.php/TXM.

Features

  • Provides qualitative analysis tools : concordancer of lexical patterns based on word & structure level queries, rich HTML based text editions navigation, patterns occurrences layout display
  • Provides quantitative analysis tools : factorial correspondance analysis, constrative word specificities, hierarchical classification, cooccurrents of patterns
  • Works on any collection of Unicode encoded documents of various formats: texts collections (TXT, XML, XML-TEI P5), recordings transcriptions (XML-Transcriber), aligned corpora (XML-TMX), press articles (XML-PPS Factiva, Europress) and more.
  • Applies various NLP tools on the fly on texts before analysis (e.g. TreeTagger for lemmatization and pos tagging)
  • Allows to build various subcorpora and partitions (for constrative analysis between text structures or groups of words)
  • Exports any result in CSV, XML or SVG format
  • Script drivable for repetitive tasks automation or platform extension (in Groovy/Java)
  • Includes a text editor to edit data sources, results and scripts
  • Runs as standalone Windows, Mac OS X or Linux application
  • Runs also as portal web application to access and analyze corpora online through a web browser (with access control management)
  • Open source: based on the best open source components for text analysis: CQP, R and Java & XSLT libraries
  • Modular architecture (Eclipse RCP OSGi and J2EE conformant): one toolbox connecting all core components is used by all the applications
  • Efficient Eclipse or Netbeans powered development framework

Project Samples

Project Activity

See All Activity >

License

GNU General Public License version 3.0 (GPLv3)

Follow TXM

TXM Web Site

You Might Also Like
Claims Processing solution for healthcare practitioners. Icon
Claims Processing solution for healthcare practitioners.

Very easy to use for medical, dental and therapy offices.

Speedy Claims became the top CMS-1500 Software by providing the best customer service imaginable to our thousands of clients all over America. Medical billing isn't the kind of thing most people get excited about - it is just a tedious task you have to do. But while it will never be a fun task, it doesn't have to be as difficult or time consumimg as it is now. With Speedy Claims CMS-1500 software you can get the job done quickly and easily, allowing you to focus on the things you love about your job, like helping patients. With a simple interface, powerful features to eliminate repetitive work, and unrivaled customer support, it's simply the best HCFA 1500 software available on the market. A powerful built-in error checking helps ensure your HCFA 1500 form is complete and correctly filled out, preventing CMS-1500 claims from being denied.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of TXM!

Additional Project Details

Operating Systems

MinGW/MSYS2, Linux, Mac

Languages

French, English, Russian

Intended Audience

Science/Research, Advanced End Users, Developers, End Users/Desktop

User Interface

Java SWT, Web-based, Console/Terminal, Eclipse

Programming Language

C, Groovy, Java, S/R

Database Environment

Other API

Related Categories

C Information Analysis Software, C Linguistics Software, C Statistics Software, C Natural Language Processing (NLP) Tool, Groovy Information Analysis Software, Groovy Linguistics Software, Groovy Statistics Software, Groovy Natural Language Processing (NLP) Tool, Java Information Analysis Software, Java Linguistics Software, Java Statistics Software, Java Natural Language Processing (NLP) Tool, S/R Information Analysis Software, S/R Linguistics Software, S/R Statistics Software, S/R Natural Language Processing (NLP) Tool

Registered

2008-12-04