TXM is a free and open-source cross-platform Unicode & XML based text/corpus analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in.

DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en

TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP full text search engine (http://cwb.sourceforge.net) and a range of statistical functions (factorial analysis, classification, cooccurrency analysis, etc.) based on R packages (http://www.r-project.org).

Read the scientific background at the Textométrie project web site http://textometrie.ens-lyon.fr/?lang=en.

Read a full description at the TEI Tools wiki http://wiki.tei-c.org/index.php/TXM.

Features

  • Provides qualitative analysis tools : concordancer of lexical patterns based on word & structure level queries, rich HTML based text editions navigation, patterns occurrences layout display
  • Provides quantitative analysis tools : factorial correspondance analysis, constrative word specificities, hierarchical classification, cooccurrents of patterns
  • Works on any collection of Unicode encoded documents of various formats: texts collections (TXT, XML, XML-TEI P5), recordings transcriptions (XML-Transcriber), aligned corpora (XML-TMX), press articles (XML-PPS Factiva, Europress) and more.
  • Applies various NLP tools on the fly on texts before analysis (e.g. TreeTagger for lemmatization and pos tagging)
  • Allows to build various subcorpora and partitions (for constrative analysis between text structures or groups of words)
  • Exports any result in CSV, XML or SVG format
  • Script drivable for repetitive tasks automation or platform extension (in Groovy/Java)
  • Includes a text editor to edit data sources, results and scripts
  • Runs as standalone Windows, Mac OS X or Linux application
  • Runs also as portal web application to access and analyze corpora online through a web browser (with access control management)
  • Open source: based on the best open source components for text analysis: CQP, R and Java & XSLT libraries
  • Modular architecture (Eclipse RCP OSGi and J2EE conformant): one toolbox connecting all core components is used by all the applications
  • Efficient Eclipse or Netbeans powered development framework

Project Samples

Project Activity

See All Activity >

License

GNU General Public License version 3.0 (GPLv3)

Follow TXM

TXM Web Site

Other Useful Business Software
Spreadsheets are hard. Nostra is easy. Icon
Spreadsheets are hard. Nostra is easy.

A single tool that advances the performance of your professional services business through data and AI.

Save administrative costs with simple time tracking and approvals. Understand with precision how your employees are actually spending their time relative to plan. Gain insights on the performance of your company so you can be more strategic on growing your business. Integrate with your existing CRM, or leverage Nostra's to gain insight on your profits and how your sales pipeline is putting demands on your resources. Make only the hires you have to. Gain early insight your sales pipeline and being intune with all inflight projects, Nostra will guide you on exactly when, what and who to hire for. Track milestones and time entry so you know what you can invoice for and when and get paid on time. With approval workflows and integrations with GL systems, you will not leak any revenue.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of TXM!

Additional Project Details

Operating Systems

MinGW/MSYS2, Linux, Mac

Languages

French, English, Russian

Intended Audience

Science/Research, Advanced End Users, Developers, End Users/Desktop

User Interface

Java SWT, Web-based, Console/Terminal, Eclipse

Programming Language

C, Groovy, Java, S/R

Database Environment

Other API

Related Categories

C Information Analysis Software, C Linguistics Software, C Statistics Software, C Natural Language Processing (NLP) Tool, Groovy Information Analysis Software, Groovy Linguistics Software, Groovy Statistics Software, Groovy Natural Language Processing (NLP) Tool, Java Information Analysis Software, Java Linguistics Software, Java Statistics Software, Java Natural Language Processing (NLP) Tool, S/R Information Analysis Software, S/R Linguistics Software, S/R Statistics Software, S/R Natural Language Processing (NLP) Tool

Registered

2008-12-04