
Toolbox general behaviour

Binary Corpus

A binary corpus includes three types of data:

  • CQP data:
    • Data (with indexes)
    • Registry
  • TXM data:
    • one file per text
  • HTML data:
    • multi-page edition (the default)
    • one-page edition (BP version)

Workspace

  • Project
    • Base
      • T1
        • E1
        • E2
      • T2
      • T3
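
The tree above can be read as a nested data structure. The following is a minimal Python sketch, assuming that T1..T3 stand for texts and E1, E2 for editions of a text (the page does not spell these abbreviations out). The class names and the outline() helper are hypothetical and are not taken from the TXM code base.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Edition:
    name: str


@dataclass
class Text:
    name: str
    # Assumption: each text can carry zero or more editions (E1, E2, ...).
    editions: List[Edition] = field(default_factory=list)


@dataclass
class Base:
    name: str
    texts: List[Text] = field(default_factory=list)


@dataclass
class Project:
    name: str
    bases: List[Base] = field(default_factory=list)


def build_example_workspace() -> Project:
    """Build the same tree as in the outline: Project > Base > T1..T3."""
    t1 = Text("T1", [Edition("E1"), Edition("E2")])
    return Project("Project", [Base("Base", [t1, Text("T2"), Text("T3")])])


def outline(project: Project) -> List[str]:
    """Return the tree as indented lines, two spaces per level."""
    lines = [project.name]
    for base in project.bases:
        lines.append("  " + base.name)
        for text in base.texts:
            lines.append("    " + text.name)
            for edition in text.editions:
                lines.append("      " + edition.name)
    return lines
```

Calling outline(build_example_workspace()) gives back the same nesting as the list above, one string per node.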

Import timeline:

  • source data -> importer -> txm data (recursive step with annotator)
  • txm data -> compiler -> wtc data
  • txm data -> pager -> html data
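
The three steps above can be sketched as plain functions. This is only an illustration of the data flow, not the TXM implementation: the function names, the dictionary layout of the "txm data", the whitespace tokenizer, and the words_per_page parameter are all assumptions made for the example. The "recursive step with annotator" is modelled as a function from txm data to txm data that can be applied repeatedly.

```python
from html import escape


def importer(text_id, source):
    """source data -> txm data: one record per text, one dict per token."""
    # Assumption: tokens are obtained by splitting on whitespace.
    return {"text_id": text_id,
            "tokens": [{"word": w} for w in source.split()]}


def annotate(txm, annotator):
    """txm data -> txm data: may be repeated, once per annotator."""
    for tok in txm["tokens"]:
        tok.update(annotator(tok))
    return txm


def compiler(txm_texts):
    """txm data -> indexed data: a flat token list addressed by position."""
    corpus = []
    for txm in txm_texts:
        for tok in txm["tokens"]:
            corpus.append({"position": len(corpus),
                           "text_id": txm["text_id"],
                           **tok})
    return corpus


def pager(txm, words_per_page=3):
    """txm data -> html data: one HTML string per page of the edition."""
    words = [escape(t["word"]) for t in txm["tokens"]]
    return ["<p>" + " ".join(words[i:i + words_per_page]) + "</p>"
            for i in range(0, len(words), words_per_page)]


def lowercase_annotator(tok):
    """Hypothetical annotator: adds a lowercased form of the word."""
    return {"lower": tok["word"].lower()}
```

For example, annotate(importer("T1", "The cat sat down"), lowercase_annotator) yields txm data that can be passed both to compiler() and to pager(), which mirrors the two branches in the list above.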

Functionalities

Back to text

The corpus is a sequence of tokens, each with a position property. From the position, we get the word_id and the text_id.

Edition
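
Going back to the text means resolving a corpus position to a text_id and a word_id, presumably so that the word can then be located in an edition. A minimal sketch of such a lookup follows, assuming positions are assigned consecutively across texts in corpus order. The PositionIndex class and its locate() method are hypothetical and only illustrate the idea; the page does not describe how the lookup is actually implemented.

```python
from bisect import bisect_right


class PositionIndex:
    """Maps a corpus position to (text_id, word_id)."""

    def __init__(self, texts):
        # texts: list of (text_id, [word_id, ...]) in corpus order.
        self.text_ids = []
        self.starts = []     # first corpus position of each text
        self.word_ids = []   # word_id for every corpus position
        for text_id, word_ids in texts:
            self.text_ids.append(text_id)
            self.starts.append(len(self.word_ids))
            self.word_ids.extend(word_ids)

    def locate(self, position):
        if not 0 <= position < len(self.word_ids):
            raise IndexError("position %d is outside the corpus" % position)
        # The text containing `position` is the last one whose start
        # is less than or equal to it.
        text_index = bisect_right(self.starts, position) - 1
        return self.text_ids[text_index], self.word_ids[position]
```

With two texts of two and three words, position 2 is the first word of the second text, so locate(2) returns that text's id together with its first word_id.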
