Name  Modified  Size
txm-filter-corpusakkadien-xmlw_mots_effaces.xsl  2015-02-23  3.6 kB
txm-filter-corpusakkadien-xmlw_syllabes.xsl  2015-02-23  4.7 kB
README.markdown  2014-03-10  4.1 kB
txm-filter-teibvh-xmlw.xsl  2014-03-10  31.6 kB
txm-filter-teibvh-xmlw-posttok.xsl  2014-03-10  19.5 kB
txm-filter-teiperseus-xmlw.xsl  2014-02-27  2.4 kB
txm-filter-perseustreebank-xmlw.xsl  2014-02-27  2.8 kB
txm-filter-rnc-xmlw.xsl  2014-02-27  1.2 kB
txm-filter-qgraal_cm-xmlw.xsl  2014-02-17  9.3 kB
txm-edition-xmltxm-textgrid.xsl  2014-02-03  14.0 kB
txm-filter-teitextgrid-xmlw-posttok.xsl  2014-02-03  5.4 kB
txm-edition-page-split.xsl  2014-02-03  6.8 kB
txm-filter-teicorpustextgrid-xmlw.xsl  2014-02-03  4.2 kB
txm-filter-teip5-xmlw-simplify.xsl  2013-12-31  8.1 kB
txm-filter-teip5-xmlw-preserve.xsl  2013-12-31  4.1 kB
txm-filter-teip5-teibfm.xsl  2013-12-31  14.0 kB
txm-filter-teifrantext-xmlw.xsl  2013-12-31  3.8 kB
txm-filter-teifrantext-teibfm.xsl  2013-12-31  13.9 kB
filter-keep-only-select.xsl  2013-12-31  3.3 kB
filter-out-sp.xsl  2013-12-07  1.8 kB
filter-out-p.xsl  2013-12-07  1.8 kB
txm-filter-teibrown-xmlw.xsl  2013-07-10  5.0 kB
p4top5_perseus.xsl  2012-12-03  25.9 kB
Total: 23 files, 191.3 kB

TXM XSLT IMPORT FILTERS LIBRARY

This is a collection of XSLT (1.0 or 2.0) stylesheets that can be used to prepare various types of XML documents for import into TXM. Use the "Front XSLT" option in the import parameters interface to select the appropriate filter.

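Filters are usually named according to the following pattern: txm-filter-[input format]-[import module](-[option])?.xsl. For example, txm-filter-teip5-xmlw-simplify.xsl takes TEI P5 input, targets the XML/w+CSV import module, and applies the "simplify" option.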

Basic stylesheets for filtering XML sources

filter-keep-only-select.xsl

This stylesheet may be customized to filter out all the text and tags except the content of the specified element (select by default) and its ancestors.
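The behaviour described above can be sketched in XSLT 1.0 as follows. This is an illustrative sketch only, not the content of the distributed file; it assumes the element to keep is called select and is in no namespace, and that ancestors keep their attributes but lose their text.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <!-- The kept element: copy it with all of its content.
       The explicit priority makes this rule win even when
       a select element contains another select element. -->
  <xsl:template match="select" priority="1">
    <xsl:copy-of select="."/>
  </xsl:template>

  <!-- Ancestors of a kept element: keep the tag and its attributes,
       then process child elements only, so their text is dropped. -->
  <xsl:template match="*[descendant::select]">
    <xsl:copy>
      <xsl:copy-of select="@*"/>
      <xsl:apply-templates select="*"/>
    </xsl:copy>
  </xsl:template>

  <!-- Any other element is dropped together with its content. -->
  <xsl:template match="*"/>
</xsl:stylesheet>
```

To keep a different element, both occurrences of select in the match patterns would be replaced by that element's name (with a namespace prefix if the source uses one).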

filter-out-p.xsl

This stylesheet may be customized to filter out any particular XML element (p by default) and its content from the source document.
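A common way to write this kind of filter is an identity transform plus one empty template. The sketch below shows that general technique under the assumption that p is in no namespace; the distributed file may be organised differently.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <!-- Identity template: copy every node and attribute unchanged. -->
  <xsl:template match="@*|node()">
    <xsl:copy>
      <xsl:apply-templates select="@*|node()"/>
    </xsl:copy>
  </xsl:template>

  <!-- Empty template: a matched p element produces no output,
       so it disappears together with its content. -->
  <xsl:template match="p"/>
</xsl:stylesheet>
```

Changing the match pattern of the empty template is enough to remove a different element.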

filter-out-sp.xsl

This stylesheet may be customized to filter out any particular XML element with a specific attribute value (by default, sp elements whose who attribute has the value 'enqueteur') and its content from the source document.
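Selecting by attribute value only requires a predicate in the match pattern. The following minimal sketch uses the default values named above and assumes that sp is in no namespace; it is not taken from the distributed file.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <!-- Identity template: copy every node and attribute unchanged. -->
  <xsl:template match="@*|node()">
    <xsl:copy>
      <xsl:apply-templates select="@*|node()"/>
    </xsl:copy>
  </xsl:template>

  <!-- Remove only the sp elements whose who attribute equals 'enqueteur';
       sp elements with any other who value are copied as usual. -->
  <xsl:template match="sp[@who='enqueteur']"/>
</xsl:stylesheet>
```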

Basic stylesheets for adapting XML TEI P5 sources

txm-filter-teip5-teibfm.xsl

This stylesheet may be customized for use with any TEI P5 document in the TEI BFM import module. Note that this module is experimental and may fail on documents that do not follow the BFM encoding guidelines.

txm-filter-teip5-xmlw-preserve.xsl

This stylesheet may be customized for use with any TEI P5 document in the XML/w+CSV import module. By default, it eliminates teiHeader and facsimile elements and their contents and preserves all other elements.
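The default behaviour can be approximated by an identity transform that skips the two elements. The sketch below is an assumption about how such a filter might look, not the distributed code; the tei prefix is bound to the standard TEI P5 namespace.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0">

  <!-- Identity template: preserve every other node and attribute. -->
  <xsl:template match="@*|node()">
    <xsl:copy>
      <xsl:apply-templates select="@*|node()"/>
    </xsl:copy>
  </xsl:template>

  <!-- teiHeader and facsimile are removed with their contents. -->
  <xsl:template match="tei:teiHeader | tei:facsimile"/>
</xsl:stylesheet>
```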

txm-filter-teip5-xmlw-simplify.xsl

This stylesheet may be customized for use with any TEI P5 document in the XML/w+CSV import module. By default, it eliminates teiHeader, facsimile and all note elements and their contents, and filters out all tags in the text body except ab, body, div, front, lb, p, pb, s, TEI, text and w.
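The difference between removing an element with its content and filtering out only its tag can be sketched as three template rules. This is a hedged illustration of the default behaviour described above; whether the distributed file keeps the attributes of the preserved elements is an assumption.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0">

  <!-- Removed together with their content. -->
  <xsl:template match="tei:teiHeader | tei:facsimile | tei:note"/>

  <!-- Preserved tags: copy the element and its attributes,
       then process its children. -->
  <xsl:template match="tei:ab | tei:body | tei:div | tei:front | tei:lb
                       | tei:p | tei:pb | tei:s | tei:TEI | tei:text | tei:w">
    <xsl:copy>
      <xsl:copy-of select="@*"/>
      <xsl:apply-templates/>
    </xsl:copy>
  </xsl:template>

  <!-- Every other element: the tag is dropped, but its children are
       still processed, so the text inside it stays in the output. -->
  <xsl:template match="*">
    <xsl:apply-templates/>
  </xsl:template>
</xsl:stylesheet>
```

Text nodes are copied by the XSLT built-in template rule, which is why no explicit rule for text is needed here.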

Additional stylesheets for particular corpora

p4top5_perseus.xsl

This stylesheet is needed to convert Perseus TEI P4 files to TEI P5 prior to any import process.
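To give an idea of what such a conversion involves, the sketch below covers only three generic differences between TEI P4 and TEI P5: the root element TEI.2 becomes TEI, all elements move into the TEI P5 namespace, and the id and lang attributes become xml:id and xml:lang. It is a deliberately incomplete illustration, not an extract from p4top5_perseus.xsl, which has to handle many more cases.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns="http://www.tei-c.org/ns/1.0">

  <!-- The P4 root element TEI.2 becomes the P5 root element TEI. -->
  <xsl:template match="TEI.2">
    <TEI>
      <xsl:apply-templates select="@*|node()"/>
    </TEI>
  </xsl:template>

  <!-- Every other element is recreated with the same local name
       in the TEI P5 namespace. -->
  <xsl:template match="*">
    <xsl:element name="{local-name()}"
                 namespace="http://www.tei-c.org/ns/1.0">
      <xsl:apply-templates select="@*|node()"/>
    </xsl:element>
  </xsl:template>

  <!-- P4 id and lang attributes become xml:id and xml:lang. -->
  <xsl:template match="@id">
    <xsl:attribute name="xml:id"><xsl:value-of select="."/></xsl:attribute>
  </xsl:template>
  <xsl:template match="@lang">
    <xsl:attribute name="xml:lang"><xsl:value-of select="."/></xsl:attribute>
  </xsl:template>

  <!-- Other attributes, comments and processing instructions
       are copied as they are. -->
  <xsl:template match="@*|comment()|processing-instruction()">
    <xsl:copy/>
  </xsl:template>
</xsl:stylesheet>
```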

txm-edition-page-split.xsl

This stylesheet should be used to create separate HTML pages for TXM editions.

txm-edition-xmltxm-textgrid.xsl

This stylesheet should be used to customize TXM editions of DARIAH-DE Textgrid texts.

txm-filter-perseustreebank-xmlw.xsl

This filter should be used on the Perseus Treebank corpus texts with the XML/w+CSV import module.

txm-filter-qgraal_cm-xmlw.xsl

This stylesheet should be used on the diffracted format of the Queste del Saint Graal source files with the XML/w+CSV import module.

txm-filter-rnc-xmlw.xsl

This filter should be used on the Russian National Corpus texts with the XML/w+CSV import module.

txm-filter-teibrown-xmlw.xsl

This filter should be used on the TEI Brown corpus texts with the XML/w+CSV import module.

txm-filter-teibvh-xmlw.xsl

This filter should be used on the TEI BVH texts with the XML/w+CSV import module.

txm-filter-teibvh-xmlw-posttok.xsl

This stylesheet should be used to fix tokenization errors and to adjust word properties in the tokenized version of TEI BVH texts.

txm-filter-teicorpustextgrid-xmlw.xsl

This stylesheet should be used to prepare DARIAH-DE teiCorpus XML files for the TXM XML/w+CSV import process.

txm-filter-teifrantext-teibfm.xsl

This filter should be used on TEI Frantext texts with the TEI BFM import module. It is automatically applied in the TEI Frantext import module. Note that this module is experimental and may fail on documents that do not follow BFM encoding guidelines.

txm-filter-teifrantext-xmlw.xsl

This stylesheet should be used on TEI Frantext texts with the XML/w+CSV import module.

txm-filter-teiperseus-xmlw.xsl

This filter should be used on the TEI Perseus corpus texts with the XML/w+CSV import module (after conversion to TEI P5).

txm-filter-teitextgrid-xmlw-posttok.xsl

This stylesheet should be used to adjust word properties in the tokenized version of DARIAH-DE Textgrid texts.

Please address any enquiries about the TXM XSLT library to textometrie@ens-lyon.fr

Source: README.markdown, updated 2014-03-10