Welcome, Guest! Log In | Create Account

eXtensible Text Framework (XTF)

The eXtensible Text Framework (XTF) is a flexible indexing, display and query tool that supports searching across collections of heterogeneous data and presents results in a highly configurable manner. XTF is an open-source project of the eScholarship Publishing Group of the California Digital Library, and is deployed in academic settings worldwide. Highlights of the XTF system are described in an online brochure PDF?] .

Downloads and Documentation

  • Download XTF: XTF can be downloaded from SourceForge.net site.
  • Documentation:
    1. Change log: List of the bug fixes and new features for each release.
    2. Deployment guide: Installation and basic configuration.
    3. Programming guide: In-depth technical overview of XTF and how to program it.
    4. Tag reference: Complete reference for all XTF elements and attributes.
    5. Tips & Tricks: Hints to help you get results quickly with XTF.
    6. Under the hood: Details about the actual operation of XTF and how it performs tasks.
    7. Experimental features: Information on features being developed for XTF.
    8. Resources: Presentations, papers, and tutorials on XTF.

Technical Overview

The system is divided into four components:

  1. crossQuery: The front-end to the collection search system.
  2. dynaXML: Interface to individual documents.
  3. Text Engine: Used by crossQuery and dynaXML to perform text searches.
  4. Indexer: Full-text indexer based on Lucene.

The following diagrams give a general overview of how documents are indexed, stored, queried, retrieved, and displayed using XTF (somewhat outdated).

  • System architecture diagram: A general illustration showing the roles the XTF components play in the user experience. GIF?]
  • Collection searching diagram: A more detailed view of the collection searching process, covering query parsing and results formatting. GIF?]
  • Individual object display diagram: A more detailed view of the object display and internal search mechanisms, covering request parsing, authentication, and document formatting. GIF?]
  • Text indexing diagram: An illustration of the workflow for the creation of collection indexes. GIF?]

Who uses XTF?#WhoUsesXTF?

The CDL uses XTF as a building block for new services and has used it to replace a number of systems previously used for text searching (i.e., DLXS, Greenstone, DynaWeb). As of 2008, CDL has deployed XTF in the following ways:

XTF is also extensively used outside the CDL:

Other institutions exploring XTF include: University of Sydney; OhioLink; University of Texas at Austin; University of Virginia; University of Denver; and University of Kansas Digital Initiatives.

Support

Implementers

While CDL does not directly support XTF implementers, we do make a good-faith effort to address the needs of the XTF community through the following resources on SourceForge:

  • The xtf-user email list for those trying to set-up and use XTF. It is monitored by the principal developers of the application.
  • The Bug Tracker is the place to submit bug reports.
  • The Support Request Tracker and Feature Request Tracker are alternative ways to bring to our attention documentation errors and potential new features, respectively. However, the xtf-user list is probably a better way to start.

Developers

SourceForge resources for XTF developers and others who are interested in contributing to the architecture

  • The xtf-devel email list is where developers share their ideas. It also logs all CVS commits.
  • The Patch Tracker allows developers to submit XTF patches for our approval.
  • Access to the CVS repository is also available