The data structure for multimodal data
Dominate is a Python library for creating and manipulating HTML docs
PDF Indexing Script: Searches PDF for words, records page numbers
Query data on the command line with SQL-like SELECTs powered by Python
postprocessing tool for Project Gutenberg Distributed Proofreaders
A tool to look into file contents