The IMS Open Corpus Workbench is a collection of tools for managing and querying large text corpora (100 M words and more) with linguistic annotations. Its central component is the flexible and efficient query processor CQP, which can be used interactively in a terminal session, as a backend e.g. from a Perl script, or through the Web-based GUI CQPweb.
Features
- Index corpora into a compact and swiftly-searchable format (with Unicode support!)
- Search corpora efficiently using the super-fast Corpus Query Processor (CQP)
- Queries can contain regular expressions on individual words or annotations, AND across sequences of words
- Support for indexing and querying of within-text XML elements and attribute values
- Plus CQPweb: a user-friendly online interface with lots of additional features, especially suitable for teaching and for non-specialists
Follow IMS Open Corpus Workbench
Other Useful Business Software
Stop Storing Third-Party Tokens in Your Database
Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of IMS Open Corpus Workbench!