The Textract Project consists of C++ source code to extract text from a growing assortment of file formats. Output is indexing-ready. The Textract Project is intended as a foundation to support research-quality search engines.

Project Activity

See All Activity >

Categories

HTML/XHTML

License

GNU General Public License version 2.0 (GPLv2)

Follow Textract

Textract Web Site

Other Useful Business Software
AI-powered service management for IT and enterprise teams Icon
AI-powered service management for IT and enterprise teams

Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
Try it Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Textract!

Additional Project Details

Operating Systems

Windows

Intended Audience

Developers

Programming Language

C++

Database Environment

Flat-file

Related Categories

C++ HTML XHTML

Registered

2008-11-13