This is a library for extracting raw Unicode text from any kind of written document (office formats such as PDF, Word, OpenOffice, and others). It should be useful to developers of search engines, text-processing tools, and corpus-analysis software.

License

GNU General Public License version 2.0 (GPLv2)

Universal text extractor Web Site

Additional Project Details

Operating Systems

Linux

Intended Audience

Science/Research, Developers

Programming Language

C

Related Categories

C Scientific Engineering

Registered

2005-04-08