DataExtract is a program that scans files of many different types - text, PDF, Word, Excel etc, extracting all kinds of structured patterns, like email addresses and phone numbers, from them.
Features
- Reads Plain Text From Most Of The Major File Types - PDF, DOC, DOCX etc.
- Processes Extracted Text Looking For Specific Data Items Like Email Addresses.
- Define Your Own Text Patterns To Search For.
- Or Select From A Large Number Of Existing Library Patterns.
- Define Words Or Phrases Of Interest To Search For.
- Add Your Own Sets Of Data Items For Extraction.
- Screen Colours Configurable.
- Six Different Ways To See Extracted Data.
- Comprehensive Help.
- Extract Data From Single, Multiple Files or Whole Folder Structures.
License
Apache License V2.0Follow DataExtract
Other Useful Business Software
Orchestrate Your AI Agents with Zenflow
Zenflow orchestrates AI agents like a real engineering system. With parallel execution, spec-driven workflows, and deep multi-repo understanding, agents plan, implement, test, and verify end-to-end. Upgrade to AI workflows that work the way your team does.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of DataExtract!