DataExtract is a program that scans files of many different types - text, PDF, Word, Excel etc, extracting all kinds of structured patterns, like email addresses and phone numbers, from them.
Features
- Reads Plain Text From Most Of The Major File Types - PDF, DOC, DOCX etc.
- Processes Extracted Text Looking For Specific Data Items Like Email Addresses.
- Define Your Own Text Patterns To Search For.
- Or Select From A Large Number Of Existing Library Patterns.
- Define Words Or Phrases Of Interest To Search For.
- Add Your Own Sets Of Data Items For Extraction.
- Screen Colours Configurable.
- Six Different Ways To See Extracted Data.
- Comprehensive Help.
- Extract Data From Single, Multiple Files or Whole Folder Structures.
License
Apache License V2.0Follow DataExtract
Other Useful Business Software
Cut Cloud Costs with Google Compute Engine
Save on compute costs with Compute Engine. Reduce your batch jobs and workload bill 60-91% with Spot VMs. Compute Engine's committed use offers customers up to 70% savings through sustained use discounts. Plus, you get one free e2-micro VM monthly and $300 credit to start.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of DataExtract!