DataExtract is a program that scans files of many different types - text, PDF, Word, Excel etc, extracting all kinds of structured patterns, like email addresses and phone numbers, from them.

Features

  • Reads Plain Text From Most Of The Major File Types - PDF, DOC, DOCX etc.
  • Processes Extracted Text Looking For Specific Data Items Like Email Addresses.
  • Define Your Own Text Patterns To Search For.
  • Or Select From A Large Number Of Existing Library Patterns.
  • Define Words Or Phrases Of Interest To Search For.
  • Add Your Own Sets Of Data Items For Extraction.
  • Screen Colours Configurable.
  • Six Different Ways To See Extracted Data.
  • Comprehensive Help.
  • Extract Data From Single, Multiple Files or Whole Folder Structures.

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow DataExtract

DataExtract Web Site

Other Useful Business Software
Full-stack observability with actually useful AI | Grafana Cloud Icon
Full-stack observability with actually useful AI | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Create free account
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of DataExtract!

Additional Project Details

Operating Systems

Windows

Registered

2025-01-12