contentCrawler
contentCrawler is an automated solution that ensures all documents in a repository are text-searchable and optimized for storage. Operating 24/7 without staff intervention, it uses Optical Character Recognition (OCR) to identify and convert image-based documents, such as scanned PDFs and graphic files, into searchable PDFs, enhancing productivity and compliance. Additionally, contentCrawler's compression module reduces file sizes, saving storage and migration costs without compromising document quality. The system supports various image types, including TIFF, BMP, GIF, EPS, JPG, and PNG, converting them into PDFs with an invisible text layer for improved search capabilities. Its dual processing modes handle both new and legacy documents simultaneously, ensuring comprehensive coverage across the entire document repository. Administrators can monitor OCR and compression progress in real-time through the administration console dashboard.
Learn more
FreeOCR
FreeOCR is a free Optical Character Recognition Software for Windows and supports scanning from most Twain scanners and can also open most scanned PDF's and multi-page Tiff images as well as popular image file formats. FreeOCR outputs plain text and can export directly to Microsoft Word format. Free OCR uses the latest Tesseract (v3.01) OCR engine. It includes a Windows installer and It is very simple to use and supports opening multi-page tiff documents, Adobe PDF, and fax documents as well as most image types including compressed Tiff's which the Tesseract engine on its own cannot read.It now can scan using Twain and WIA scanning drivers. FreeOCR V4 includes Tesseract V3 which increases accuracy and has page layout analysis so more accurate results can be achieved without using the zone selection tool. As well as OCR FreeOCR can scan and save images as JPG and we are currently working on a "Scan to PDF" capability with the option to save as searchable PDF.
Learn more
PaperStream
PaperStream Capture Pro is a powerful front-end capture software that transforms paper documents (or imported digital files) into clean, indexed, searchable digital data ready for document-management workflows. It supports batch scanning with any TWAIN-compatible scanner, whether a desktop model or an enterprise-grade device, and uses advanced image-processing via its integrated engine to automatically enhance scanned images, remove noise, correct skew/rotation/color issues, and improve clarity for better OCR and readability. It offers robust data-extraction capabilities; full-text OCR, zonal OCR, barcode and patch-code reading, and even optical-mark-recognition and handprint recognition for handwritten block text or checkboxes. It can extract many fields per document (for example, from forms, applications, or surveys), automatically separate documents in mixed batches (using blank pages, barcodes, patch codes, or form-template recognition), and assign metadata.
Learn more
FormKiQ
FormKiQ is a new way to manage documents in the cloud, using a powerful Open Source API paired with a dynamic ReactJS web client, both of which you can build on and extend. You can add FormKiQ to an existing application or product or install and run it as a full-featured electronic document management system on its own,
with as little or as much customization as you need.
NOTE: along with Pro and Enterprise versions, there is a free open-core version, FormKiQ Core, that provides the essential features of a document management system.
What makes FormKiQ stand out from other document management software is that it is highly flexible and customizable, due to being designed and built with API-First principles and using Amazon Web Services (AWS). This allows a level of customization and flexibility that is far beyond what other electronic document management systems can offer, and that's a good reason why tech-oriented companies across a wide range of industries are choosing FormKiQ.
Learn more