Sensible
Sensible is an API-first document-processing platform designed to enable developers and product teams to convert unstructured documents into structured data with minimal overhead. It supports extraction from PDFs, images, emails, and spreadsheets using a combination of LLM-based parsing and visual layout-rule engines. With over 150 pre-configured document-type parsers for common business forms (bank statements, invoices, policy declarations, utility bills, EOBs), organizations can accelerate deployment, while custom configurations allow unique workflows. It offers classification of document types via a dedicated classify endpoint, automatically identifying the form type before extraction, reducing manual pre-routing of files. Integration is straightforward through REST APIs, Webhooks, and SDKs (JavaScript, Python), allowing ingestion of documents in development and production environments with versioning support.
Learn more
ClassiGenius
A smarter AI delivers outstanding accuracy for the most demanding OCR/IDP solutions.
ClassiGenius reads documents, classifies them, extracts field content, and creates searchable PDF files using our strong Intelligent Document Processing (IDP) capabilities such as OCR, AI, neural network, and other advanced technologies and concepts.
ClassiGenius is provided with pre-defined solutions like reading invoices, identification documents, creating searchable PDF files, and it allows users to create their own solutions for automatic page classification and field extraction.
It monitors folders, identifies incoming files, processes them, and exports the results. It does so efficiently with minimum set up time, thus reducing your costs.
Learn more
NuOCR
NuOCR is a high-performance optical character recognition system for enterprises that automates data extraction from paper, images or PDF files. After extraction, it enables the user to validate the content and save it to the database or download the content. NuOCR is an intelligent document processing software that converts unstructured information to structured digital data allowing enterprises to power up their CRM capabilities for enhanced customer experience. Manual data collation is a tedious task, in which one minor error can result in mismatching outputs affecting the quality of the data. The solution to this problem lies in an automated data capture system that collects information from any document and gets it right, every time. As an intelligent document processing software, NuOCR converts information on any document, an image file, a paper document, or a pdf document, into quickly accessible, searchable, and error-free digital data.
Learn more
Patrivox
Patrivox is a European cloud platform that transforms collections of PDF documents and scanned archives into a fully searchable, AI-powered knowledge base. It allows organizations to upload large numbers of documents, individually or in bulk, and automatically processes them using advanced optical character recognition and artificial intelligence to extract text and identify important entities such as people, places, and organizations mentioned in the documents. Once processed, the platform enriches documents with metadata and links them together in an interactive knowledge graph, revealing relationships between historical records that would otherwise remain hidden. Users can explore their archives through instant full-text search with typo tolerance, advanced filters such as date or document type, or by asking natural-language questions through an AI chat interface that returns answers with exact source citations.
Learn more