Parsr is an open-source document parsing tool that converts PDFs, scanned images, and other structured documents into structured, machine-readable data formats.

Features

  • Extracts text, tables, and metadata from PDFs
  • Converts documents into structured JSON output
  • Supports OCR for scanned documents
  • Configurable pipeline for text cleaning and processing
  • Handles multi-column layouts and complex structures
  • Open-source with REST API support

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow Parsr

Parsr Web Site

Other Useful Business Software
$300 in Free Credit Towards Top Cloud Services Icon
$300 in Free Credit Towards Top Cloud Services

Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
Get Started
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Parsr!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

JavaScript

Related Categories

JavaScript Natural Language Processing (NLP) Tool

Registered

2025-01-21