+
+

Related Products

  • Foxit Document Workflow APIs
    6 Ratings
    Visit Website
  • MobiPDF (formerly PDF Extra)
    6,998 Ratings
    Visit Website
  • Nutrient SDK
    110 Ratings
    Visit Website
  • RAD PDF
    3 Ratings
    Visit Website
  • Docmosis
    51 Ratings
    Visit Website
  • LM-Kit.NET
    29 Ratings
    Visit Website
  • MindCloud
    71 Ratings
    Visit Website
  • Crowdin
    907 Ratings
    Visit Website
  • PackageX OCR Scanning
    48 Ratings
    Visit Website
  • Okyline
    2 Ratings
    Visit Website

About

Create a PDF from Microsoft Office documents, protect the content, and convert to other formats. Programmatically alter a document, such as reordering, inserting, and rotating pages, as well as compressing the file. Access the same cloud-based APIs that power Adobe's end-user applications to quickly deliver scalable, secure solutions. Extract text, images, tables, and more from native and scanned PDFs into a structured JSON file. PDF Extract API leverages AI technology to accurately identify text objects and understand the natural reading order of different elements such as headings, lists, and paragraphs spanning multiple columns or pages. Extract font styles with identification of metadata such as bold and italic text and their position within your PDF. The extracted content is output in a structured JSON file format with tables in CSV or XLSX and images saved as PNG.

About

Docling is an easy-to-use, self-contained, MIT-licensed open source toolkit for converting messy documents into structured data and simplifying downstream document and AI processing. It can parse many popular document formats into a unified and richly structured Docling Document, including PDF, DOCX, PPTX, XLSX, HTML, Markdown, AsciiDoc, CSV, images, audio, and scanned pages through an OCR engine of the user’s choice. Docling detects tables, formulas, reading order, chunks, bounding boxes, page headers and footers, pictures, captions, code, list items, paragraphs, cells, and document structure, making extracted content easier to process, search, and ingest into AI, RAG, and agentic systems. It can export parsed documents to JSON, text, Markdown, HTML, and Doctags, giving developers flexible outputs for pipelines and applications. Docling stores and traverses components according to reading order, partitions documents into bite-sized contiguous text chunks.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Businesses searching for a PDF solution that helps create, convert, transform, OCR PDFs and more

Audience

AI engineers, data teams, and developers building RAG or document-intelligence systems who need an open-source toolkit to convert complex documents into structured, searchable, AI-ready data

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Adobe
Founded: 1982
United States
developer.adobe.com/document-services/apis/pdf-services/

Company Information

Docling
United States
www.docling.ai/

Alternatives

Alternatives

PaddleOCR

PaddleOCR

PaddlePaddle
Pdftools

Pdftools

PDF Tools
LlamaParse

LlamaParse

LlamaIndex
PDF.co

PDF.co

ByteScout
Mistral OCR 3

Mistral OCR 3

Mistral AI

Categories

Categories

Integrations

HTML
JSON
Microsoft Excel
Python
.NET
Amazon
EximiousSoft ePage Creator
Google Sheets
Ivo
Microsoft 365
Microsoft Power Automate
Microsoft PowerPoint
Microsoft Word
Model Context Protocol (MCP)
Node.js
Postman
QCommission
Torvalds
UiPath

Integrations

HTML
JSON
Microsoft Excel
Python
.NET
Amazon
EximiousSoft ePage Creator
Google Sheets
Ivo
Microsoft 365
Microsoft Power Automate
Microsoft PowerPoint
Microsoft Word
Model Context Protocol (MCP)
Node.js
Postman
QCommission
Torvalds
UiPath
Claim Adobe PDF Services API and update features and information
Claim Adobe PDF Services API and update features and information
Claim Docling and update features and information
Claim Docling and update features and information