Docling

Docling is an open-source document processing toolkit built to prepare diverse content types for modern generative AI and data workflows. The project focuses on converting and parsing many document formats into a unified structured representation that downstream systems can easily consume. It supports advanced PDF understanding, including layout detection, table extraction, and reading order analysis, enabling high-fidelity document intelligence pipelines. Docling is designed to run efficiently on commodity hardware and can be used both as a Python API and a command-line tool. Its modular architecture allows developers to extend functionality and integrate specialized models for tasks such as OCR and audio transcription. Overall, Docling serves as a comprehensive preprocessing layer for AI applications that require reliable, structured access to complex document data.

Features

Multi-format document parsing
Advanced PDF layout understanding
Unified structured document representation
CLI and Python API interfaces
OCR and speech recognition support
Integrations with AI frameworks

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow Docling

Docling Web Site

Other Useful Business Software

Gemini 3 and 200+ AI Models on One Platform

Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free

Rate This Project

User Reviews

Be the first to post a review of Docling!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Artificial Intelligence Software

Registered

2026-03-02

Similar Business Software

Picsart Enterprise

AI-Powered Image & Video Editing for Seamless Integration. Enhance your visual content workflows with Picsart Creative APIs, a robust suite of AI-driven tools for developers, product owners, and entrepreneurs. Easily integrate advanced image and video processing capabilities into your...

See Software
Process Street

Process Street is an AI-powered compliance operations platform that automates complex workflows, enforces standards, and tracks audit data in real time. Teams use it to create structured SOPs, assign tasks, collect data, and monitor execution with intelligent oversight. From onboarding and...

See Software
Pipefy

Pipefy is the AI-driven Business Orchestration and Automation Technologies (BOAT) platform that delivers enterprise results in days, not months. Designed as a secure orchestration layer, Pipefy bridges the gap between rigid legacy systems (ERPs/CRMs) and agile business needs. It allows IT...

See Software
Crowdin

Crowdin, a localization management software powered by AI, facilitates the localization of diverse content such as websites, mobile apps, games, desktop and web applications, help centers, blogs, and email campaigns. With a repertoire of over 600 add-ons and integrations, the platform...

See Software
Windsurf Editor

The Windsurf Editor is a free AI-powered IDE and AI coding assistant that accelerates development by providing intelligent code generation and agents in over 70 programming languages and more than 40 IDEs, including VSCode, JetBrains, and Jupyter Notebooks. With Windsurf, developers can write...

See Software
Square 9

Square 9 removes the frustration of extracting data from documents, forms, and all external sources, so you can harness the full power of your information. Release your team from repetitive tasks while your work flows freely in areas like Accounts Payable, Order Processing, Customer and...

See Software