data integration free download

PDFCraft

PDFCraft is a free, privacy-focused PDF toolkit

...But beyond manual editing, it also offers a programmable layer so developers can write scripts to batch process documents, generate templated reports, or extract structured data from PDFs for integration in workflows. The design emphasizes quality and compatibility: output PDFs render accurately across readers, preserve metadata, and support interactive elements like hyperlinks and form fields.

Downloads: 40 This Week

Last Update: 21 hours ago

See Project

py-pdf-parser

A Python tool to help extracting information from structured PDFs

py-pdf-parser is a Python tool designed to help extract information from structured PDFs. It provides a simple interface to define parsing rules and extract data from PDF documents.

Downloads: 1 This Week

Last Update: 2025-04-28

See Project

Nano PDF Editor

Edit PDF files with Nano Banana

Nano PDF Editor is a minimalist, portable PDF viewer and toolkit that focuses on simplicity, speed, and ease of integration for applications that need basic PDF rendering without heavy dependencies. It provides core functionality such as page navigation, zooming, text selection, and rendering directly to native graphics surfaces, making it suitable for lightweight PDF viewing scenarios on desktop or embedded platforms. Designed to be easily embedded into larger software projects, Nano-PDF...

Downloads: 25 This Week

Last Update: 2026-02-05

See Project

Unredact

A simple tool for reading in poorly redacted documents

Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and...

Downloads: 23 This Week

Last Update: 2026-02-03

See Project

OCRBase

MD/.JSON Document OCR and structured data extraction API

OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput. The core output is designed for downstream automation, producing structured...

Downloads: 0 This Week

Last Update: 2026-04-16

See Project

Vanilla.PDF

Cross-platform SDK for creating and modifying PDF documents

Vanilla.PDF is a modern, high-performance, open-source C++17 SDK designed for creating, editing, signing, and analyzing PDF documents across multiple platforms. It requires no external runtime dependencies, making it lightweight and ideal for embedding into desktop applications, servers, or automation pipelines. The SDK offers full cross-platform support including Windows, Linux, macOS, and Android, with builds available for major compilers and architectures. Vanilla.PDF supports advanced...

Downloads: 2 This Week

Last Update: 2026-03-17

See Project

Snappy PDF

A ServiceProvider for Snappy

Laravel Snappy is a Laravel wrapper around the Snappy PDF/Image library, which itself is powered by wkhtmltopdf and wkhtmltoimage, allowing you to generate PDFs and images directly from HTML. It lets you take a Blade view, raw HTML string, or file and turn it into a downloadable, savable, or in-browser PDF/image response with just a few lines of code. The package integrates cleanly with the Laravel service container and offers a simple facade/API so you can quickly configure page size,...

Downloads: 0 This Week

Last Update: 2026-02-21

See Project

pdf-bot

A Node queue API for generating PDFs using headless Chrome

pdf-bot is a Node.js microservice designed to automate the generation of PDF documents from web pages using headless Chrome. The project provides a queue-based API that allows developers to submit URLs for PDF generation, which are then processed asynchronously by the service. Once a document is generated, the system can notify external applications through webhooks, enabling integration with other backend systems or automation pipelines. The service is particularly useful for generating...

Downloads: 0 This Week

Last Update: 2026-03-15

See Project

Search Results for "data integration"

Showing 8 open source projects for "data integration"

PDFCraft

py-pdf-parser

Nano PDF Editor

Unredact

OCRBase

Vanilla.PDF

Snappy PDF

pdf-bot

Search Results for "data integration"

Showing 8 open source projects for "data integration"

PDFCraft

py-pdf-parser

Nano PDF Editor

Unredact

OCRBase

Vanilla.PDF

Snappy PDF

pdf-bot

Related Searches

Related Categories