PDFspyApago
|
PyMuPDFArtifex
|
|||||
Related Products
|
||||||
About
PDFspy is the ultimate “get info” utility for your PDF documents. It can extract a comprehensive list of attributes from a PDF file into an XML-based format. Support for PDF 1.7/ISO 32000 (Acrobat 9, X, DC). Element now shows CMYK separations that are actually used by text and vector elements. The new element that shows the number of shading objects in a PDF file. A restored output being written to stdout if -o option not used, recommend using -quiet option when writing to stdout. Fixed calculation of page labels. An improved text extraction algorithm. Calculates color simulation values for ICCBased, separation and DeviceN colorspaces. Improved Unicode, ISO Latin, and AdobePDF character set support. Fonts usage (name, type, embedding & subset status, use of Unicode). Asset management system, extract page count, metadata, font & image information. Document management, determine text or image-only documents, and extract comments.
|
About
PyMuPDF is a high-performance, Python-centric library for reading, extracting, and manipulating PDFs with ease and precision. It enables developers to access text, images, fonts, annotations, metadata, and structural layout of PDF documents, and to perform tasks such as extracting content, editing objects, rendering pages, searching text, modifying page content, and manipulating PDF components like links and annotations. PyMuPDF also supports advanced operations like splitting, merging, inserting, or deleting pages; drawing and filling shapes; handling color spaces; and converting between formats. The library is lightweight but robust, optimized for speed and low memory overhead. On top of the base PyMuPDF, PyMuPDF Pro adds support for reading and writing Microsoft Office-format documents and enhanced functionality for integrating Large Language Model (LLM) pipelines and Retrieval Augmented Generation (RAG).
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Any user searching for a solution to extract a comprehensive list of attributes from a PDF file
|
Audience
Developers, engineers, or automation teams interested in a solution to extract, render, edit, or convert PDFs in Python-based or cross-platform workflows
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
$600 one-time payment
Free Version
Free Trial
|
Pricing
No information available.
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationApago
Founded: 1991
United States
www.apagoinc.com/product/pdfspy/
|
Company InformationArtifex
Founded: 1993
United States
artifex.com/products#pymupdf
|
|||||
Alternatives |
Alternatives |
|||||
|
||||||
|
|
|||||
|
|
|||||
|
|
|||||
Categories |
Categories |
|||||
Integrations
.NET
Adobe Acrobat
Amazon Web Services (AWS)
Hugging Face
JavaScript
LangChain
Llama
Make
Microsoft Excel
Microsoft Office 2024
|
Integrations
.NET
Adobe Acrobat
Amazon Web Services (AWS)
Hugging Face
JavaScript
LangChain
Llama
Make
Microsoft Excel
Microsoft Office 2024
|
|||||
|
|