PDFspy

PDFspy

Apago
PyMuPDF

PyMuPDF

Artifex
+
+

Related Products

  • MobiPDF (formerly PDF Extra)
    5,539 Ratings
    Visit Website
  • Adobe PDF Library SDK
    35 Ratings
    Visit Website
  • PDFCreator
    494 Ratings
    Visit Website
  • MobiOffice (formerly OfficeSuite)
    12,109 Ratings
    Visit Website
  • RAD PDF
    3 Ratings
    Visit Website
  • Nutrient SDK
    94 Ratings
    Visit Website
  • Apryse PDF SDK
    133 Ratings
    Visit Website
  • Titan
    362 Ratings
    Visit Website
  • Jotform
    6,933 Ratings
    Visit Website
  • Highcharts
    119 Ratings
    Visit Website

About

PDFspy is the ultimate “get info” utility for your PDF documents. It can extract a comprehensive list of attributes from a PDF file into an XML-based format. Support for PDF 1.7/ISO 32000 (Acrobat 9, X, DC). Element now shows CMYK separations that are actually used by text and vector elements. The new element that shows the number of shading objects in a PDF file. A restored output being written to stdout if -o option not used, recommend using -quiet option when writing to stdout. Fixed calculation of page labels. An improved text extraction algorithm. Calculates color simulation values for ICCBased, separation and DeviceN colorspaces. Improved Unicode, ISO Latin, and AdobePDF character set support. Fonts usage (name, type, embedding & subset status, use of Unicode). Asset management system, extract page count, metadata, font & image information. Document management, determine text or image-only documents, and extract comments.

About

PyMuPDF is a high-performance, Python-centric library for reading, extracting, and manipulating PDFs with ease and precision. It enables developers to access text, images, fonts, annotations, metadata, and structural layout of PDF documents, and to perform tasks such as extracting content, editing objects, rendering pages, searching text, modifying page content, and manipulating PDF components like links and annotations. PyMuPDF also supports advanced operations like splitting, merging, inserting, or deleting pages; drawing and filling shapes; handling color spaces; and converting between formats. The library is lightweight but robust, optimized for speed and low memory overhead. On top of the base PyMuPDF, PyMuPDF Pro adds support for reading and writing Microsoft Office-format documents and enhanced functionality for integrating Large Language Model (LLM) pipelines and Retrieval Augmented Generation (RAG).

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Any user searching for a solution to extract a comprehensive list of attributes from a PDF file

Audience

Developers, engineers, or automation teams interested in a solution to extract, render, edit, or convert PDFs in Python-based or cross-platform workflows

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

$600 one-time payment
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Apago
Founded: 1991
United States
www.apagoinc.com/product/pdfspy/

Company Information

Artifex
Founded: 1993
United States
artifex.com/products#pymupdf

Alternatives

Alternatives

JPedal

JPedal

IDR Solutions
LightPDF

LightPDF

Wangxu Technology Co.,Ltd.
PDFBox

PDFBox

Apache Software Foundation
PDF Agile

PDF Agile

DocuAgile
PDF Agile

PDF Agile

DocuAgile
BuildVu

BuildVu

IDR Solutions
UPDF

UPDF

Superace Software Technology Co., Ltd.

Categories

PDF

Categories

PDF

Integrations

.NET
Adobe Acrobat
Amazon Web Services (AWS)
Hugging Face
JavaScript
LangChain
Llama
Make
Microsoft Excel
Microsoft Office 2024
Microsoft Word
Node.js
NuGet
Postscript
PowerPoint
Python
Zapier
pdf2docx

Integrations

.NET
Adobe Acrobat
Amazon Web Services (AWS)
Hugging Face
JavaScript
LangChain
Llama
Make
Microsoft Excel
Microsoft Office 2024
Microsoft Word
Node.js
NuGet
Postscript
PowerPoint
Python
Zapier
pdf2docx
Claim PDFspy and update features and information
Claim PDFspy and update features and information
Claim PyMuPDF and update features and information
Claim PyMuPDF and update features and information