PDFspy Reviews in 2026

Audience

Any user searching for a solution to extract a comprehensive list of attributes from a PDF file

About PDFspy

PDFspy is the ultimate “get info” utility for your PDF documents. It can extract a comprehensive list of attributes from a PDF file into an XML-based format. Support for PDF 1.7/ISO 32000 (Acrobat 9, X, DC). Element now shows CMYK separations that are actually used by text and vector elements. The new element that shows the number of shading objects in a PDF file. A restored output being written to stdout if -o option not used, recommend using -quiet option when writing to stdout. Fixed calculation of page labels. An improved text extraction algorithm. Calculates color simulation values for ICCBased, separation and DeviceN colorspaces. Improved Unicode, ISO Latin, and AdobePDF character set support. Fonts usage (name, type, embedding & subset status, use of Unicode). Asset management system, extract page count, metadata, font & image information. Document management, determine text or image-only documents, and extract comments.

Other Popular Alternatives & Related Software

PDFBox

The Apache PDFBox® library is an open-source Java tool for working with PDF documents. This project allows the creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents. Apache PDFBox also includes several command-line utilities. Apache PDFBox is published under the Apache License v2.0. Extract Unicode text from PDF files. Split a single PDF into many files or merge multiple PDF files. Extract data from PDF forms or fill a PDF form. Validate PDF files against the PDF/A-1b standard. Print a PDF file using the standard Java printing API. Create a PDF from scratch, with embedded fonts and images. Save PDFs as image files, such as PNG or JPEG and digitally sign PDF files. See also the export control information related to the encryption features included in Apache PDFBox.

Learn more

PDF Constructor

Using an XML grammar incorporating features of XHTML, CSS, and SVG, PDF Constructor creates single or multiple-page PDF documents using existing or dynamically-created raster, vector, and text content. Build PDFs with content that is ready to go to print. Use CMYK and spot colors. Specify the bleed and trim. Use Type 1, TrueType, or OpenType fonts, always embedded and optionally subset. Produce web or screen-ready documents with bookmarks, hyperlinks, actions, and JavaScript. You can even build complete Acrobat Forms dynamically. Include JPEG and TIFF images in any colorspace and resolution. Apply your choice of transformations to ensure the image fits correctly into your layout. Include SVG drawings directly or by reference. Specify individual pages or entire PDF documents as new content or as a template on which to add new elements. Paragraph and character styles based on CSS2 can be specified for flowable content.

Learn more

Filestar

Do anything to any file. Tens of thousands of skills at your fingertips. Quickly convert files in a few clicks. Choose from over 30 000 file conversions. Both common and unusual file formats. Single files or in bulk. Easily merge one or many files at once. Combine files for many different file types. Merge documents, video, audio, Visio or other file formats. Split large files with many pages into several separate ones. For text file formats like .pdf, .doc and .txt. Divide files and documents into parts. Change or alter files. Rotate, add filters, replace file names, add watermarks, add text to images, and much more. One at a time or many at once. Simply compress or reduce the file size of your files. Wide selection of file compression formats and zip options to choose from. Smoothly extract selected pages or elements from a document. Collect images out of a file, or get all images or text from a document.

Learn more

Adobe PDF Services API

Create a PDF from Microsoft Office documents, protect the content, and convert to other formats. Programmatically alter a document, such as reordering, inserting, and rotating pages, as well as compressing the file. Access the same cloud-based APIs that power Adobe's end-user applications to quickly deliver scalable, secure solutions. Extract text, images, tables, and more from native and scanned PDFs into a structured JSON file. PDF Extract API leverages AI technology to accurately identify text objects and understand the natural reading order of different elements such as headings, lists, and paragraphs spanning multiple columns or pages. Extract font styles with identification of metadata such as bold and italic text and their position within your PDF. The extracted content is output in a structured JSON file format with tables in CSV or XLSX and images saved as PNG.

Learn more

Pricing

Starting Price:

$600 one-time payment

Integrations

See Integrations

Ratings/Reviews

Overall 0.0 / 5

ease 0.0 / 5

features 0.0 / 5

design 0.0 / 5

support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Videos and Screen Captures

Other Useful Business Software

Find Hidden Risks in Windows Task Scheduler

Free diagnostic script reveals configuration issues, error patterns, and security risks. Instant HTML report.

Windows Task Scheduler might be hiding critical failures. Download the free JAMS diagnostic tool to uncover problems before they impact production—get a color-coded risk report with clear remediation steps in minutes.

Download Free Tool

Product Details

Platforms Supported

Windows

Mac

Linux

Training

Documentation

In Person

Support

Phone Support

Online

Compare This Software

PDF Constructor

Using an XML grammar incorporating features of XHTML, CSS, and SVG, PDF Constructor creates single or multiple-page PDF documents using existing or dynamically-created raster, vector, and text content. Build PDFs with content that is ready to go to print. Use CMYK and spot colors. Specify the...

Compare
PDFBox

The Apache PDFBox® library is an open-source Java tool for working with PDF documents. This project allows the creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents. Apache PDFBox also includes several command-line utilities. Apache...

Compare
Filestar

Do anything to any file. Tens of thousands of skills at your fingertips. Quickly convert files in a few clicks. Choose from over 30 000 file conversions. Both common and unusual file formats. Single files or in bulk. Easily merge one or many files at once. Combine files for many different file...

Compare
pdf2docx

pdf2docx is a Python library that uses PyMuPDF to extract data from PDF files, parse their layouts according to rules, and generate corresponding .docx files via python-docx. It supports conversion of text, images, tables, and other structural elements; it includes tools to extract tables,...

Compare
Adobe PDF Services API

Create a PDF from Microsoft Office documents, protect the content, and convert to other formats. Programmatically alter a document, such as reordering, inserting, and rotating pages, as well as compressing the file. Access the same cloud-based APIs that power Adobe's end-user applications to...

Compare

Recommended Software

PDF Constructor

Using an XML grammar incorporating features of XHTML, CSS, and SVG, PDF Constructor creates single or multiple-page PDF documents using existing or dynamically-created raster, vector, and text content. Build PDFs with content that is ready to go to print. Use CMYK and spot colors. Specify the...

See Software
PDFBox

The Apache PDFBox® library is an open-source Java tool for working with PDF documents. This project allows the creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents. Apache PDFBox also includes several command-line utilities. Apache...

See Software
Filestar

Do anything to any file. Tens of thousands of skills at your fingertips. Quickly convert files in a few clicks. Choose from over 30 000 file conversions. Both common and unusual file formats. Single files or in bulk. Easily merge one or many files at once. Combine files for many different file...

See Software