iText (Apryse)
pdf2docx (Artifex)

Related Products

  • Nutrient SDK
    110 Ratings
  • Apryse PDF SDK
    152 Ratings
  • Foxit Document Workflow APIs
    6 Ratings
  • MobiPDF (formerly PDF Extra)
    6,998 Ratings
  • MobiOffice
    14,758 Ratings
  • PackageX OCR Scanning
    48 Ratings
  • MyQ
    197 Ratings
  • RAD PDF
    3 Ratings
  • Square 9
    411 Ratings
  • PDFCreator
    539 Ratings

About iText

Now part of the Apryse family, iText is a well-documented, versatile PDF SDK. The open-source iText Core library features a layout engine and high-level APIs for document creation and manipulation, digital signing and validation, and more. It has built-in support for PDF 2.0, all variants of PDF/A and PDF/UA, FIPS 140-2, and the latest ISO standards for digital signatures and encryption. Add-ons extend iText with HTML/XML and CSS templating, support for global languages and writing systems, secure document redaction, OCR, document optimization, and dynamic XFA handling. iText Core is free to use under the AGPLv3 license; a commercial license releases you from the AGPL terms and adds professional support and maintenance. The entire iText Suite can be tried free for 30 days from the iText website, with your IP kept safe under iText's commercial license terms.

About pdf2docx

pdf2docx is a Python library that uses PyMuPDF to extract data from PDF files, parses the page layout with rule-based logic, and generates the corresponding .docx file via python-docx. It converts text, images, tables, and other structural elements, preserving formatting and layout as far as possible, and it can also extract tables on their own. Both a command-line interface and a graphical user interface are available. The internal architecture is modular, with packages for pages, layout, tables, images, shape paths, text spans and blocks, and other elements, which gives fine control over how PDF content is mapped into Word documents. Developers can use the API for batch conversions or integrate it into workflows. The documentation covers installation (from PyPI or from source), usage, and the technical details of layout parsing, table extraction, and the internal modules. The project is open source, hosted on GitHub, and provided under its license with no warranty.
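
As a rough illustration of the batch-conversion use mentioned above, the following minimal sketch pairs each PDF in a folder with a .docx target and converts them one by one. It assumes pdf2docx is installed from PyPI and uses its `Converter` class with `convert()` and `close()`. The helper names `plan_batch` and `convert_batch` and the folder names are hypothetical, chosen only for this example.

```python
from pathlib import Path


def plan_batch(pdf_dir, out_dir):
    """Pair every *.pdf in pdf_dir with a .docx target path in out_dir.

    Hypothetical helper for this sketch; it only builds paths and does
    not touch pdf2docx.
    """
    pdf_dir, out_dir = Path(pdf_dir), Path(out_dir)
    return [
        (pdf, out_dir / (pdf.stem + ".docx"))
        for pdf in sorted(pdf_dir.glob("*.pdf"))
    ]


def convert_batch(pdf_dir, out_dir):
    """Convert each planned pair and return the .docx paths written."""
    # Imported lazily so that planning works without pdf2docx installed.
    from pdf2docx import Converter

    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for pdf_path, docx_path in plan_batch(pdf_dir, out_dir):
        cv = Converter(str(pdf_path))
        try:
            cv.convert(str(docx_path))  # converts all pages by default
            written.append(docx_path)
        finally:
            cv.close()  # release the underlying PDF even on failure
    return written
```

Calling `convert_batch("pdfs", "docx_out")` would write one .docx per PDF found in `pdfs`. Conversion quality depends on the source layout, so spot-checking the output is advisable.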

Platforms Supported (iText and pdf2docx)

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience (iText)

Developers who want to integrate PDF functionality into Java and .NET (C#) applications and workflows

Audience (pdf2docx)

Technical users who need to convert PDF documents to Word format programmatically while preserving layout, tables, images, and text structure

Support (iText and pdf2docx)

Phone Support
24/7 Live Support
Online

API (iText and pdf2docx)

Offers API

Pricing (iText)

No pricing information available.
Free Version
Free Trial

Pricing (pdf2docx)

Free
Free Version
Free Trial

Reviews/Ratings (iText and pdf2docx)

Overall 0.0 / 5
Ease 0.0 / 5
Features 0.0 / 5
Design 0.0 / 5
Support 0.0 / 5

Neither product has been reviewed yet.

Training (iText and pdf2docx)

Documentation
Webinars
Live Online
In Person

Company Information (iText)

Apryse
United States
itextpdf.com

Company Information (pdf2docx)

Artifex
Founded: 1993
United States
pdf2docx.readthedocs.io/en/latest/

Alternatives

  • AnyParser (CambioML)
  • Nutrient SDK (Nutrient)
  • Docmosis (Docmosis Pty Ltd)
  • PDF.co (ByteScout)
  • PDFBox (Apache Software Foundation)
  • PDF Conversa (ASCOMP Software)
  • jPDFPreflight (Qoppa Software)

Categories

PDF

PDF Features

Annotations
Convert to PDF
Digital Signature
Encryption
Merge / Append
PDF Reader
Watermarking

OCR Features

Batch Processing
Convert to PDF
ID Scanning
Image Pre-processing
Indexing
Metadata Extraction
Multi-Language
Multiple Output Formats
Text Editor
Zone Selection Tool

Integrations

GitHub
.NET MAUI
AWS CodeBuild
AWS Key Management Service
Amazon API Gateway
Amazon EC2
Apache Maven
Appian
Azure Blob Storage
Google Cloud Storage
GraalVM
Jaspersoft
Jenkins
Microsoft Word
NuGet
PyMuPDF
Python
SAP NetWeaver
Spring Boot
Xodo Sign
