pdf2docx

pdf2docx

Artifex
+
+

Related Products

  • PDFCreator
    535 Ratings
    Visit Website
  • MobiPDF (formerly PDF Extra)
    6,998 Ratings
    Visit Website
  • Nutrient SDK
    108 Ratings
    Visit Website
  • RAD PDF
    3 Ratings
    Visit Website
  • Apryse PDF SDK
    154 Ratings
    Visit Website
  • ONLYOFFICE Docs
    715 Ratings
    Visit Website
  • Expedience Software
    33 Ratings
    Visit Website
  • Jotform
    8,206 Ratings
    Visit Website
  • Microsoft 365
    19,939 Ratings
    Visit Website
  • Secure Eraser
    14 Ratings
    Visit Website

About

Convert, create, edit, OCR, compare, and sign PDFs. Customize the interface language and its appearance from light to dark themes for working with PDFs comfortably. Tailor your conversions by selecting a page, a paragraph, or even a single line for conversion. Custom PDF to Excel conversion to convert complex PDF table data to Microsoft Excel with pinpoint precision and a Smart Layout Detector for keeping table styles intact. Edit PDF text and pages. Annotate and redact PDF content. Sign PDF documents. Fill, edit and create PDF forms. Split documents into even parts. Convert scanned PDFs in English, French, Spanish, and German. Automate the batch PDF conversion process by queuing up a large volume of PDF files and even whole directories. Batch create PDF from a wide range of formats and merge all PDFs into one file. Create secure PDFs from blank pages or existing documents by adding passwords and file permissions. Able2Extract Professional: Your Swiss Army Knife for PDF files.

About

pdf2docx is a Python library that uses PyMuPDF to extract data from PDF files, parse their layouts according to rules, and generate corresponding .docx files via python-docx. It supports conversion of text, images, tables, and other structural elements; it includes tools to extract tables, handle formatting, and preserve layout as much as possible. It offers both a command-line interface and a graphical user interface. The internal architecture is modular; it includes packages for handling pages, layout, tables, images, shape paths, text spans/blocks, and other elements, enabling fine control over how PDF content is mapped into Word documents. Developers can use the API for batch conversions or integrate it into workflows; there's documentation on installation (from PyPI or source), usage, and technical details of layout-parsing, table extraction, and internal modules. The project is open source, hosted on GitHub, and made available under its license with no warranty.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Companies and individuals who need an affordable yet powerful, all-in-one PDF solution with perpetual licensing

Audience

Technical users seeking a solution to convert PDF documents into Word format programmatically while preserving layout, tables, images, and text structure

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

$149.95/one-time/user
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 4.0 / 5
ease 4.0 / 5
features 5.0 / 5
design 5.0 / 5
support 1.0 / 5

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Investintech.com
Founded: 2000
Canada
www.investintech.com

Company Information

Artifex
Founded: 1993
United States
pdf2docx.readthedocs.io/en/latest/

Alternatives

Alternatives

AnyParser

AnyParser

CambioML
Tungsten Power PDF

Tungsten Power PDF

Tungsten Automation
PDF.co

PDF.co

ByteScout
CoolNew PDF

CoolNew PDF

CoolNew Software
PDF Conversa

PDF Conversa

ASCOMP Software
Systweak PDF Editor

Systweak PDF Editor

Systweak Software

Categories

Categories

PDF

PDF Features

Annotations
Convert to PDF
Digital Signature
Encryption
Merge / Append
PDF Reader
Watermarking

PDF Editors Features

Access Controls / Permissions
Annotations
Commenting / Notes
Compare Side-by-Side
Customizable Branding
Delete Pages
Electronic Signature
Forms Management
Full Text Search
Merge / Append
Offline Access
Optical Character Recognition (OCR)
Rearrange Pages
Rotate Pages
Watermarking

Integrations

GitHub
Microsoft Word
PyMuPDF
PyPI
Python

Integrations

GitHub
Microsoft Word
PyMuPDF
PyPI
Python
Claim Able2Extract Professional and update features and information
Claim Able2Extract Professional and update features and information
Claim pdf2docx and update features and information
Claim pdf2docx and update features and information