Docx2txt is a Perl based command-line utility to convert (even corrupted) Microsoft docx documents to reasonably formatted text files, along with appropriate character conversions. Apart from Perl it also requires a command line unzipping program like unzip/7z/pkzipc/wzunzip.

Features

  • Consists of (core) Perl and (wrapper) Unix/Windows shell scripts and a configuration file, with provision for maintaining separate system-wide configuration file and individual user-level configuration files.
  • Perl script also works with input/output redirection, and is useful in viewing docx file content directly with editors like vim, emacs, and file browsers like mc (midnight commander).
  • Can recover text from damaged docx documents in many cases.
  • Short line justifications, showing hyperlink and many character conversions (missing in MS text conversion).
  • Handles (bullet, decimal, letter, roman) lists along with indentation.
  • Installation via Makefiles and Windows batch file. On non-Windows systems scripts and configuration file can be installed in separate directories.
  • Can conveniently be used to build a web based docx document conversion service.

Project Samples

Project Activity

See All Activity >

License

GNU General Public License version 2.0 (GPLv2)

Follow docx2txt

docx2txt Web Site

Other Useful Business Software
APIs for the next generation of business text messaging Icon
APIs for the next generation of business text messaging

For companies that need a reliable messaging API provider

Get your customers’ messages where they need to go with 99%+ deliverability. Telgorithm’s API automates A2P compliance & message management for faster, easier, & more reliable messaging, enabling you to offer the best service to your customers.
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
3
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5

User Reviews

Be the first to post a review of docx2txt!

Additional Project Details

Intended Audience

End Users/Desktop

User Interface

Command-line

Programming Language

Unix Shell, Perl

Related Categories

Unix Shell Word Processors, Unix Shell Office Suites, Unix Shell Data Recovery Software, Perl Word Processors, Perl Office Suites, Perl Data Recovery Software

Registered

2008-07-30