Docx2txt is a Perl-based command-line tool that converts Microsoft docx documents to (ASCII) text files. It preserves some formatting and document information that Microsoft's own text conversion drops, and applies appropriate character conversions.
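As a rough illustration of the kind of conversion described above, here is a minimal Python sketch. It is not docx2txt's own Perl code and does not reproduce its formatting or character handling. It relies only on the fact that a docx file is a zip archive whose main body is stored in word/document.xml. The function name docx_to_text is hypothetical, and the sketch assumes an intact archive, so it will not recover corrupt files.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by elements in word/document.xml.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_to_text(source):
    """Return the plain text of a docx file, one line per paragraph.

    `source` is a path or a binary file-like object.
    Hypothetical helper for illustration; not part of docx2txt.
    """
    with zipfile.ZipFile(source) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    lines = []
    # Each <w:p> is a paragraph; its text lives in <w:t> runs.
    for para in root.iter(f"{{{W_NS}}}p"):
        parts = []
        for node in para.iter():
            if node.tag == f"{{{W_NS}}}t":
                parts.append(node.text or "")
            elif node.tag == f"{{{W_NS}}}tab":
                parts.append("\t")   # keep tab stops as tab characters
            elif node.tag == f"{{{W_NS}}}br":
                parts.append("\n")   # keep manual line breaks
        lines.append("".join(parts))
    return "\n".join(lines)


if __name__ == "__main__":
    import sys

    if len(sys.argv) == 2:
        print(docx_to_text(sys.argv[1]))
```

Saved under a name of your choosing, the script takes the path to a docx file as its only argument and prints the extracted text to standard output.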
This is an excellent extractor of text from docx files. If you use CakeCMD or No-Frills Command Unzipper to unzip the docx files, it will even extract text from corrupt docx files. It works well in a CGI script that provides a text-extraction web service, even for corrupt docx files. See my instance at saveofficedata.com.
Quite a handy tool for viewing a docx document's content.
Copyright © 2009 Geeknet, Inc. All rights reserved.