Docx2txt is a Perl-based command-line tool that converts Microsoft docx documents to (ASCII) text files. It preserves some formatting and document information that MS text conversion drops, and applies appropriate character conversions.


http://docx2txt.sourceforge.net





Features:

  • Consists of (core) Perl and (wrapper) Unix/Windows shell scripts and a configuration file
  • Can recover text from damaged docx documents in many cases
  • Justification of short lines and display of hyperlinks (features missing in MS text conversion)
  • Focus is on a good (ASCII) text experience
  • Installation via Makefiles and Windows batch file
  • Can conveniently be used to build a web-based docx document conversion service
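
To illustrate the general idea behind docx-to-text conversion, here is a minimal Python sketch. It is not docx2txt's own code (which is Perl) and covers far less; it only relies on the fact that a docx file is a ZIP archive whose main body is stored as XML in word/document.xml. The function name docx_to_text is hypothetical, and the handling of tabs and line breaks is a simplifying assumption.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by elements in word/document.xml.
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_to_text(source):
    """Return plain text from a .docx given a path or a binary file object.

    Hypothetical helper for illustration; not part of docx2txt.
    """
    with zipfile.ZipFile(source) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    lines = []
    for para in root.iter(W + "p"):          # one output line per paragraph
        parts = []
        for run in para.iter(W + "r"):       # runs, including those inside hyperlinks
            for node in run:                 # direct children of the run only
                if node.tag == W + "t":      # literal text
                    parts.append(node.text or "")
                elif node.tag == W + "tab":  # tab character inside a run
                    parts.append("\t")
                elif node.tag == W + "br":   # explicit line break
                    parts.append("\n")
        lines.append("".join(parts))
    return "\n".join(lines)
```

Because the function accepts a file object as well as a path, a web service could pass an uploaded file to it directly, for example wrapped in io.BytesIO. Recovering text from damaged archives, justifying short lines, and showing hyperlink targets, as listed above, are beyond this sketch.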

Release Date:

2009-10-04

Registered:

2008-07-30

Ratings and Reviews

  • Thumbs up: 4
  • Thumbs down: 1

80% of 5 users recommend this project

  • Thumbs up

    This is an excellent extractor of text from docx files. If you use CakeCMD or No-Frills Command Unzipper to unzip the docx files, it will even extract text from corrupt docx files. This works well in a CGI script providing a web service that extracts text even from corrupt docx files. See my instance at saveofficedata.com.

    posted by socrtwo22 104 days ago
  • Thumbs up

    Quite a handy tool for viewing a docx document's content.

    posted by anonymous 125 days ago