Docx2txt is a Perl based command-line tool to convert Microsoft docx documents to (ASCII) text files, preserving some formatting and document information (which MS text conversion drops) along with appropriate character conversions.


http://docx2txt.sourceforge.net





Separate each tag with a space.

Release Date:

2009-10-04

Topics:

License:

Features:

  • Consists of (core) Perl and (wrapper) Unix/Windows shell scripts and a configuration file
  • Can recover text from damaged docx documents in many cases
  • Short line justifications, showing hyperlink (features missing in MS text conversion)
  • Focus is on a good (ASCII) text experience
  • Installation via Makefiles and Windows batch file
  • Can conveniently be used to build a web based docx document conversion service

Ratings and Reviews

  • Thumbs up:

    4
  • Thumbs down:

    1
80% of 5 users recommend this project
  • Thumbs up

    This is an excellent extractor of text from docx files. If you use CakeCMD or No-Frills Command Unzipper to unzip the docx files, it will even extract text from corrupt docx files. This works well in a CGI script providing a text extraction web service of even corrupt docx files. See my instance at saveofficedata.com.

    posted by socrtwo22 46 days ago
    If you'd like to rate this review, please log in.
  • Thumbs up

    Quite handy tool for viewing docx document's content.

    posted by anonymous 67 days ago
    If you'd like to rate this review, please log in.

View all reviews

Project Feed

Rate and Review

Would you recommend this project?






<

Related Projects

docx2txt Actions

Thanks for your rating!

Would you also like to write a review?





Skip Review