Perl based utility to extract formatted text content from MS Docx file
Docx2txt is a Perl based command-line utility to convert (even corrupted) Microsoft docx documents to reasonably formatted text files, along with appropriate character conversions. Apart from Perl it also requires a command line unzipping program like unzip/7z/pkzipc/wzunzip.
This is a mechanism for fast, incremental backup of web sites that allow CGI scripts and FTP access but no direct shell access, with better performance than solutions based solely on ftp.