Screenshots
Description
This GUI program will extract text from damaged/corrupted Word 2007 - 2013 DOCX format documents.
DOCX files are actually zipped collections of mostly XML files. XML as a format is unforgiving of data corruption. The main text in docx files is found in document.xml file in the collection. The program uses 7Zip, an unzipper that will sometimes unzip partially corrupt document.xml files even though reporting an error which this program ignores.
Additionally the Perl routine used to extract the text from the document.xml file doesn't care about well-formed XML, a stumbling block of Word 2007 - 2013.
Recent changes include the pretreatment of docx files with InfoZip's zip.exe -FF repair command, improving success rates. Also added are links to commercial solutions.
I also recommend trying my other corrupt Word recovery capable programs here on Sourceforge, S2 Recovery Tools for Word and Savvy Office Recovery.
Previously known as Damaged DOCX2TXT.
Categories
License
Update Notifications
User Ratings
User Reviews
-
Very useful. Thanks.
-
It couldn't fix ALL my files, but it certainly went a long way towards recovering everything it possibly could. Very happy with it.