XML was designed from the beginning to be intolerant of errors: a conforming parser must stop at the first well-formedness error. This design works against MS Word's corruption recovery. A single error in the document.xml subfile, where the DOCX file's main text is stored, makes Word stop and report an error instead of recovering part of the document.
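To illustrate that strictness, here is a minimal Python sketch using only the standard library. It is not part of Savvy DOCX Recovery; the helper names `check_document_xml` and `make_docx` and the simplified tag names are hypothetical, and the in-memory archive holds only word/document.xml rather than a complete DOCX.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def check_document_xml(docx_bytes):
    """Return None if word/document.xml parses, else the parser's error message."""
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as archive:
        xml_data = archive.read("word/document.xml")
    try:
        ET.fromstring(xml_data)
    except ET.ParseError as err:
        # A strict XML parser gives up at the first well-formedness error.
        return str(err)
    return None


def make_docx(document_xml):
    """Build an in-memory ZIP holding only word/document.xml (illustration only)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("word/document.xml", document_xml)
    return buffer.getvalue()


# Simplified stand-ins for real WordprocessingML markup.
GOOD = "<doc><p><t>Hello</t></p><p><t>World</t></p></doc>"
BAD = "<doc><p><t>Hello</t></p><p><t>World</p></t></doc>"  # last two closing tags swapped

print(check_document_xml(make_docx(GOOD)))
print(check_document_xml(make_docx(BAD)))
```

The second document differs from the first only by two swapped closing tags, yet the parser rejects it as a whole, including the intact first paragraph.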
Savvy DOCX Recovery attempts precise surgery on corrupt Word documents, reordering or excising bad XML tags.
If this doesn't work, it first uses the command-line app xmllint to try to repair the corrupt XML subfiles of the DOCX. If that also fails, a second attempt is made in which the corrupt XML subfiles are truncated at the first error and the correct closing tags are again added with xmllint.
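The truncate-and-close idea can be approximated in a few lines. The sketch below is an assumption-laden stand-in: it uses Python's bundled expat parser instead of xmllint, and the function name `truncate_and_close` is hypothetical. It cuts the XML at the first error and appends closing tags for every element still open at that point.

```python
import xml.parsers.expat


def truncate_and_close(xml_bytes):
    """Cut xml_bytes at the first well-formedness error and close open elements."""
    open_tags = []  # stack of element names that are currently open

    parser = xml.parsers.expat.ParserCreate()
    parser.StartElementHandler = lambda name, attrs: open_tags.append(name)
    parser.EndElementHandler = lambda name: open_tags.pop()

    try:
        parser.Parse(xml_bytes, True)
        return xml_bytes  # already well-formed, nothing to repair
    except xml.parsers.expat.ExpatError:
        cut = max(parser.ErrorByteIndex, 0)

    # If the error position falls inside a tag, back up to that tag's "<"
    # so no half-written tag is left in the output.
    last_open = xml_bytes.rfind(b"<", 0, cut)
    last_close = xml_bytes.rfind(b">", 0, cut)
    if last_open > last_close:
        cut = last_open

    # Close the still-open elements, innermost first.
    closing = b"".join(b"</" + name.encode("utf-8") + b">"
                       for name in reversed(open_tags))
    return xml_bytes[:cut] + closing


damaged = b"<doc><p><t>Hello</t></p><p><t>World</p></t></doc>"
print(truncate_and_close(damaged).decode("utf-8"))
```

Everything after the first error is discarded, so this trades completeness for a file that parses. Edge cases such as a cut that lands inside an entity reference are not handled here.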
If all else fails, SilverCoder's DocToText is used to extract text.
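As a rough picture of what a last-resort text extraction involves, the sketch below pulls the contents of `<w:t>` text runs out of a damaged document.xml with a regular expression, ignoring whether the surrounding XML is well-formed. It is not DocToText and does not reflect how DocToText works internally; the function name `salvage_text` is hypothetical.

```python
import html
import re

# Matches <w:t> or <w:t attr="..."> runs, but not <w:tbl>, <w:tab/>, <w:tc>, etc.
W_T = re.compile(rb"<w:t(?:\s[^>]*)?>(.*?)</w:t>", re.DOTALL)


def salvage_text(xml_bytes):
    """Extract text runs paragraph by paragraph without parsing the XML."""
    paragraphs = []
    for chunk in xml_bytes.split(b"</w:p>"):
        runs = W_T.findall(chunk)
        if runs:
            text = b"".join(runs).decode("utf-8", errors="replace")
            # html.unescape also decodes the XML entities (&amp;, &lt;, ...).
            paragraphs.append(html.unescape(text))
    return "\n".join(paragraphs)


sample = (
    b'<w:document><w:body>'
    b'<w:p><w:r><w:t>Hello </w:t></w:r>'
    b'<w:r><w:t xml:space="preserve">A &amp; B</w:t></w:r></w:p>'
    b'<w:p><w:r><w:t>Second</w:t></w:p></w:r>'  # closing tags out of order
)
print(salvage_text(sample))
```

All formatting is lost with this approach; only the plain text of each paragraph survives.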
Also try http://wordcorruptdocchecker.codeplex.com/ and https://support.microsoft.com/en-us/kb/2528942, as well as my other SF projects: Corrupt Extractor for Microsoft Office, Corrupt DOCX Salvager, and S2 Recovery Tools for Microsoft Word.
- Recovers some DOCX files that Word will not open.
- Automatically iterates through four methods for recovering the document.
- Simple GUI.