The biggest cause of corruption of PPTX corruption appears to be zip problems. This GUI uses a somewhat corruption immune unzipper, 7zip. 7zip sometimes succeeds in extracting the slide xml files that contain the text from corrupt pptx files where PowerPoint 2007 - 2013 fail with their built in unzipper.
Furthermore Corrupt PPTX Salvager uses regular expressions to extract the text from these slide XML files rather than getting hung up on correct XML structure as PowerPoint seems to do during recovery attempts.
A recent improvement is adding a zip repair pretreatment using InfoZip's zip.exe -FF command and an alternatives menu with additional ppt and pptx resources.
Corrupt PPTX Salvager is based on PPTX to Text converter by Sopan Shewale. His project is hosted on Sourceforge. Sopan's project is further based on Sandeep Kumar's docx2txt which is also found here.
This program was formerly known as PPTX Recovery and Corrupt PPTX2TXT.
- Recovers text from pptx zip corruption.
- Recovers text from pptx xml corruption.
- Command line, usable in a web service.
- Recovers notes text.
- Uses either CakeCmd or No-Frills unzippers which are zip corruption tolerant.
- Perl source is included.
Includes spam and malware. DO NOT DOWNLOAD!!!