Pdftohtml is a tool based on the Xpdf package that translates PDF documents into HTML format.
Acrobat 6+ is not supported by the Windows version. :(
This utility solved a problem for me. Installation (on OS X and probably Unix) requires some experience, but is really not that difficult. The output currently needs a pass through tidy to clean up some of the tags. That's not a real issue, and it could be fixed with some code work.
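For anyone wanting to script the convert-then-tidy step mentioned above, here is a minimal sketch in Python. It assumes both `pdftohtml` and `tidy` are installed and on the PATH, and that your build of pdftohtml accepts `-noframes` to write a single HTML file; the function names and file names are made up for illustration, and the exact output file naming may differ between versions.

```python
import subprocess
from pathlib import Path


def build_commands(pdf_path, html_path):
    """Return the two command lines: convert the PDF, then tidy the result."""
    # Assumption: -noframes makes pdftohtml write one HTML file, not a frameset.
    convert = ["pdftohtml", "-noframes", str(pdf_path), str(html_path)]
    # tidy: -q suppresses non-essential output, -m modifies the file in place.
    cleanup = ["tidy", "-q", "-m", str(html_path)]
    return convert, cleanup


def convert_and_tidy(pdf_path, html_path):
    """Run pdftohtml, then clean the generated tags with tidy."""
    convert, cleanup = build_commands(pdf_path, html_path)
    subprocess.run(convert, check=True)
    result = subprocess.run(cleanup)
    # tidy exits with 1 when it only issued warnings; 2 signals real errors.
    if result.returncode >= 2:
        raise RuntimeError(f"tidy reported errors for {html_path}")
    return Path(html_path)
```

Calling `convert_and_tidy("manual.pdf", "manual.html")` would run both steps in order. Treating tidy's exit status 1 as success is deliberate here, since warnings are expected on this kind of generated markup.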
Excellent little utility - output generally needs some tweaking, but overall it is very easy to use and gives excellent results. It would perhaps be good if image files were placed in a separate folder, but that's personal preference more than anything. It would also be good if the readme file included a copy of the "pdftohtml -h" output, just in case people don't find that switch :)