first off, I want to thanks the author of pdftohtml because I'm using it since months right now, and it's a very very useful tool which really helps me.
And it's a pity it's not updated anymore from no one.
It's a great tool, but I'm still having some problems with it.
Here's my command line (I'm using the v0.39) :
"pdftohtml -c -i -noframes -xml -enc UTF-8 filename.pdf"
The problem is I have lots of € (euro) signs in my pdf, and those characters seem to not be converted and don't appear in my resulting xml file (for other pdfs, they're sometimes replaced by a square in my file).
Can someone help me with that annoying problem ?
Log in to post a comment.