v0.39 win: pdftohtml -xml VS pdftohtml -xml -enc UTF-8
Status: Beta
Brought to you by:
meshko
Dear Developers,
I have discovered a minor issue with pdftohtml v0.39. When I run:
INCORRECT
>pdftohtml -xml sample.pdf sample
the resulting sample.xml is not well-formed.
If I specify the output text encoding as in:
CORRECT
>pdftohtml -xml -enc UTF-8 sample.pdf sample
the resulting sample.xml is well-formed!
Best Regards,
Cetin Sert
sample pdf