[pdftohtml] Bugfix: Remove spans in XML output
Status: Beta
Brought to you by:
meshko
From: Holger B. <hol...@bl...> - 2009-01-10 13:10:56
|
When in XML mode, do not output font spans. (Done in HtmlFonts.cc but apparently forgotten in HtmlOutputDev.cc.) Otherwise the XML is not valid when e.g. a table is encountered that has different fonts in its cells, example: page Table E-29, "Simplified Mnemonics", of Freescale Semiconductor, "Programming Environments Manual for 32-Bit Implementations of the PowerPC Architecture, Rev. 3" (I have seen this also in a completely unrelated document, so this is not rare Another report of this bug (by sb else) is at: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=415764 ). Patch see attached (against version 0.40). -- Holger Blasum GnuPG 1024D/ACDFC3B769DC1ED66B47 Phone (cell) +49-174-7313590 |