#8 JHOVE takes over 14.5 hours to process a tagged PDF file

Status: open
Owner: Gary McGath
Labels: None
Priority: 2
Updated: 2013-04-17
Created: 2013-04-17
Creator: Ira Terman
Private: No

The file available at http://www.fcla.edu/daitss-test/files/01471-213X-12-33-S2.pdf
is a tagged PDF file. JHOVE takes over 14.5 hours to process it, and it requires the Java heap size to be set to 4 GB.

The metadata JHOVE reports for the file includes:
<size>92421469</size>
<format>PDF</format>
<version>1.4</version>
<status>Well-Formed and valid</status>
<properties>
  <property>
    <name>PDFMetadata</name>
    <values arity="List" type="Property">
      <property>
        <name>Objects</name>
        <values arity="Scalar" type="Integer">
          <value>741988</value>
        </values>
      </property>

Can some mechanism be developed to detect and prevent such long processing times?
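
One possible shape for such a mechanism is a wall-clock limit around the validation call. The sketch below is only an illustration, not JHOVE code: the class `TimeLimitedRunner` and the method `runWithTimeout` are hypothetical names, and the task passed in is a stand-in for whatever call actually processes the file. It uses only standard `java.util.concurrent` classes. Note that cancelling relies on thread interruption, so a parser that never checks the interrupt flag would keep running in the background; the worker is made a daemon thread so that it at least cannot keep the JVM alive.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

/** Hypothetical helper: runs a task and gives up after a time limit. */
public class TimeLimitedRunner {

    /**
     * Runs the task on a separate daemon thread and waits at most the given time.
     * Throws TimeoutException if the task has not finished by then.
     */
    public static <T> T runWithTimeout(Callable<T> task, long timeout, TimeUnit unit)
            throws InterruptedException, ExecutionException, TimeoutException {
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "validation-worker");
            t.setDaemon(true); // a stuck worker must not block JVM exit
            return t;
        });
        Future<T> future = executor.submit(task);
        try {
            return future.get(timeout, unit);
        } catch (TimeoutException e) {
            future.cancel(true); // request interruption of the worker thread
            throw e;
        } finally {
            executor.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        // Stand-in for a fast validation call.
        String result = runWithTimeout(() -> "validated", 1, TimeUnit.SECONDS);
        System.out.println(result);

        // Stand-in for a validation call that exceeds the limit.
        try {
            runWithTimeout(() -> {
                Thread.sleep(2_000);
                return "too slow";
            }, 100, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            System.out.println("gave up after the time limit");
        }
    }
}
```

A limit like this only prevents a single file from blocking a batch run; it does not make the file itself process any faster. A check on the object count (741988 for this file) before the expensive phase starts could be an alternative or a complement, if the count is available that early.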

Discussion