Menu

#279 djvuxmlparser error: Failed to get specified page in XMLParser.cpp:581

djview
open
nobody
None
5
2017-04-26
2017-04-26
Sam Wilson
No

When trying to merge OCR text back into a concatenated DjVu file, I'm getting the following error:

$ djvuxmlparser web1990gard_djvu.xml_new.xml 
*** [1-16201] Failed to get specified page.
*** (XMLParser.cpp:581)
*** 'DJVU::GP<DJVU::DjVuFile> DJVU::lt_XMLParser::Impl::get_file(const DJVU::GURL&, DJVU::GUTF8String)'

The original XML is: https://archive.org/download/web1990gard/web1990gard_djvu.xml

And the DjVu file is being created by making single-page DjVus from Jpegs, and merging them with djvm -c. The XML is modified so that the <PARAM name="PAGE" value="web1990gard_p174.djvu" /> element point to the correct filename (the map name is not changed).

This process is working correctly for %95 of files that are being processed, but it's bombing out with the above error on a few.

Can anyone tell me how I might start to debug this error? (I'm sorry that this isn't a very good bug report!)

Are there restrictions on the value of the page name? Or of the map? My cursory reading of XMLparser.cpp suggests that different things are done with integer page IDs, but I'm not even sure which is the page ID, or if it's just the position of the page in the document (counted from zero?).

Thanks for any help!

Discussion

  • Leon Bottou

    Leon Bottou - 2017-05-08

    Could this come from the “_” character in the pageid ?

    Sorry for not being very useful.  I have little knowledge of all this XML code contributed by Lizardtech.

    • L.

    From: Sam Wilson samwilson@users.sf.net
    Reply-To: Ticket 279 279@bugs.djvu.p.re.sf.net
    Date: Wednesday, April 26, 2017 at 3:34 AM
    To: Ticket 279 279@bugs.djvu.p.re.sf.net
    Subject: [djvu:bugs] #279 djvuxmlparser error: Failed to get specified page in XMLParser.cpp:581

    [bugs:#279] djvuxmlparser error: Failed to get specified page in XMLParser.cpp:581

    Status: open
    Group: djview
    Created: Wed Apr 26, 2017 07:34 AM UTC by Sam Wilson
    Last Updated: Wed Apr 26, 2017 07:34 AM UTC
    Owner: nobody

    When trying to merge OCR text back into a concatenated DjVu file, I'm getting the following error:

    $ djvuxmlparser web1990gard_djvu.xml_new.xml
    [1-16201] Failed to get specified page.
    (XMLParser.cpp:581)
    *** 'DJVU::GP<DJVU::DjVuFile> DJVU::lt_XMLParser::Impl::get_file(const DJVU::GURL&, DJVU::GUTF8String)'
    The original XML is: https://archive.org/download/web1990gard/web1990gard_djvu.xml

    And the DjVu file is being created by making single-page DjVus from Jpegs, and merging them with djvm -c. The XML is modified so that the <PARAM name="PAGE" value="web1990gard_p174.djvu"/> element point to the correct filename (the map name is not changed).

    This process is working correctly for %95 of files that are being processed, but it's bombing out with the above error on a few.

    Can anyone tell me how I might start to debug this error? (I'm sorry that this isn't a very good bug report!)

    Are there restrictions on the value of the page name? Or of the map? My cursory reading of XMLparser.cpp suggests that different things are done with integer page IDs, but I'm not even sure which is the page ID, or if it's just the position of the page in the document (counted from zero?).

    Thanks for any help!

    Sent from sourceforge.net because you indicated interest in https://sourceforge.net/p/djvu/bugs/279/

    To unsubscribe from further messages, please visit https://sourceforge.net/auth/subscriptions/

     
    • Sam Wilson

      Sam Wilson - 2017-05-11

      Could this come from the “_” character in the pageid ?

      I'll give it a go without. But I'm not hopeful, because it seems to be working in most cases, all of which include the udnerscore. But perhaps there's some strange interaction with other things.

      Thanks for helping think about this. :-)

       

Log in to post a comment.