User Activity

  • Posted a comment on discussion DjVuLibre Development on DjVuLibre

    I would be very happy to be able to extract easily graphic snippets (say in png format) identified by a possibly long list of URLs like this Hebrew(?) string in an old dictionary (for the purpose of OCR). The most elegant way would be to extend ddjvu to handle pagespec not only in the form of page numbers, but also in the form of full URLs (for simplification to local files only). I guess requesting such an extensions would be unrealistic, so I would appreciate any suggestions to solve the problem....

  • Posted a comment on discussion DjVuLibre Help on DjVuLibre

    I have described the problem in details at http://teksty.klf.uw.edu.pl/11/ and provided the problematic files. When executing a script djvused doesn't issue any error messages, but in the resulting document text layer hierarchy is corrupt. The script is generated by ocrodjvu, so perhaps it is a culprit: cf. https://bitbucket.org/jwilk/ocrodjvu/issue/12/ocrodjvu-creates-an-incorrect-djvused. However, if the script is incorrect, I would like to know what is exactly wrong. Perhaps it is just too big?...

  • Posted a comment on discussion DjVuLibre Help on DjVuLibre

    I've created rather sophisticated bookmarks for the dictionary at http://korpusy.klf.uw.edu.pl/djvus/linde-t/01/index.djvu. They work as intended on Linux with djview4 as the browser plugin, but on Windows the Caminova (now DjVu Universe?) plugin is unable to handle them correctly. Is this the Caminova plugin bug? Is there a better forum to ask about the Caminova plugin? Anyway, can the problem be somehow circumvented? Most of the reders of the dictionary use Windows and djview4 doesn't work as a...

  • Posted a comment on discussion DjVuLibre Help on DjVuLibre

    I come back to this old topic because I've just noticed that the Caminowa plugin cannot handle the outline at http://korpusy.klf.uw.edu.pl/djvus/linde-t/01/index.djvu? What is the reason the djview4 doesn't work as a browser plugin? Is coding it so difficult or time-consuming?

  • Posted a comment on ticket #357 on DjVuLibre

    The problem appeared not related to the file format, it is now fixed.

  • Posted a comment on ticket #357 on DjVuLibre

    I was too optimistic, the impression the program accepts ddjvu produced TIFF as binary graphics was an illusion due to some mistake of mine. I try to persuade ChatGPT to correct the script (https://github.com/jsbien/tmp), we'll see what happens.

  • Posted a comment on ticket #357 on DjVuLibre

    Thank you very much for your patience and the detailed explanation! My problems were caused by the conversion steps. I will follow your suggestion and use the TIFF output. To my pleasant surprise the next program in the pipeline accepts TIFFs as the input, so conversion is not needed. FYI, I intend to process font tables uploaded to https://github.com/jsbien/early_fonts_inventory/tree/main/font_tables/oDjvu.I need binary files for processing and , instead of running some binarization tool, I prefer...

  • Posted a comment on ticket #357 on DjVuLibre

    Thanks for your quick answer, but what exactly do you mean by ddjvu -1? What is the complete invocation? I use in Python subprocess.run(["ddjvu", "-format=pbm", "-mode=mask", djvu_file, pbm_file]) and the output does not seem to be binary, especially when converted to PNG (this can be of course the conversion fault). Let me repeat what I posted 2 hours ago, as this doesn't seem to be distributed by mail: I just noted that for the original mask I get: Augezdecki-01a_PT08_403_mask.pbm PBM 1711x353...

View All

Personal Data

Username:
jsbien
Joined:
2005-02-03 17:39:34

Projects

  • No projects to display.

Personal Tools