#5 Error when installing pdfsandwich through software centre

Group: v1.0 (example)
Status: closed
Owner: nobody
Labels: None
Priority: 5
Updated: 2015-07-09
Created: 2015-02-14
Creator: Sati
Private: No

The following warning came up when installing pdfsandwich:

Lintian check results for /home/sati/Downloads/pdfsandwich_0.1.3_i386.deb:
E: pdfsandwich: control-file-has-bad-owner md5sums james/james != root/root
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/ 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/bin/ 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/bin/pdfsandwich 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/ 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/doc/ 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/doc/changelog.Debian.gz 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/doc/pdfsandwich/ 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/doc/pdfsandwich/changelog 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/doc/pdfsandwich/copyright 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/man/ 1000/1000

which subsequently led to the following error:

(Reading database ... 337893 files and directories currently installed.)
Removing edisplay (0.8.9-3build1) ...
Removing exactimage (0.8.9-3build1) ...
Removing unpaper (0.4.2-1) ...
Processing triggers for man-db (2.6.7.1-1) ...
Processing triggers for mime-support (3.54ubuntu1) ...
Processing triggers for doc-base (0.10.5) ...
Processing 1 removed doc-base file...
Registering documents with scrollkeeper...
Selecting previously unselected package exactimage.
(Reading database ... 337830 files and directories currently installed.)
Preparing to unpack .../exactimage_0.8.9-3build1_i386.deb ...
Unpacking exactimage (0.8.9-3build1) ...
Selecting previously unselected package unpaper.
Preparing to unpack .../unpaper_0.4.2-1_i386.deb ...
Unpacking unpaper (0.4.2-1) ...
Processing triggers for man-db (2.6.7.1-1) ...
Processing triggers for doc-base (0.10.5) ...
Processing 1 added doc-base file...
Registering documents with scrollkeeper...
Setting up exactimage (0.8.9-3build1) ...
Setting up unpaper (0.4.2-1) ...
Selecting previously unselected package pdfsandwich.
(Reading database ... 337889 files and directories currently installed.)
Preparing to unpack .../pdfsandwich_0.1.3_i386.deb ...
Unpacking pdfsandwich (0.1.3) ...
dpkg: dependency problems prevent configuration of pdfsandwich:
pdfsandwich depends on tesseract-ocr (>= 3.00).

dpkg: error processing package pdfsandwich (--install):
dependency problems - leaving unconfigured
Processing triggers for man-db (2.6.7.1-1) ...

How should I resolve this issue? Thanks!

Discussion

  • Tobias Elze

    Tobias Elze - 2015-02-14

    Hi,

    The critical error is this one:

    dpkg: dependency problems prevent configuration of pdfsandwich:
    pdfsandwich depends on tesseract-ocr (>= 3.00).

    So it seems that there is no tesseract >= 3.00 available in your software packaging system.

    What's your operating system exactly? And did you check which version of tesseract it provides, if any?
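One thing worth ruling out at this point: dpkg checks dependencies against its own package database, not against whatever tesseract binary happens to be on the PATH, and dpkg does not download missing dependencies by itself. The following is a minimal sketch of such a check, not something taken from pdfsandwich. The function name check_pkg_dep is made up for illustration; dpkg-query -W and dpkg --compare-versions are standard dpkg tools.

```shell
# Hypothetical helper: does the installed *package* $1 satisfy version >= $2?
check_pkg_dep() {
    pkg=$1
    min=$2
    if ! command -v dpkg-query >/dev/null 2>&1; then
        echo "dpkg-query not found; not a Debian-style system?" >&2
        return 2
    fi
    # Prints the installed version; fails if dpkg does not know the package.
    ver=$(dpkg-query -W -f='${Version}' "$pkg" 2>/dev/null) || ver=""
    if [ -z "$ver" ]; then
        echo "$pkg is not installed as a package"
        return 1
    fi
    if dpkg --compare-versions "$ver" ge "$min"; then
        echo "$pkg $ver satisfies >= $min"
    else
        echo "$pkg $ver is older than $min"
        return 1
    fi
}

# Example call: check_pkg_dep tesseract-ocr 3.00
# If the package turns out to be missing, "sudo apt-get install -f" asks apt
# to fetch unmet dependencies of packages that dpkg left unconfigured.
```

If the check reports that tesseract-ocr is not installed as a package even though a tesseract command runs, the binary probably came from somewhere other than the package system, which dpkg cannot see.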

    Tobias

     
    • Sati

      Sati - 2015-02-15

      Hi,

      My OS is Ubuntu 14.04LTS.

      The tesseract version I am using is 3.03, which fits the requirement of >=
      3.00. Wonder what's wrong?

      sati@zephyr:~$ tesseract -v

      tesseract 3.03
      leptonica-1.70
      libgif 4.1.6(?) : libjpeg 8d : libpng 1.2.50 : libtiff 4.0.3 : zlib 1.2.8 : webp 0.4.0

      Regards,
      Sati

  • Tobias Elze

    Tobias Elze - 2015-02-15

    Hi Sati,

    My OS is Ubuntu 14.04LTS.
    The tesseract version I am using is 3.03, which fits the requirement of >= 3.00. Wonder what's wrong?

    I see. That's the same Ubuntu version that I have here, really strange. However, I have a 64 bit OS, and as I understand, yours is 32 bit. As I don't have access to a 32 bit system, I can't try out the 32 bit deb package of pdfsandwich right now. There might be something wrong with the package then.
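When it is unclear whether a system is 32 or 64 bit, the machine type can be queried before picking a package. The sketch below is only an illustration: the helper name deb_arch_for is hypothetical, and the mapping covers just the two architectures discussed in this thread.

```shell
# Hypothetical helper: map a `uname -m` machine name to a .deb architecture.
deb_arch_for() {
    case "$1" in
        x86_64|amd64)        echo amd64 ;;
        i386|i486|i586|i686) echo i386 ;;
        *)                   echo "unknown ($1)"; return 1 ;;
    esac
}

# Kernel machine type of the running system.
machine=$(uname -m)
echo "machine: $machine -> package suffix: $(deb_arch_for "$machine")"
# On Debian/Ubuntu, `dpkg --print-architecture` reports the package
# architecture directly and is the more reliable source.
```

Note that uname -m describes the kernel; a 32 bit userland on a 64 bit kernel is possible, which is why the dpkg query is preferable where available.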

    Could you try to generate the package from the sources? It's very easy. You need to install the package "ocaml". Then download the sources, unpack them, and type

    make deb

    This should automatically generate the 32 bit deb package, if you run this on a 32 bit system.

    Does that work, and if so, can you install that package then?

    Tobias

     
    • Sati

      Sati - 2015-02-18

      Hi, Tobias,

      Mine is a 64-bit system, actually. As I had forgotten which one it was, I
      attempted both the 32-bit and 64-bit packages, but neither worked.

      How should I proceed in this case, then?

      Thank you!

      Regards,
      Sati

  • Tobias Elze

    Tobias Elze - 2015-02-19

    Hi Sati,

    Well, you could still try to compile it from sources. First, you need to install ocaml and subversion, if you don't have them installed yet. You can do this either through the software center or on the command line:

    sudo apt-get install ocaml subversion

    Second, open a terminal and paste the following:

    svn checkout svn://svn.code.sf.net/p/pdfsandwich/code/trunk/src pdfsandwich
    cd pdfsandwich
    ./configure
    make
    sudo make install

    That's all. If you run pdfsandwich and it does not work, first try running it with the additional option -verbose, which prints detailed information that will likely help you understand what the problem is. If pdfsandwich complains, for instance, that some program such as tesseract is not installed, you need to install that software first.
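Before running pdfsandwich, a quick check for the external programs can save a round trip. This is a sketch under assumptions: the helper name missing_tools is made up, and the list of programs is the one that the -verbose output quoted later in this thread reports checking for (convert, unpaper, tesseract, gs).

```shell
# Hypothetical helper: print each named program that is not on the PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || echo "$tool"
    done
}

# Programs that pdfsandwich's -verbose output reports checking for.
missing=$(missing_tools convert unpaper tesseract gs)
if [ -n "$missing" ]; then
    # Left unquoted on purpose so the names print on one line.
    echo "Not found on PATH:" $missing
else
    echo "All helper programs found."
fi
```

Any name printed here would need to be installed through the package manager before pdfsandwich can use it.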

    You may also want to read through the manual - there are some usage examples:

    http://www.tobias-elze.de/pdfsandwich/

    Does that help?

     
    • Sati

      Sati - 2015-02-19

      Hi, Tobias,

      Thanks a lot! The installation was successful.

      I have nonetheless two separate issues here - wonder if you could help:

      Running pdfsandwich on my Ubuntu machine for a single-page English document
      generated satisfactory results with a few errors here and there.

      As for a similar Traditional Chinese document, the OCR was
      unsuccessful. As you can see from the samples attached below, gs27_ocr.pdf
      was rotated to the left and the 'layered text' was garbled and off.

      Here's a verbose record of the process:

      pdfsandwich -verbose -lang chi_tra gs27.pdf

      pdfsandwich version 0.1.3
      Checking for convert:
      convert -version
      Version: ImageMagick 6.7.7-10 2014-03-06 Q16 http://www.imagemagick.org
      Copyright: Copyright (C) 1999-2012 ImageMagick Studio LLC
      Features: OpenMP

      Checking for unpaper:
      unpaper -version
      0.4.2
      Checking for tesseract:
      tesseract -v
      tesseract 3.03
      leptonica-1.70
      libgif 4.1.6(?) : libjpeg 8d : libpng 1.2.50 : libtiff 4.0.3 : zlib 1.2.8 : webp 0.4.0

      Checking for gs:
      gs -v
      GPL Ghostscript 9.10 (2013-08-30)
      Copyright (C) 2013 Artifex Software, Inc. All rights reserved.
      Input file: "gs27.pdf"
      Output file: "gs27_ocr.pdf"
      Number of pages in inputfile: 1
      More threads than pages. Using 1 threads instead.
      Processing page 1.
      identify -format "%w\n%h\n" "gs27.pdf[0]"
      convert -type Bilevel -density 300x300 "gs27.pdf[0]" /tmp/pdfsandwich28d055.pbm
      unpaper --overwrite --no-grayfilter --layout none /tmp/pdfsandwich28d055.pbm /tmp/pdfsandwichcb227a_unpaper.pbm
      Processing sheet #1: /tmp/pdfsandwich28d055.pbm -> /tmp/pdfsandwichcb227a_unpaper.pbm
      tesseract /tmp/pdfsandwichcb227a_unpaper.pbm /tmp/pdfsandwich3e32f2 -l chi_tra pdf
      Tesseract Open Source OCR Engine v3.03 with Leptonica
      gs -q -dNOPAUSE -dBATCH -sDEVICE=pdfwrite -dDEVICEWIDTHPOINTS=595 -dDEVICEHEIGHTPOINTS=841 -dPDFFitPage -o /tmp/pdfsandwich8ed81b.pdf /tmp/pdfsandwich3e32f2.pdf
      GPL Ghostscript 9.10: Missing glyph CID=0, glyph=5341 in the font GlyphLessFont . The output PDF may fail with some viewers.
      OCR done. Writing "gs27_ocr.pdf"
      gs -q -dNOPAUSE -dBATCH -sDEVICE=pdfwrite -sOutputFile="gs27_ocr.pdf" /tmp/pdfsandwich8ed81b.pdf
      GPL Ghostscript 9.10: Missing glyph CID=0, glyph=5341 in the font WZFCRI+GlyphLessFont . The output PDF may fail with some viewers.

      Done.


      I've also installed pdfsandwich on my Macbook. The program ran fine without
      any indication of errors, or so it seems, but the product was a 2kb pdf
      file with nothing on it.

      The verbose process is as follows:

      pdfsandwich -verbose -lang chi_tra 1.pdf

      pdfsandwich version 0.1.3
      Checking for convert:
      convert -version
      Version: ImageMagick 6.9.0-0 Q16 x86_64 2014-12-11 http://www.imagemagick.org
      Copyright: Copyright (C) 1999-2014 ImageMagick Studio LLC
      Features: DPC OpenCL
      Delegates (built-in): bzlib fftw jng jpeg lcms lzma png tiff webp xml zlib

      Checking for unpaper:
      unpaper -version
      5.1
      Checking for tesseract:
      tesseract -v
      tesseract 3.02.02
      leptonica-1.71
      libgif 4.2.3 : libjpeg 9a : libpng 1.6.16 : libtiff 4.0.3 : zlib 1.2.8 : libwebp 0.4.2 : libopenjp2 2.1.0

      Checking for gs:
      gs -v
      GPL Ghostscript 9.10 (2013-08-30)
      Copyright (C) 2013 Artifex Software, Inc. All rights reserved.
      Input file: "1.pdf"
      Output file: "1_ocr.pdf"
      Number of pages in inputfile: 1
      More threads than pages. Using 1 threads instead.
      Processing page 1.
      identify -format "%w\n%h\n" "1.pdf[0]"
      convert -type Bilevel -density 300x300 "1.pdf[0]" /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwichc52b80.pbm
      unpaper --overwrite --no-grayfilter --layout none /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwichc52b80.pbm /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwich4aa1a0_unpaper.pbm
      Processing sheet #1: /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwichc52b80.pbm -> /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwich4aa1a0_unpaper.pbm
      tesseract /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwich4aa1a0_unpaper.pbm /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwich4c47db -l chi_tra pdf
      read_params_file: Can't open pdf
      Tesseract Open Source OCR Engine v3.02.02 with Leptonica
      Version of tesseract is prior to 3.03 and cannot output pdf yet. Using hocr2pdf instead.
      tesseract /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwich4aa1a0_unpaper.pbm /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwich4c47db -l chi_tra hocr
      Tesseract Open Source OCR Engine v3.02.02 with Leptonica
      hocr2pdf -r 300 -i /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwich4aa1a0_unpaper.pbm -o /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwich6661a8.pdf</var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwich4c47db.html
      sh: hocr2pdf: command not found
      OCR done. Writing "1_ocr.pdf"
      gs -q -dNOPAUSE -dBATCH -sDEVICE=pdfwrite -sOutputFile="1_ocr.pdf" /var/folders/rw/rpr418x91h13ysbc6k7fzj240000gn/T/pdfsandwich6661a8.pdf

      Done.

      Testing out with an English document produces the same results.

      Have you any clue as to how to tackle these issues? Thanks!

      Regards,
      Sati

  • Tobias Elze

    Tobias Elze - 2015-02-19

    Hi Sati,

    As for the Chinese documents, I'm sorry, I don't know what went wrong. However, the problem on the Mac is easy to explain:

    The tesseract version on your Mac is too old to output PDFs directly; only versions from 3.03 onward can do that. pdfsandwich has a fallback for such cases: it uses the program hocr2pdf, which is part of the exact-image package. However, this program is not installed on your Mac.

    So you have two options to solve this:

    (1) recommended: Install a newer version of tesseract (>=3.03). If you don't have access to this, then

    (2) install exact-image, which contains hocr2pdf.
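
    To see which of the two options already applies on a given machine, a rough sketch along these lines may help. It assumes the version banner contains text like "tesseract 3.02.02" and treats 3.03 as the threshold mentioned above; the helper names are hypothetical and this is not how pdfsandwich itself performs the check.

```python
import re
import shutil
import subprocess

# Threshold quoted in this thread: tesseract 3.03 or later.
MIN_DIRECT_PDF = (3, 3)


def parse_tesseract_version(banner):
    """Return (major, minor) from text like 'tesseract 3.02.02', else None."""
    match = re.search(r"tesseract\s+v?(\d+)\.(\d+)", banner)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def choose_backend(banner, hocr2pdf_path):
    """Hypothetical helper: pick option (1), option (2), or neither."""
    version = parse_tesseract_version(banner)
    if version is not None and version >= MIN_DIRECT_PDF:
        return "tesseract"   # option (1): tesseract is new enough
    if hocr2pdf_path:
        return "hocr2pdf"    # option (2): fallback from exact-image
    return "none"            # neither option is available yet


if __name__ == "__main__":
    exe = shutil.which("tesseract")
    banner = ""
    if exe:
        # Some versions print the banner on stderr, so read both streams.
        result = subprocess.run([exe, "--version"],
                                capture_output=True, text=True)
        banner = result.stdout + result.stderr
    print(choose_backend(banner, shutil.which("hocr2pdf")))
```

    If this prints "none", neither a recent tesseract nor hocr2pdf was found on the PATH.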

    I hope that helps?

    Tobias

     
    • Sati

      Sati - 2015-02-20

      Hi, Tobias,

      Processing of exact-image failed because it does not build on Mavericks or
      later.

      Error: exact-image does not build on Mavericks or later.

      Error: org.macports.fetch for port exact-image returned: unsupported platform
      Please see the log file for port exact-image for details:
      /opt/local/var/macports/logs/_opt_local_var_macports_sources_rsync.macports.org_release_tarballs_ports_graphics_exact-image/exact-image/main.log
      To report a bug, follow the instructions in the guide:
      http://guide.macports.org/#project.tickets
      Error: Processing of port exact-image failed

      As for tesseract, version 3.03 doesn't seem to be available in the
      repositories. I tried to download the source code and install it on my own, but
      ./configure cannot detect leptonica on the system.

      tesseract-3.03 satibodhi$ ./configure

      checking for a BSD-compatible install... /usr/bin/install -c
      checking whether build environment is sane... yes
      checking for a thread-safe mkdir -p... config/install-sh -c -d
      checking for gawk... gawk
      checking whether make sets $(MAKE)... yes
      checking whether to enable maintainer-specific portions of Makefiles... no
      checking build system type... x86_64-apple-darwin13.4.0
      checking host system type... x86_64-apple-darwin13.4.0
      checking --enable-graphics argument... yes
      checking --enable-embedded argument... no
      checking --enable-opencl argument... no
      checking --enable-visibility argument... no
      checking --enable-multiple-libraries argument... no
      checking whether to use tessdata-prefix... yes
      checking whether to enable debugging... no
      checking for g++... g++
      checking whether the C++ compiler works... yes
      checking for C++ compiler default output file name... a.out
      checking for suffix of executables...
      checking whether we are cross compiling... no
      checking for suffix of object files... o
      checking whether we are using the GNU C++ compiler... yes
      checking whether g++ accepts -g... yes
      checking for style of include used by make... GNU
      checking dependency style of g++... gcc3
      checking how to print strings... printf
      checking for gcc... gcc
      checking whether we are using the GNU C compiler... yes
      checking whether gcc accepts -g... yes
      checking for gcc option to accept ISO C89... none needed
      checking dependency style of gcc... gcc3
      checking for a sed that does not truncate output... /usr/bin/sed
      checking for grep that handles long lines and -e... /usr/bin/grep
      checking for egrep... /usr/bin/grep -E
      checking for fgrep... /usr/bin/grep -F
      checking for ld used by gcc... /Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/ld
      checking if the linker (/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/ld) is GNU ld... no
      checking for BSD- or MS-compatible name lister (nm)... /opt/local/bin/nm
      checking the name lister (/opt/local/bin/nm) interface... BSD nm
      checking whether ln -s works... yes
      checking the maximum length of command line arguments... 196608
      checking whether the shell understands some XSI constructs... yes
      checking whether the shell understands "+="... yes
      checking how to convert x86_64-apple-darwin13.4.0 file names to x86_64-apple-darwin13.4.0 format... func_convert_file_noop
      checking how to convert x86_64-apple-darwin13.4.0 file names to toolchain format... func_convert_file_noop
      checking for /Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/ld option to reload object files... -r
      checking for objdump... no
      checking how to recognize dependent libraries... pass_all
      checking for dlltool... no
      checking how to associate runtime and link libraries... printf %s\n
      checking for ar... ar
      checking for archiver @FILE support... no
      checking for strip... strip
      checking for ranlib... ranlib
      checking command to parse /opt/local/bin/nm output from gcc object... ok
      checking for sysroot... no
      checking for mt... no
      checking if : is a manifest tool... no
      checking for dsymutil... dsymutil
      checking for nmedit... nmedit
      checking for lipo... lipo
      checking for otool... otool
      checking for otool64... no
      checking for -single_module linker flag... yes
      checking for -exported_symbols_list linker flag... yes
      checking for -force_load linker flag... yes
      checking how to run the C preprocessor... gcc -E
      checking for ANSI C header files... yes
      checking for sys/types.h... yes
      checking for sys/stat.h... yes
      checking for stdlib.h... yes
      checking for string.h... yes
      checking for memory.h... yes
      checking for strings.h... yes
      checking for inttypes.h... yes
      checking for stdint.h... yes
      checking for unistd.h... yes
      checking for dlfcn.h... yes
      checking for objdir... .libs
      checking if gcc supports -fno-rtti -fno-exceptions... yes
      checking for gcc option to produce PIC... -fno-common -DPIC
      checking if gcc PIC flag -fno-common -DPIC works... yes
      checking if gcc static flag -static works... no
      checking if gcc supports -c -o file.o... yes
      checking if gcc supports -c -o file.o... (cached) yes
      checking whether the gcc linker (/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/ld) supports shared libraries... yes
      checking dynamic linker characteristics... darwin13.4.0 dyld
      checking how to hardcode library paths into programs... immediate
      checking whether stripping libraries is possible... yes
      checking if libtool supports shared libraries... yes
      checking whether to build shared libraries... yes
      checking whether to build static libraries... yes
      checking how to run the C++ preprocessor... g++ -E
      checking for ld used by g++... /Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/ld
      checking if the linker (/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/ld) is GNU ld... no
      checking whether the g++ linker (/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/ld) supports shared libraries... yes
      checking for g++ option to produce PIC... -fno-common -DPIC
      checking if g++ PIC flag -fno-common -DPIC works... yes
      checking if g++ static flag -static works... no
      checking if g++ supports -c -o file.o... yes
      checking if g++ supports -c -o file.o... (cached) yes
      checking whether the g++ linker (/Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/ld) supports shared libraries... yes
      checking dynamic linker characteristics... darwin13.4.0 dyld
      checking how to hardcode library paths into programs... immediate
      checking whether byte ordering is bigendian... no
      checking whether compiler supports C++11... yes
      checking for library containing sem_init... none required
      checking for ANSI C header files... (cached) yes
      checking whether time.h and sys/time.h may both be included... yes
      checking for sys/wait.h that is POSIX.1 compatible... yes
      checking sys/ipc.h usability... yes
      checking sys/ipc.h presence... yes
      checking for sys/ipc.h... yes
      checking sys/shm.h usability... yes
      checking sys/shm.h presence... yes
      checking for sys/shm.h... yes
      checking limits.h usability... yes
      checking limits.h presence... yes
      checking for limits.h... yes
      checking malloc.h usability... no
      checking malloc.h presence... no
      checking for malloc.h... no
      checking for stdbool.h that conforms to C99... yes
      checking for _Bool... no
      checking whether #! works in shell scripts... yes
      checking for special C compiler options needed for large files... no
      checking for _FILE_OFFSET_BITS value needed for large files... no
      checking for wchar_t... yes
      checking for long long int... yes
      checking for mbstate_t... yes
      checking for leptonica... configure: error: leptonica not found

      make

      make: *** No targets specified and no makefile found. Stop.

      make install

      make: Nothing to be done for `install'.

      Any ideas? Thanks!

      Regards,
      Sati

  • Tobias Elze

    Tobias Elze - 2015-02-22

    Hi Sati,

    As for tesseract, version 3.03 doesn't seem to be available in the
    repositories. I tried to download the source code and install it on my own, but
    ./configure cannot detect leptonica on the system.

    It sounds like a dependency (leptonica) is missing, or at least that ./configure cannot find it. Perhaps the people in the tesseract forums can help you? Most probably, someone has installed it on a Mac before.
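
    One way to narrow this down is to check whether the leptonica headers are installed at all, and where. The sketch below rests on an assumption, not a confirmed fact about this tesseract release: that ./configure decides "leptonica found" by locating the header allheaders.h under a few include directories. The directory list is also a guess (/opt/local/include is where a MacPorts install would normally put headers).

```python
import os

# Assumption: configure looks for allheaders.h to detect leptonica.
# The prefixes below are guesses; adjust them for the machine at hand.
DEFAULT_PREFIXES = ["/usr/local/include", "/usr/include", "/opt/local/include"]
SUBDIRS = [".", "leptonica", "liblept"]


def find_leptonica_headers(prefixes=DEFAULT_PREFIXES):
    """Return every directory under the prefixes that holds allheaders.h."""
    hits = []
    for prefix in prefixes:
        for sub in SUBDIRS:
            candidate = os.path.normpath(os.path.join(prefix, sub))
            if os.path.isfile(os.path.join(candidate, "allheaders.h")):
                hits.append(candidate)
    return hits


if __name__ == "__main__":
    found = find_leptonica_headers()
    if found:
        print("allheaders.h found in:", ", ".join(found))
    else:
        print("allheaders.h not found; leptonica may not be installed")
```

    If the header turns up in a directory that ./configure does not search, ./configure --help should list any variable for pointing it at the leptonica headers. If nothing is found, leptonica needs to be installed first.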

     
  • Tobias Elze

    Tobias Elze - 2015-07-09
    • status: open --> closed
     
