The following warning came up when installing pdfsandwich:
Lintian check results for /home/sati/Downloads/pdfsandwich_0.1.3_i386.deb:
E: pdfsandwich: control-file-has-bad-owner md5sums james/james != root/root
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/ 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/bin/ 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/bin/pdfsandwich 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/ 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/doc/ 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/doc/changelog.Debian.gz 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/doc/pdfsandwich/ 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/doc/pdfsandwich/changelog 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/doc/pdfsandwich/copyright 1000/1000
E: pdfsandwich: wrong-file-owner-uid-or-gid usr/share/man/ 1000/1000
which subsequently led to the following error:
(Reading database ...
(Reading database ... 5%
(Reading database ... 10%
(Reading database ... 15%
(Reading database ... 20%
(Reading database ... 25%
(Reading database ... 30%
(Reading database ... 35%
(Reading database ... 40%
(Reading database ... 45%
(Reading database ... 50%
(Reading database ... 55%
(Reading database ... 60%
(Reading database ... 65%
(Reading database ... 70%
(Reading database ... 75%
(Reading database ... 80%
(Reading database ... 85%
(Reading database ... 90%
(Reading database ... 95%
(Reading database ... 100%
(Reading database ... 337893 files and directories currently installed.)
Removing edisplay (0.8.9-3build1) ...
Removing exactimage (0.8.9-3build1) ...
Removing unpaper (0.4.2-1) ...
Processing triggers for man-db (2.6.7.1-1) ...
Processing triggers for mime-support (3.54ubuntu1) ...
Processing triggers for doc-base (0.10.5) ...
Processing 1 removed doc-base file...
Registering documents with scrollkeeper...
Selecting previously unselected package exactimage.
(Reading database ...
(Reading database ... 5%
(Reading database ... 10%
(Reading database ... 15%
(Reading database ... 20%
(Reading database ... 25%
(Reading database ... 30%
(Reading database ... 35%
(Reading database ... 40%
(Reading database ... 45%
(Reading database ... 50%
(Reading database ... 55%
(Reading database ... 60%
(Reading database ... 65%
(Reading database ... 70%
(Reading database ... 75%
(Reading database ... 80%
(Reading database ... 85%
(Reading database ... 90%
(Reading database ... 95%
(Reading database ... 100%
(Reading database ... 337830 files and directories currently installed.)
Preparing to unpack .../exactimage_0.8.9-3build1_i386.deb ...
Unpacking exactimage (0.8.9-3build1) ...
Selecting previously unselected package unpaper.
Preparing to unpack .../unpaper_0.4.2-1_i386.deb ...
Unpacking unpaper (0.4.2-1) ...
Processing triggers for man-db (2.6.7.1-1) ...
Processing triggers for doc-base (0.10.5) ...
Processing 1 added doc-base file...
Registering documents with scrollkeeper...
Setting up exactimage (0.8.9-3build1) ...
Setting up unpaper (0.4.2-1) ...
Selecting previously unselected package pdfsandwich.
(Reading database ...
(Reading database ... 5%
(Reading database ... 10%
(Reading database ... 15%
(Reading database ... 20%
(Reading database ... 25%
(Reading database ... 30%
(Reading database ... 35%
(Reading database ... 40%
(Reading database ... 45%
(Reading database ... 50%
(Reading database ... 55%
(Reading database ... 60%
(Reading database ... 65%
(Reading database ... 70%
(Reading database ... 75%
(Reading database ... 80%
(Reading database ... 85%
(Reading database ... 90%
(Reading database ... 95%
(Reading database ... 100%
(Reading database ... 337889 files and directories currently installed.)
Preparing to unpack .../pdfsandwich_0.1.3_i386.deb ...
Unpacking pdfsandwich (0.1.3) ...
dpkg: dependency problems prevent configuration of pdfsandwich:
pdfsandwich depends on tesseract-ocr (>= 3.00).dpkg: error processing package pdfsandwich (--install):
dependency problems - leaving unconfigured
Processing triggers for man-db (2.6.7.1-1) ...
How should I resolve this issue? Thanks!
Hi,
The critical error is this one:
So it seems that there is no tesseract >= 3.00 available in your software packaging system.
What's your operating system exactly? And did you check which version of tesseract it provides, if any?
Tobias
Hi,
My OS is Ubuntu 14.04LTS.
The tesseract version I am using is 3.03, which fits the requirement of >=
3.00. Wonder what's wrong?
sati@zephyr:~$ tesseract -v
Regards,
Sati
On Sun, Feb 15, 2015 at 12:40 AM, Tobias Elze tobias-elze@users.sf.net
wrote:
Related
Bugs:
#5Hi Sati,
I see. That's the same Ubuntu version that I have here, really strange. However, I have a 64 bit OS, and as I understand, yours is 32 bit. As I don't have access to a 32 bit system, I can't try out the 32 bit deb package of pdfsandwich right now. There might be something wrong with the package then.
Could you try to generate the package from the sources? It's very easy. You need to install the package "ocaml". Then download the sources, unpack them, and type
make deb
This should automatically generate the 32 bit deb package, if you run this on a 32 bit system.
Does that work, and if so, can you install that package then?
Tobias
Hi, Tobias,
Mine is a 64-bit system, actually. As I have forgotten which one it was, I
attempted with both 32 and 64-bit packages, but none worked.
How should I proceed in this case, then?
Thank you!
Regards,
Sati
On Mon, Feb 16, 2015 at 1:49 AM, Tobias Elze tobias-elze@users.sf.net
wrote:
Related
Bugs:
#5Hi Sati,
Well, you could still try to compile it from sources. First, you need to install ocaml and subversion, if you don't have this installed yet. You can do this either through the sofware center or on the command line:
apt-get install ocaml subversion
Second, open a terminal and paste the following:
svn checkout svn://svn.code.sf.net/p/pdfsandwich/code/trunk/src pdfsandwich
cd pdfsandwich
./configure
make
sudo make install
That's all. If you run pdfsandwich and it does not work, first try to run it with the additional option -verbose, which gives you detailed information which likely help you to understand what's the problem. If pdfsandwich, for instance, complains that some program, like tesseract, is not installed, you need to install the respective software first.
You may also want to read through the manual - there are some usage examples:
http://www.tobias-elze.de/pdfsandwich/
Does that help?
Hi, Tobias,
Thanks a lot! The installation was successful.
I have nonetheless two separate issues here - wonder if you could help:
Running pdfsandwich on my Ubuntu machine for a single-page English document
generated satisfactory results with a few errors here and there.
As for a similar Traditional Chinese document, the translation was
unsuccessful. As you can see from the samples attached below, gs27_ocr.pdf
was rotated to the left and the 'layered text' was garbled and off.
Here's a verbose record of the process:
pdfsandwich -verbose -lang chi_tra gs27.pdf
I've also installed pdfsandwich on my Macbook. The program ran fine without
any indication of errors, or so it seems, but the product was a 2kb pdf
file with nothing on it.
The verbose process is as follows:
pdfsandwich -verbose -lang chi_tra 1.pdf
Testing out with an English document produces the same results.
Have you any clue as to how to tackle these issues? Thanks!
Regards,
Sati
On Thu, Feb 19, 2015 at 9:35 AM, Tobias Elze tobias-elze@users.sf.net
wrote:
Related
Bugs:
#5Hi Sati,
As for the Chinese documents, I'm sorry I don't know what went wrong. However, the problem on the Mac is easy to explain:
Your tesseract version on the Mac is too old to process pdfs directly - only versions from 3.03 can do that. pdfsandwich has a fallback option in such cases: It uses the software hocr2pdf, which is part of the exact-image package. However, this software is not installed on your Mac.
So you have two options to solve this:
(1) recommended: Install a newer version of tesseract (>=3.03). If you don't have access to this, then
(2) install exact-image, which contains hocr2pdf.
I hope that helps?
Tobias
Hi, Tobias,
Processing of exact-image failed because it does not build on Mavericks or
later.
Error: exact-image does not build on Mavericks or later.
As for tesseract, version 3.03 doesn't seem to be available in the
repositories. I tried to download the source code an install on my own but
./configure cannot detect leptonica in the system.
tesseract-3.03 satibodhi$ ./configure
make
make install
Any ideas? Thanks!
Regards,
Sati
On Fri, Feb 20, 2015 at 2:33 AM, Tobias Elze tobias-elze@users.sf.net
wrote:
Related
Bugs:
#5Hi Sati,
Sounds like some dependency is missing (leptonica). Maybe in the tesseract forums they can help you? Most probably, someone installed it on Mac before.