From: Thomas S. <ts...@la...> - 2009-09-06 14:41:14
|
On Sat, 5 Sep 2009, Jozef Misutka wrote: > > Date: Sat, 5 Sep 2009 13:10:19 +0200 > > From: ms...@gm... > > To: pdf...@li... > > Subject: Re: [Pdfedit-support] Save file as text from the command line > > > > On Fri, Sep 04, 2009 at 07:52:43PM +0000, Jozef Misutka wrote: > > > > > > hi, > > > > > > i changed the algorithm of pdftotext a bit but it is far from what i > > > would like it to be. nevertheless, i can provide you with source code > > > of you tool using our pdfedit library extract text function but you > > > would have to compile it by your own. will it help? > > > > Isn't this just overkill for something that is already scripted and it > > no, because it is definitely a useful tool. > > will do it on monday as i am away from pc. > > /jozo I agree. A better pdftotext would certainly be a benefit, as it could improve many existing scripts using it. Nevertheless I would like to learn how to use pdfedit from the command line as well. Thomas > > is only matter how to call the script? Martin, could you help with this? > > > > > > > > /jozo > > > > > > ---------------------------------------- > > > > Date: Fri, 4 Sep 2009 11:52:46 +0200 > > > > From: ts...@la... > > > > To: pdf...@li... > > > > Subject: Re: [Pdfedit-support] Save file as text from the command line > > > > > > > > On Fri, 4 Sep 2009, Alister Hood wrote: > > > > > > > >> Sorry if someone else replied and I missed it. > > > >> I don't know how to do this with pdfedit, but you could alternatively > > > >> try the pdftotext tool from xpdf, or pdftohtml if that is more > suitable > > > >> for your purpose. > > > >> > > > >> Alister > > > > > > > > I am currently using pdftotext in my script. However, it doesn't work > > > > well. That means, it drops a lot of spaces between words which makes > the > > > > output almost unuseable. This may be a problem with the PDF-input, but > > > > I have no influence on this. For this reason I tried to use pdfedit > and > > > > found, that it's much better: the output is perfect. > > > > > > > >>From the man page I can see that there is a command line mode. I found > the > > > > script savealltext.qs on the wiki. But I can't figure out how to use > this > > > > from the command line. I still guess it must be easy, but I have no > > > > success so far. Unfortunately I could not find any examples of how to > use > > > > pdfedit in command line mode. > > > > > > > > Thomas > > > > > > > > > > > >> -----Original Message----- > > > >> From: Thomas Spahni [mailto:ts...@la...] > > > >> Sent: Thursday, 3 September 2009 12:21 a.m. > > > >> To: pdf...@li... > > > >> Subject: [Pdfedit-support] Save file as text from the command line > > > >> > > > >> Hello > > > >> > > > >> I'm a new subscriber on this list; greetings to everyone. > > > >> > > > >> I have a bash script which at some point should translate a PDF file > to > > > >> plain text. Let's say we have foobar.pdf and want to convert it to > > > >> foobar.txt. I can do this from the GUI but I'm unable to figure out > what > > > >> > > > >> the command should be to do the same from the command line. > > > >> > > > >> Yes, I read the docs, manpage, wiki, archives, but still no luck. Yor > > > >> help > > > >> would be very much appreciated. > > > >> > > > >> Details: PDFedit 0.4.2 from the SuSE-11.1 packman repo. > > > >> > > > >> Best ragards, > > > >> Tom |