From: Jonathan P. <jo...@sn...> - 2002-02-25 21:29:22
At 20:32 25/02/02 +0000, Jonathan Stowe wrote:
>The 'text' in a PDF file is stored as a page description, a sort of image
>file really, so the text that appears in the acrobat viewer doesn't
>actually appear literally in the file. If you are on a Unix-like system
>you can see this for yourself by doing 'strings *.pdf'. Simple Search
>really is *simple*, it looks for the plaintext strings that are specified
>on the form in the specified files, it won't find them in a PDF file.

I notice these people have the beginnings of a Perl PDF library -
http://www.sanface.com/PDF-lib/ but it's not useable yet. Maybe some
london.pm people might want to help out :-)
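
To make the quoted point concrete, here is a minimal Python sketch. It is not Simple Search's actual code, which isn't shown in this thread. It assumes the page's content stream is stored compressed with zlib/deflate (what PDF calls the FlateDecode filter); the naive_search helper and the sample stream are made up for illustration.

```python
import zlib


def naive_search(data: bytes, term: str) -> bool:
    """Case-insensitive plaintext substring search over raw bytes.

    Hypothetical stand-in for a 'simple' search that just scans file contents.
    """
    return term.lower().encode("latin-1") in data.lower()


# A simplified, made-up page content stream: the Tj operator draws the string.
content = b"BT /F1 12 Tf 72 720 Td (Hello london.pm) Tj ET"

# Assumption: the stream is stored deflate-compressed, as many PDFs do.
compressed = zlib.compress(content)

# The search term is visible in the uncompressed stream...
print(naive_search(content, "london.pm"))

# ...but not in the compressed bytes that actually sit in the file.
print(naive_search(compressed, "london.pm"))

# The text is still recoverable once the stream is decoded.
print(naive_search(zlib.decompress(compressed), "london.pm"))
```

So a plaintext scan only has a chance once something has parsed the PDF and decoded its streams, which is the job a PDF library would have to do.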