From: Franck C. <fra...@fr...> - 2001-11-27 22:19:10
|
Hi! Where can I find the doc2html.pl file on your web site? Regards, Franck |
From: Douglas K. <kl...@he...> - 2004-05-25 23:02:17
|
I've downloaded doc2html.pl and have been experimenting with it to process PDF files. pdf2html.pl works on its own, but doc2html.pl, which should call pdf2html.pl, doesn't work and never calls it (I've edited both files to fill in local pathnames). I think doc2html.pl compares the magic number '^\320\317\021\340', given in the store_methods subroutine for PDF files, with the beginning of the file as read by the read_magic subroutine, and finds that they don't match. Before I pursue this further, is this a known problem? TIA. Douglas ======== Douglas Kline kl...@he... |
From: David A. <D.J...@so...> - 2004-05-26 09:17:47
|
In my version of doc2html.pl, the magic number '^\320\317\021\340' is the test for a Microsoft file (Word, Excel or PowerPoint). The test for a PDF file should use '%PDF-|\0PDF CARO\001\000\377'. David Adams Corporate Information Services Information Systems Services University of Southampton |
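For anyone who wants to check which of the two patterns quoted in the reply above a given file actually matches, here is a minimal sketch. It is not the code in doc2html.pl: it is written in Python rather than Perl, the helper names `classify_magic` and `classify_file` are hypothetical, and the 32-byte read length is an assumption. The octal escapes from the thread are rewritten as the equivalent hex bytes (octal 320 317 021 340 is hex D0 CF 11 E0).

```python
import re

# Patterns quoted in the thread, rewritten as Python byte strings.
# Octal \320\317\021\340 is the same byte sequence as hex D0 CF 11 E0.
MS_OFFICE_MAGIC = re.compile(b"^\xd0\xcf\x11\xe0")
PDF_MAGIC = re.compile(b"%PDF-|\x00PDF CARO\x01\x00\xff")


def classify_magic(head: bytes) -> str:
    """Label the leading bytes of a file (hypothetical helper, not doc2html.pl)."""
    if MS_OFFICE_MAGIC.search(head):
        return "ms-office"
    if PDF_MAGIC.search(head):
        return "pdf"
    return "unknown"


def classify_file(path: str, nbytes: int = 32) -> str:
    """Read the first nbytes of a file (an assumed length) and classify them."""
    with open(path, "rb") as handle:
        return classify_magic(handle.read(nbytes))
```

Calling `classify_file` on a PDF that the script fails to convert shows whether its leading bytes match the PDF pattern at all. If the result is "unknown", printing `repr()` of those leading bytes is a quick way to see what the file really starts with.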
From: Gilles D. <gr...@sc...> - 2001-11-28 04:06:25
|
According to Franck Collineau: > Where can i find the doc2html.pl file on your web site ? See http://www.htdig.org/FAQ.html#q4.9, where you'll find a link to the place on the web site where contributed parser and converter scripts are kept. -- Gilles R. Detillieux E-mail: <gr...@sc...> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 |