From: Erik S. <eri...@we...> - 2003-08-14 15:22:48
|
Hi list! I've a problem with (digging and) merging excel files. I read the FAQ tried different values in the config- and the doc2html-file etc. I use xlhtml-0.4.9.3. Below are excerpts from the files used. I can process .doc and .pdf files without any problems. The excel file seems to be digged. But not merged because no excerpt has been generated. I recognize that no title is printed by htdig as for the other files. Kind regards, Erik ----------------- ardopt-htdig.conf: ----------------- ardopt-htdig.conf has size set to 200000. ----------------- htdig results: ----------------- pick: 192.168.0.206, # servers = 1 43:43:2:http://192.168.0.206/~ardopt/software/upload/RB/KarlGustav/ARDOPT_keywords.xls?articleID=1: Retrieval command for http://192.168.0.206/~ardopt/software/upload/RB/KarlGustav/ARDOPT_keywords.xls?articleID=1: GET /~ardopt/software/upload/RB/KarlGustav/ARDOPT_keywords.xls?articleID=1 HTTP/1.0 User-Agent: htdig/3.1.6 (unc...@ht...intainer) Referer: http://192.168.0.206/~ardopt/software/index.php?pc=1&sc=1_2&articleID=1 Host: 192.168.0.206 Header line: HTTP/1.1 200 OK Header line: Date: Thu, 14 Aug 2003 12:17:24 GMT Header line: Server: Apache/2.0.46 (Linux/SuSE) Header line: Last-Modified: Wed, 13 Aug 2003 16:03:23 GMT Converted Wed, 13 Aug 2003 16:03:23 GMT to Wed, 13 Aug 2003 16:03:23 Header line: ETag: "234b7-3600-8f12a8c0" Header line: Accept-Ranges: bytes Header line: Content-Length: 13824 Header line: Connection: close Header line: Content-Type: application/excel Header line: returnStatus = 0 Read 8192 from document Read 5632 from document Read a total of 13824 bytes size = 13824 pick: 192.168.0.206, # servers = 1 [ ***** I miss the title ...] ----------------- htmerge results: ----------------- Deleted, no excerpt: 43/http://192.168.0.206/~ardopt/software/upload/RB/KarlGustav/ARDOPT_keywords.xls?articleID=1 ----------------- doc2html.pl: ----------------- [ ***** TRIED THIS ... ] #Microsoft Excel to HTML converter # (get it from www.xlHtml.org) my $XLS2HTML = ''; [ ***** AND THIS ... ] #MicroSoft Excel to .CSV converter # (you dont need this if you have xlHtml) # (if you do want it, you can get it with catdoc) my $CATXLS = '/usr/bin/xls2csv'; [ ***** ADDED application/excel ...] # Microsoft Excel spreadsheet if ($XLS2HTML) { $mime_type = "application/msexcel|application/excel|application/vnd.ms-excel"; $cmd = $XLS2HTML; # xlHtml uses filename as title, change this: $cmdl = "$cmd -fw $Input | $ED \"s#<TITLE>$Input</TITLE>#<TITLE>[$name]</TITLE>#\""; $magic = '^\320\317\021\340'; &store_html_method('Excel (xlHtml)',$cmd,$cmdl,$mime_type,$magic); __________________________________________________________________________ Die sicherste Form der Kommunikation: E-Mails verschluesseln, Spam-Filter, Adressverifizierung, digitale Unterschrift: http://freemail.web.de |