From: Gilles D. <gr...@sc...> - 2002-02-12 20:54:52
|
According to Gabriele Bartolini: > I am working on a PHP wrapper project for ht://Dig. I read an > interesting guide on the contributed work, but I think it is kinda old now, > especially by keeping in mind new versions of PHP. > > Basically, I would like to create an XML file as output of the htsearch > program, then use an XML parser from the PHP script. The PHP opens a pipe > to the htsearch program and the XML reads its pointer. > > I got some problems as far as the excerpt is concerned. I was just > wondering, if somebody of you is interested on it. And of course has some > ideas and opinion! As Neal suggested, check out the xmlsearch code, which is also bundled in the contrib directory of 3.1.6. You can avoid any HTML code in the excerpts by turning off the add_anchors_to_excerpt attribute, and changing start_ellipses, end_ellipses, start_highlight, and end_highlight. I suppose these attribute definitions should be added to xml.conf in contrib/xmlsearch. I think there will also be a problem with the 3.2 betas, not just with excerpts but with all $&(var) expansions, in that all accented characters are mapped back to ISO-8850-1 character entities, which, if I understand correctly, are invalid in XML. -- Gilles R. Detillieux E-mail: <gr...@sc...> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 |