From: Ionut N. <io...@ef...> - 2001-11-23 15:54:54
|
Hello I have posted a (wish)bug a couple of days ago regarding HTML translations performed by htdig (#484345). I should ave brought the issue on the list first (as Gilles Detillieux suggessted), so I'll just bring up the issue now. htdig does supports (afaik) 3 translations: 1. lg & gt (< >) 2. amp (&) 3. quot (") However, there are some more escapes that I think would be helpful to have. For instance, ' (apostrophe '). Gilles said ' is not supported in HTML - that is correct; however, xhtml1.0 brings in XML well formed documents - in XML, you cannot use ' - ' is escaped as ' XHTML1.0 notes can be found at: http://www.w3.org/TR/xhtml1/ look at A2. Entity sets - special characters http://www.w3.org/TR/xhtml1/DTD/xhtml-special.ent The problem is that there are many more escape sequences (in the &something; style); there are some ways to do it 1. by having a translation table - in which case htdig will translate everything there so that htsearch will not misescape them while displaying results (e.g. from a XHTML source file which has say € when searching the browser will display € instead of the euro sign cause htsearch escapes € into &euro;). 2. Eliminate translations from htdig; htsearch will have to stop escaping what is found in the DB in the &something; form. I think the second way is better. I'm not sure if I explained clearly I'll try to explain again if necessary. Is it possible/desirable ? Thank you, Ionut Nistor io...@ef... |