From: Jon M. <jo...@te...> - 2006-03-13 16:24:50
Peter Crowther wrote:

>>Before taking this radical approach it would be important to determine
>>that the replacement would be equally capable. (E.g. capable of handling
>>accent insensitive searching which is vital for searching non-English
>>text by users who aren't sure how to type accented characters into the
>>search tool or for searching text which may have optional accents or
>>incorrect orthography, not to say Japanese text which may have been
>>entered in either of two phonetic systems.)
>
>I *think* http://exist.sourceforge.net/xquery.html#N10474 says eXist, at
>least, can do this natively.

I think it's saying that the built-in query engine can process the right
kind of query, but I've found an unresolved bug report from earlier this
month on the Saxon SourceForge project that shows that it doesn't work
[bug1444006].

It's also not clear to me that the indexing in eXist can be configured to
support such queries: the documentation on creating indices makes no
mention of collations at all. So I suspect that this kind of query could be
very slow. Searching for text with multiple key words won't be helped at
all either, since eXist supports these via extension syntax to XML Query,
which doesn't provide the option to specify the collation or match strength
the way the standard XML Query text matching functions do.

This is all based on a very quick trawl through the eXist web site, so I
might be wrong. I think the Bodington XML repository is still a nose ahead
of the competition and won't be put out to pasture just yet. ;-)

Jon
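
For anyone unsure what "accent insensitive" matching means in practice, here is a minimal sketch in Python. It is not how eXist, Saxon or Bodington do it; it only approximates what a low-strength collation gives you, by decomposing each character and dropping the combining marks before comparing. The function names fold_accents and accent_insensitive_contains are made up for this illustration.

```python
import unicodedata


def fold_accents(text: str) -> str:
    """Return text with combining marks removed and case folded.

    Assumption: stripping combining marks after NFD decomposition is a
    rough stand-in for a primary-strength collation comparison.
    """
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def accent_insensitive_contains(haystack: str, needle: str) -> bool:
    """True if needle occurs in haystack once accents and case are ignored."""
    return fold_accents(needle) in fold_accents(haystack)


if __name__ == "__main__":
    # A user who cannot type the accents still finds the accented text.
    print(accent_insensitive_contains("Un résumé détaillé", "resume"))
    print(accent_insensitive_contains("naïve", "NAIVE"))
```

Note that this does nothing for the Japanese case (text entered in either hiragana or katakana), and letters with no canonical decomposition, such as ø, are left as they are. Note also that it folds every string at query time, so without an index built on the folded form it would have the same speed problem suspected above.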