From: Joe W. <jo...@gm...> - 2022-05-16 12:36:36
|
Hi Stefan, How strange. This definitely sounds like a bug. Would you please open a bug report at https://github.com/eXist-db/exist/issues/new/choose and, if possible, supply an XQSuite test to help the developers reproduce the problem? Thanks, Joe On Mon, May 16, 2022 at 4:20 AM Stefan Dumont <du...@bb...> wrote: > Hi Dannes, > > here is the index configuration: > > https://gist.github.com/StefanDumont/0ab72c24e069f2231ac28bf8cff0b90e > > Everything works fine exept that obviously text in an Oxygen Processing > Instruction is indexed and found by ft:query(). Checking again the > problem, I also found out, that the problem occurs only, when I use a > wildcard in ft:query() (with XML syntax). When I search for the specific > word ("Natural" vs "Natur*") there is no search result (i.e. correct > behaviour). > > Thanks > > Stefan > > Am 15.05.2022 um 18:29 schrieb Dannes Wessels: > > Hi, > > > > Please could you share your index configuration? > > > > Cheers > > > > Dannes > > > >> On 11 May 2022, at 20:36, Stefan Dumont <du...@bb...> wrote: > >> > >> Hi all, > >> > >> since we're using Oxygen XML Author for editing our TEI-XML, the > editors also use the possibility to comment (temporarily) the edited text. > Oxygen stores these comments as XML Processing Instructions (PI) in the > TEI-XML. Of course, we don't want to show them. But by indexing text with > the lucene index these PIs are also indexed and therefore will be find by > ft:query(). Is there a way to ignore them for indexing like other elements? > I didn't find a way ... > >> > >> Thanks for hints & kind regards > >> > >> Stefan > >> > >> -- > >> Berlin-Brandenburgische Akademie der Wissenschaften > >> TELOTA - Digital Humanities > >> Jägerstraße 22/23 > >> 10117 Berlin > >> > >> Tel.: 030 / 20 370 -492 > >> du...@bb... > >> http://www.bbaw.de/die-akademie/mitarbeiter/dumont > >> http://www.bbaw.de/telota > >> > >> > >> _______________________________________________ > >> Exist-open mailing list > >> Exi...@li... > >> https://lists.sourceforge.net/lists/listinfo/exist-open > > -- > Berlin-Brandenburgische Akademie der Wissenschaften > TELOTA - Digital Humanities > Jägerstraße 22/23 > 10117 Berlin > > Tel.: 030 / 20 370 -492 > du...@bb... > http://www.bbaw.de/die-akademie/mitarbeiter/dumont > http://www.bbaw.de/telota > > > _______________________________________________ > Exist-open mailing list > Exi...@li... > https://lists.sourceforge.net/lists/listinfo/exist-open > |