#5 UndefinedMetaTags auto only indexes first meta tag

closed-fixed
None
1
2001-01-15
2001-01-10
No

Only the first meta tag is indexed when using auto, it seems, and the following tags are considered to be plain text as part of that tag (see the -D output below).

> cat c test.xml

IndexDir test.xml
UndefinedMetaTags auto
IndexContents XML .xml

<meta1>
Random
</meta1>
<meta3>
This is metatest3
Just a sample
</meta3>
<desc>
This is <junk_to remove>the</more_junk> DESCRIPTION of test.xml
</desc>
<meta4> meta four </meta4>

> ../src/swish-e -c c

Indexing Data Source: "File-System"
Indexing test.xml..

Checking file "test.xml"...
test.xml - Using XML filter -
Adding automatic MetaName meta1 <<-- only the first meta?
(28 words)

a: Meta:8 test.xml Strct:1 Freq:1 Pos:8
desc: Meta:8 test.xml Strct:1 Freq:2 Pos:11,24
description: Meta:8 test.xml Strct:1 Freq:1 Pos:20
four: Meta:8 test.xml Strct:1 Freq:1 Pos:27
is: Meta:8 test.xml Strct:1 Freq:2 Pos:5,13
junk: Meta:8 test.xml Strct:1 Freq:2 Pos:14,19
just: Meta:8 test.xml Strct:1 Freq:1 Pos:7
meta: Meta:8 test.xml Strct:1 Freq:1 Pos:26
meta1: Meta:8 test.xml Strct:1 Freq:1 Pos:2
meta3: Meta:8 test.xml Strct:1 Freq:2 Pos:3,10
meta4: Meta:8 test.xml Strct:1 Freq:2 Pos:25,28
metatest3: Meta:8 test.xml Strct:1 Freq:1 Pos:6
more: Meta:8 test.xml Strct:1 Freq:1 Pos:18
of: Meta:8 test.xml Strct:1 Freq:1 Pos:21
random: Meta:8 test.xml Strct:1 Freq:1 Pos:1
remove: Meta:8 test.xml Strct:1 Freq:1 Pos:16
sample: Meta:8 test.xml Strct:1 Freq:1 Pos:9
test: Meta:8 test.xml Strct:1 Freq:1 Pos:22
the: Meta:8 test.xml Strct:1 Freq:1 Pos:17
this: Meta:8 test.xml Strct:1 Freq:2 Pos:4,12
to: Meta:8 test.xml Strct:1 Freq:1 Pos:15
xml: Meta:8 test.xml Strct:1 Freq:1 Pos:23

Discussion

  • Jose Ruiz

    Jose Ruiz - 2001-01-11
    • status: open --> open-fixed
     
  • Bill Moseley

    Bill Moseley - 2001-01-15

    (been fixed, so I closed it)

     
  • Bill Moseley

    Bill Moseley - 2001-01-15
    • status: open-fixed --> closed-fixed
     

Log in to post a comment.