From: Adam B. <ad...@fr...> - 2003-02-14 01:27:28
|
On Friday 14 February 2003 11:14, Adam Brown wrote: > Hi, > > I am indexing this page using Htdig 3.1.6: > http://wire.org.au/information/violence/domestic/womens_stories/one_rural_w >omans_story.html The page contains the words "woman's" and "womans" but not > "woman'. > > The search page is located at: http://wire.org.au/public_search.html > > When I search for "rural woman's" or "rural womans" I get no hits. However > when I search for "woman" the page is returned. > > My understanding is that using the default Htdig settings that "woman's" > gets indexed as "womans". So surely a search for 'womans' should be > successful. > > Can anyone shed any light on this problem? > > thanks > > Adam > > Researching further: Results from htdig -vvvv indicate that the word "woman" is indexed, not "womans" A search for "women's" (note the e) returns a hit. I looked in the ispell dictionary file english.0 and the listings for the two words are: woman/MY women/MS Is it the case that Htdig reduces the search word "woman's" to "womans" which doesn't register a hit because "woman" is recorded in the database and "womans" is not a valid extension of "woman"? I use the setting: valid_punctuation: .-_/!#$%^&'() Need help with this. thanks Ad |