From: <uta...@we...> - 2001-10-29 13:45:28
Hi Jamie,
thanks a lot for your help. I solved my problem with 'url_rewrite_rules'
and I'm very happy :-).
I could also remove the value of the parameter with a regular expression:
url_rewrite_rules: (.*)&_last=[0-9]*&(.*) \\1&\\2
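To see what this rule does to a URL, here is a minimal sketch in Python. It uses Python's `re` module as a stand-in for the regex engine in ht://Dig, which is an assumption; the example URLs and the `rewrite` helper are made up for illustration. Note that the pattern needs an `&` on both sides of `_last`, so a `_last` parameter at the very end of the query string is left untouched.

```python
import re

# The rule from the message, written in Python syntax. Assumption: this
# simple pattern behaves the same way in ht://Dig's own regex engine.
PATTERN = re.compile(r"(.*)&_last=[0-9]*&(.*)")
REPLACEMENT = r"\1&\2"


def rewrite(url: str) -> str:
    """Apply the rule once; URLs that do not match come back unchanged."""
    return PATTERN.sub(REPLACEMENT, url, count=1)


if __name__ == "__main__":
    # Hypothetical URLs, not taken from the original thread.
    print(rewrite("http://example.com/page?id=7&_last=12345&lang=de"))
    print(rewrite("http://example.com/page?id=7&_last=12345"))
```

The first URL loses `_last=12345`; the second is returned as-is because nothing follows the parameter.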
--
Uta Becht
"Jamie Anstice" <jam...@sl...> wrote on 26.10.01:
Won't this just reject the whole URL? If I understand the problem, Uta wants
to throw out the session parameter but leave the rest intact, so the page is
fetched once only. I've come across this issue before, and I've got a patch
for 3.2.x (which I ported from our 3.1.6 version), but I think you could use
url_rewrite_rules too.
My patch is somewhat more specific than url_rewrite_rules, in that it just
removes unwanted parameters and their values from the URL. I'll post it along
with my patch for ignoring the alt text from images (which is driven from a
config option) in a few days - I'm currently experimenting with tweaking the
scoring to make an alternate 'or' behaviour which scores up results which
contain more than one search term.
Short explanation by way of example: say I'm indexing a university website.
The chemistry department has a whole bunch of pages with the word chemistry
all through them, and one page telling students where to buy replacement lab
glassware. An 'and' search for 'chemistry glassware' finds this page and
nothing else; an 'and' search for 'chemistry glassware sales' finds nothing.
An 'or' search for 'chemistry glassware sales' is swamped by the occurrences
of 'chemistry', and 'glassware' is lost in the noise. What I'm doing is
factoring in the number of distinct words from the search phrase found in the
result to bump up the score for pages with more than one search term. Initial
results look quite promising, but will need a bit of tuning to improve search
speed.
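The idea can be sketched roughly as follows. This is not the actual patch, just an illustration in Python under stated assumptions: the page data, the function names, and the `boost` factor are all hypothetical, and the real scoring in ht://Dig works differently in detail. A plain 'or' score sums term occurrences; the boosted score multiplies it by a factor for every additional distinct search term found.

```python
from typing import Dict, List


def or_score(term_counts: Dict[str, int], query: List[str]) -> int:
    """Plain 'or' score: total occurrences of any query term on the page."""
    return sum(term_counts.get(term, 0) for term in set(query))


def boosted_score(term_counts: Dict[str, int], query: List[str],
                  boost: float = 10.0) -> float:
    """'or' score scaled up once for each extra distinct query term found.

    `boost` is a made-up tuning knob, not a value from the thread.
    """
    distinct = sum(1 for term in set(query) if term_counts.get(term, 0) > 0)
    if distinct == 0:
        return 0.0
    return or_score(term_counts, query) * boost ** (distinct - 1)


if __name__ == "__main__":
    # Hypothetical word counts for two pages on the university site.
    intro_page = {"chemistry": 40}
    glassware_page = {"chemistry": 3, "glassware": 5}
    query = ["chemistry", "glassware", "sales"]

    for name, page in [("intro", intro_page), ("glassware", glassware_page)]:
        print(name, or_score(page, query), boosted_score(page, query))
```

With these made-up counts the plain 'or' score ranks the intro page first (40 against 8), while the boosted score ranks the glassware page first (80.0 against 40.0), because it matches two of the three search terms.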
Jamie Anstice
Search Engineer
S.L.I. Systems
jam...@sl...
ph: 64 961 3262
mobile: 64 21 264 9347
Geoff Hutchison <ghu...@ws...>
Sent by: htd...@li...
26/10/01 01:49
To: "Uta Becht" <uta...@we...>
cc: "htdig" <htd...@li...>
Subject: Re: [htdig-dev] Problemes with bad query-string
At 11:12 AM +0200 10/25/01, Uta Becht wrote:
>Can someone give me an idea at which position of htdig I should
>eliminate this bad query_parameter ??
Why not use bad_querystr:
<http://www.htdig.org/attrs.html#bad_querystr>
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
_______________________________________________
htdig-dev mailing list
htd...@li...
https://lists.sourceforge.net/lists/listinfo/htdig-dev