Level2 scoring issues when bags of different sizes

Status: Alpha

Brought to you by: krivard, matthewfhurst, pdlug, pradeeprk, wwcohen

#2 Level2 scoring issues when bags of different sizes

Status: open

Owner: nobody

Labels: None

Priority: 5

Updated: 2008-02-28

Created: 2008-02-28

Creator: Anonymous

Private: No

I've run into a problem where Level2 scores two bags differently depending on the order of the arguments when the bags have different sizes. I'm attaching a patch that solved the problem for me.

The problem shows up when Level2.score(s, t) is called with size(s) < size(t). For example:

s = {'Frances', 'Fyfe'}
t = {'Mary', 'Frances', 'Fyfe'}

Level2.score(s, t) -> 1.0
Level2.score(t, s) -> 0.83

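I believe the cause is that the algorithm always iterates over s, the first argument. When s is the smaller bag and every one of its tokens has a match in t, the extra tokens in t are never penalized, so the score comes out as 1.0. In my opinion the algorithm should instead iterate over the larger of the two bags, so that both argument orders give the same score.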
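To make the asymmetry concrete, here is a minimal Python sketch of the behavior described above. It is not the library's actual code and not the attached patch: the function names `level2_score` and `level2_score_symmetric` are hypothetical, and the inner similarity is assumed to be a plain exact-match test, so the second score comes out as 2/3 rather than the 0.83 reported above.

```python
def exact_match(a, b):
    """Hypothetical inner similarity: 1.0 for identical tokens, else 0.0."""
    return 1.0 if a == b else 0.0


def level2_score(s, t, inner=exact_match):
    """Average, over the tokens of s, of the best inner score against t."""
    if not s or not t:
        return 0.0
    return sum(max(inner(a, b) for b in t) for a in s) / len(s)


def level2_score_symmetric(s, t, inner=exact_match):
    """Same average, but always taken over the larger of the two bags."""
    if len(s) < len(t):
        s, t = t, s  # iterate over the larger bag
    return level2_score(s, t, inner)


s = ["Frances", "Fyfe"]
t = ["Mary", "Frances", "Fyfe"]

print(level2_score(s, t))            # iterates over s only
print(level2_score(t, s))            # iterates over t, so 'Mary' lowers the score
print(level2_score_symmetric(s, t))  # same value in either argument order
print(level2_score_symmetric(t, s))
```

Under these assumptions the first call returns 1.0 because 'Mary' is never visited, while the swapped version returns the same value for both argument orders.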

The attached patch makes Level2 iterate over the larger of the two bags. I hope this is helpful.

Discussion

Nobody/Anonymous - 2008-02-28

patch to Level2

level2.patch
