INSEMTIVES - Incentives for Semantics / Bugs / #51 getResourcesByTerms not dealing with free-text tags correctly

Juan Pane - 2011-10-28

assigned_to: juanpane --> pravdin

Juan Pane - 2011-10-28

Assigning the ticket to Viktor

Germán Toro del Valle - 2011-10-31

Increasing priority to highest value...

Germán Toro del Valle - 2011-10-31

priority: 5 --> 9

Viktor Pravdin - 2011-10-31

Fixed, please check.

Viktor Pravdin - 2011-10-31

assigned_to: pravdin --> gtorodelvalle

Germán Toro del Valle - 2011-10-31

Right now the same search: http://insemtives.science.unitn.it/platform-rest/knowledge-service/getResourcesByTerms?json=%7B%22includeTargets%22%3Atrue%2C%22maxSpecificityDistance%22%3A2%2C%22maxGeneralityDistance%22%3A0%2C%22operator%22%3A%22or%22%2C%22senses%22%3A%5B%7B%22term%22%3A%22multimedia%22%2C%22_specializationType%22%3A%22org.insemtives.platform.unitn.commons.model.QueryTerm%22%7D%5D%7D

returns results for resources annotated with the term ("multimedia") both with and without senses. The point is that, with the default parameter values, the resources annotated with a sense get much higher scores even though the user did not specify a sense in the query. There is an email thread about this to centralize the discussion.
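For readers who want to inspect or rebuild the request above, here is a minimal sketch of how the URL-encoded `json` parameter can be produced and decoded. The helper names `build_url` and `decode_query` are hypothetical and are not part of the platform; only the endpoint and the query fields come from the request quoted above.

```python
import json
from urllib.parse import quote, unquote

# Endpoint and query body copied from the request quoted above.
BASE_URL = (
    "http://insemtives.science.unitn.it/platform-rest/"
    "knowledge-service/getResourcesByTerms"
)

query = {
    "includeTargets": True,
    "maxSpecificityDistance": 2,
    "maxGeneralityDistance": 0,
    "operator": "or",
    "senses": [
        {
            # No "synsetUri" key: the user did not pick a sense.
            "term": "multimedia",
            "_specializationType": "org.insemtives.platform.unitn.commons.model.QueryTerm",
        }
    ],
}


def build_url(base_url, query_body):
    """Hypothetical helper: serialize the query and percent-encode it."""
    # Compact separators avoid spaces, matching the layout of the quoted URL.
    body = json.dumps(query_body, separators=(",", ":"))
    return base_url + "?json=" + quote(body, safe="")


def decode_query(url):
    """Hypothetical helper: recover the JSON object from the json parameter."""
    encoded = url.split("?json=", 1)[1]
    return json.loads(unquote(encoded))


url = build_url(BASE_URL, query)
print(url)
print(decode_query(url)["senses"][0])
```

Decoding the URL this way makes it easy to confirm that the term carries no `synsetUri` at all, which is the situation the ticket is about.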

Germán Toro del Valle - 2011-10-31

assigned_to: gtorodelvalle --> juanpane

Juan Pane - 2011-11-01

Assigning the ticket to Viktor

Juan Pane - 2011-11-01

assigned_to: juanpane --> pravdin

Viktor Pravdin - 2012-01-11

This issue was discussed by email, so let me summarize it here. The core of the problem is defining how we should treat a request in which the synset URI is absent. In general, there are three cases:
1) The synset URI is present and has a value. In this case the service will return only the terms with the given sense.
2) The synset URI is present and its value is null. In this case the service will return only the free-text terms.
3) The synset URI is absent. At the moment the service returns all terms, and with the default scoring parameters the terms with senses get higher scores. The scoring can be tweaked (e.g., by setting conceptTermWeight and termWeight to the same value) so that free-text terms and terms with senses rank equally.
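The three cases above can be sketched as a small matching function. This is only an illustration of the proposed semantics under stated assumptions, not the knowledge service's code: the annotation records, the function names, and the one-weight-per-kind scoring with defaults 2.0 and 1.0 are all hypothetical, and the sketch ignores maxSpecificityDistance and maxGeneralityDistance.

```python
# Sentinel that distinguishes "key absent" from "key present with null".
_ABSENT = object()


def classify(query_term):
    """Return 1, 2 or 3, matching the numbered cases in the list above."""
    synset = query_term.get("synsetUri", _ABSENT)
    if synset is _ABSENT:
        return 3  # synset URI absent
    if synset is None:
        return 2  # synset URI present, value null
    return 1      # synset URI present with a value


def matches(query_term, annotation):
    """Decide whether a stored annotation satisfies the query term."""
    if annotation["term"] != query_term["term"]:
        return False
    case = classify(query_term)
    if case == 1:
        return annotation.get("synsetUri") == query_term["synsetUri"]
    if case == 2:
        return annotation.get("synsetUri") is None
    return True  # case 3: free-text and sense-annotated terms both match


def score(annotation, concept_term_weight=2.0, term_weight=1.0):
    """Hypothetical scoring: one weight per annotation kind (assumed values)."""
    if annotation.get("synsetUri"):
        return concept_term_weight
    return term_weight


def search(query_term, annotations):
    """Return the ids of matching resources, in stored order."""
    return [a["resource"] for a in annotations if matches(query_term, a)]


SYNSET = "http://www.w3.org/2006/03/wn/wn20/instances/synset-multimedia-noun-1"

# Hypothetical annotations: one with a sense, one free-text.
annotations = [
    {"resource": "r1", "term": "multimedia", "synsetUri": SYNSET},
    {"resource": "r2", "term": "multimedia", "synsetUri": None},
]

print(search({"term": "multimedia", "synsetUri": SYNSET}, annotations))
print(search({"term": "multimedia", "synsetUri": None}, annotations))
print(search({"term": "multimedia"}, annotations))
```

With equal weights, for example `score(a, 1.0, 1.0)`, both kinds of annotation get the same score in this toy model, which mirrors the tweak suggested in case 3.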

Please let us know whether this resolves the issue or whether any additional action is needed. As far as I know Germán is not available at the moment, so I am assigning the ticket to Daniel.

Viktor Pravdin - 2012-01-11

assigned_to: pravdin --> danielfdez

Daniel Fernández Casado - 2012-01-24

assigned_to: danielfdez --> pravdin

Daniel Fernández Casado - 2012-01-24

Hi Viktor,

Last week we wanted to focus on the experiment. Now I am trying to reproduce the three cases you defined two weeks ago.

For example:

1) http://insemtives.science.unitn.it/test/platform-rest/knowledge-service/getResourcesByTerms?json={"includeTargets":true,"maxSpecificityDistance":5,"maxGeneralityDistance":5,"operator":"or","senses":[{"term":"multimedia","synsetUri":"http://www.w3.org/2006/03/wn/wn20/instances/synset-multimedia-noun-1","_specializationType":"org.insemtives.platform.unitn.commons.model.QueryTerm"}]}

2) http://insemtives.science.unitn.it/test/platform-rest/knowledge-service/getResourcesByTerms?json={"includeTargets":true,"maxSpecificityDistance":5,"maxGeneralityDistance":5,"operator":"or","senses":[{"term":"multimedia","synsetUri":null,"_specializationType":"org.insemtives.platform.unitn.commons.model.QueryTerm"}]}

3) http://insemtives.science.unitn.it/test/platform-rest/knowledge-service/getResourcesByTerms?json={"includeTargets":true,"maxSpecificityDistance":5,"maxGeneralityDistance":5,"operator":"or","senses":[{"term":"multimedia","_specializationType":"org.insemtives.platform.unitn.commons.model.QueryTerm"}]}

There are no differences in the results between cases 2 and 3. The expected output for case 3 is the union of the outputs of cases 1 and 2. Is this understanding correct? If so, the problem is not solved :( I await your comments.
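As a rough aid for repeating this check, the sketch below builds the three request bodies and tests the union expectation on lists of resource identifiers. The names `make_query` and `is_union` are hypothetical, and so is the assumption that results can be compared by resource id; the thread does not describe the response format.

```python
import json

# Sentinel for "do not send the synsetUri key at all" (case 3).
_ABSENT = object()
QUERY_TERM_TYPE = "org.insemtives.platform.unitn.commons.model.QueryTerm"


def make_query(term, synset_uri=_ABSENT):
    """Hypothetical helper: build the body for case 1, 2 or 3."""
    sense = {"term": term}
    if synset_uri is not _ABSENT:
        # Python None is serialized as JSON null (case 2).
        sense["synsetUri"] = synset_uri
    sense["_specializationType"] = QUERY_TERM_TYPE
    return {
        "includeTargets": True,
        "maxSpecificityDistance": 5,
        "maxGeneralityDistance": 5,
        "operator": "or",
        "senses": [sense],
    }


def is_union(case1_ids, case2_ids, case3_ids):
    """True if case 3 returned exactly the resources of cases 1 and 2."""
    return set(case3_ids) == set(case1_ids) | set(case2_ids)


SYNSET = "http://www.w3.org/2006/03/wn/wn20/instances/synset-multimedia-noun-1"

bodies = {
    1: json.dumps(make_query("multimedia", SYNSET), separators=(",", ":")),
    2: json.dumps(make_query("multimedia", None), separators=(",", ":")),
    3: json.dumps(make_query("multimedia"), separators=(",", ":")),
}
for case, body in bodies.items():
    print(case, body)
```

Cases 2 and 3 differ only in whether the `synsetUri` key appears with a null value or is missing, so the distinction exists only at the JSON level. Whether the service preserves it after parsing is not stated in the thread.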

Viktor Pravdin - 2012-01-24

Sorry, I probably didn't explain it clearly. The three-item list in my previous comment is a proposal that I gathered from the long email thread, and it is not implemented yet. My message was meant to ask whether the proposal is correct and complete. If it is the desired behavior, please confirm it and we can start implementing it; if it is not, please correct the proposal.
Right now the knowledge service treats cases 2 and 3 in the same way.

Viktor Pravdin - 2012-01-24

assigned_to: pravdin --> danielfdez

Daniel Fernández Casado - 2012-01-24

assigned_to: danielfdez --> pravdin

Daniel Fernández Casado - 2012-01-24

I think the proposal looks good because it gives enough flexibility to choose the behavior that works best for each use case. Also, making things customizable and configurable (the conceptTermWeight and termWeight parameters) is very positive for the project. The proposal is correct.
