Doubts about S-Match

Cloud6b
2013-02-14
2013-05-28
  • Cloud6b

    Cloud6b - 2013-02-14

    When you perform a search lets say :
    ewn office -k -g -over :

    Overview of noun office

    The noun office has 7 senses
    1. (62) office , business office  -
    - (place of business where professional or clerical duties are performed; "he re
    nted an office in the new building")
    2. (24) agency , federal agency , go
    vernment agency , bureau , office
    , authority  - (an administrative unit
    of government; "the Central Intelligence Agency"; "the Census Bureau"; "Office o
    f Management and Budget"; "Tennessee Valley Authority")
    3. (6) function , office , part [part%1:04
    :01::], role  - (the actions and activities assigned to or requ
    ired or expected of a person or group; "the function of a teacher"; "the governm
    ent must do its part"; "play its role")
    4. (2) office , power  - ((of a government o
    r government official) holding an office means being in power; "being in office
    already gives a candidate a great advantage"; "during his first year in office";
    "during his first year in power"; "the power of the president")
    5. office , office staff  - (professi
    onal or clerical workers in an office; "the whole office was late the morning of
    the blizzard")
    6. office  - (a religious rite or service prescribed by eccle
    siastical authorities; "the offices of the mass")
    7. position , post , berth
    , office , spot , billet , p
    lace , situation  - (a job in an organiza
    tion; "he occupied a post in the treasury")
    se

    How are they associating agency with office?

    2. Also, is anytime description of the sense set by the command -setgloss considered by semantic matching. If yes, how? Because the documentation of the S-Match says it uses gloss based element matcher (syntactic matcher). We comprehended that it gloss level matcher is used for matching the gloss and string based matcher for string comparison .

    we added two words
    postsms with sensekey postsms%1:15:00::  with gloss as "Method to send sms"
    smsservice sensekey smsservice%1:15:00:: with same gloss "Method to send sms"

    So, if senses are not same, then S-match does not perform gloss based matching???

     
  • Aliaksandr Autayeu

    1) I didn't get the question.
    2) "anytime description"? What is that? -setgloss sets the gloss. That's it.

    Now, to S-Match. It has several gloss-based matchers: http://semanticmatching.org/javadocs/it/unitn/disi/smatch/matchers/element/gloss/package-summary.html  They are different, not necessarily syntactic. Which matchers are enabled depends on the configuration. Gloss matchers are not enabled by default. The reason for that is explained in papers.

    Senses with equal glosses. If physically the senses are different (sensekeys, offsets), then they will be treated differently, unless some matcher (such as a gloss-based) establishes their equality.

     
  • Cloud6b

    Cloud6b - 2013-02-15

    For the second question : I meant that if two words have different sensekeys yet have the same gloss then S-match does not show semantic similarity between them. All I wanted to clarify is , gloss based description is not taken into consideration for matching.
    We would try enabling gloss matchers for considering the gloss for mapping purpose. Our project might have cases where different words may not have semantic relations(same sensekeys)  yet they mean the same and in that case gloss can help us out to automate the process.

    For the first question :
    Noun Office has  7 senses. In the second sense : agency is listed along with office. So, Was it that agency was added under the office sensekey and hence they are shown to be semantically related??

    In general : I wanted to clarify that if you add a word to the existing sensekey , then the lemma of the sensekey is replace with the lemma of the word added.
    eg :

    ewn postsms%1:15:00:: -addword smsservice

    Now, you search for the word smsservice and it happens to show the sensekey for smsservice is smsservice%1:15:00::

     
  • Aliaksandr Autayeu

    the same gloss then S-match does not show semantic similarity between them.

    see above - by default gloss matchers are disabled.

    may not have semantic relations(same sensekeys)

    Same sensekeys means they're the same.

    Our project might have cases where different words may not have semantic relations(same sensekeys)  yet they mean the same and in that case gloss can help us out to automate the process.

    This does not sound like a good practice to me. I'd avoid that. I'd better add words to the same synset (existing or new).

    For the first question :
    Noun Office has  7 senses. In the second sense : agency is listed along with office. So, Was it that agency was added under the office sensekey and hence they are shown to be semantically related??

    Synset is a set of synonyms = words with the same meaning. Synset has a gloss, which expresses the meaning of the synset. Sensekey is a kind of pointer which points to a particular word in a particular synset (and thus points to that synset too). In this case you have a synset with a gloss

    (an administrative unit
    of government; "the Central Intelligence Agency"; "the Census Bureau"; "Office o
    f Management and Budget"; "Tennessee Valley Authority")

    and lots of words in it, each having a sensekey. All these words are "equal", they are synonyms. Semantic relation between all of them is synonymity. S-Match approximates that to "equivalence".

    add a word to the existing sensekey

    You add a word to a synset. Sensekey points to a particular word in a synset and thus represents synset too, but you can't "add a word to a sensekey".

    When you add a word to a synset, the synset will have a new word in it. And that new word will have a sensekey. And the old word will have a sensekey too. These sensekeys will be different. Each one will point to its respective word and both point to the same synset.

    Hope that explains it. Let me know if it's not :)

     
  • Cloud6b

    Cloud6b - 2013-02-18

    Hey Hi,….Thanks a lot for the explanation. It has made many things clear.

     
  • Cloud6b

    Cloud6b - 2013-02-22

    What could be the possible reason for GUI to take long time for processing …even though the sample files given as input contains very few nodes lets say less than 5 ??

     
  • Cloud6b

    Cloud6b - 2013-02-25

    Have raised a ticket. It is seen that the processing time has increased due to gloss based matchers. It also depends on number of elements in one file to be mapped. Right now, one of the mappings with 11 nodes and all gloss based mappings takes 30 min and then GUI becomes unresponsive

     
  • Aliaksandr Autayeu

    OK, thank you. Taking a look there.

     

Log in to post a comment.