Fix PairedTTest and SignTest

Search engine and data mining applications and ClueWeb datasets.

Brought to you by: cammiemw, david_fisher, gregorybrooks, jamiecallan, sm-harding

#111 Fix PairedTTest and SignTest

Milestone: Next_Release

Status: accepted

Owner: Lemur Project

Labels: galago (62) eval (11)

Priority: 1

Updated: 2016-01-07

Created: 2014-03-28

Creator: John Foley

Private: No

We need to implement Stat.studentTProb and Stat.binomialProb. It would be nice if this were done in the context of a test case.
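As a rough illustration of what a test-driven version of the binomial piece might look like, here is a minimal sketch in plain Java. The class name BinomialSketch and the method upperTail are hypothetical stand-ins, not the actual signature of Stat.binomialProb (which this ticket does not show). It computes P(X >= k) for X ~ Binomial(n, p), which is the quantity an upper-tailed sign test needs with p = 0.5.

```java
// Hypothetical stand-in for the binomial tail probability; not Galago's Stat class.
final class BinomialSketch {

    /** Returns P(X >= successes) for X ~ Binomial(trials, p), with 0 < p < 1. */
    static double upperTail(int successes, int trials, double p) {
        if (trials < 0 || successes < 0 || successes > trials) {
            throw new IllegalArgumentException("need 0 <= successes <= trials");
        }
        if (!(p > 0.0 && p < 1.0)) {
            throw new IllegalArgumentException("need 0 < p < 1");
        }
        double logP = Math.log(p);
        double logQ = Math.log1p(-p);
        double logCoeff = 0.0; // log C(trials, 0)
        double tail = 0.0;
        for (int i = 0; i <= trials; i++) {
            if (i > 0) {
                // log C(n, i) = log C(n, i - 1) + log(n - i + 1) - log(i)
                logCoeff += Math.log(trials - i + 1) - Math.log(i);
            }
            if (i >= successes) {
                // Work in log space so large coefficients do not overflow.
                tail += Math.exp(logCoeff + i * logP + (trials - i) * logQ);
            }
        }
        return Math.min(1.0, tail); // guard against rounding slightly above 1
    }

    public static void main(String[] args) {
        // By hand: (C(10,8) + C(10,9) + C(10,10)) / 2^10 = 56 / 1024
        System.out.println(upperTail(8, 10, 0.5));
    }
}
```

Hand-computable cases like 8 successes out of 10 (56/1024) make convenient fixtures for a unit test, whatever the real method ends up being called.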

Discussion

David Fisher - 2015-02-02

assigned_to: John Foley

David Fisher - 2015-09-05

Can you use http://commons.apache.org/proper/commons-math/userguide/special.html to address the pieces that we need to implement?

Lemur Project - 2015-10-01

The commons-math3 library has a lot of useful stats (and other) stuff in it. I've locally implemented the t-test and SignTest using the distribution.TDistribution and stat.inference.BinomialTest classes inside of Stat.java. The library allows other implementations as well.

There is still some uncertainty about the testing focus here. The t-test is two-tailed and the SignTest is a one-tailed (upper) test. They should probably both be one-tailed (is treatment better than baseline?), but perhaps just "yes, they're different" is OK too. There are also some issues with the proper degrees of freedom to use.

I'm thinking of committing the changes anyway, since any galago eval using baseline and treatment arguments is currently going to throw an exception and break. But then maybe breakage is better than a not-quite-correct result.
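To make the tail and degrees-of-freedom questions concrete, here is a minimal sketch of the paired t statistic in plain Java. The class and method names are hypothetical and this is not the code in Stat.java. For a standard paired t-test over n query pairs the degrees of freedom are n - 1. Because the t distribution is symmetric, the upper one-tailed p-value is half the two-tailed p-value whenever t > 0. The p-value itself would come from a t CDF such as the commons-math3 TDistribution class mentioned above, which is left out here so the sketch stays dependency-free.

```java
// Hypothetical helper for illustration; not Galago's Stat class.
final class PairedTSketch {

    /** Degrees of freedom for a paired t-test over n pairs: n - 1. */
    static int degreesOfFreedom(int pairs) {
        if (pairs < 2) {
            throw new IllegalArgumentException("need at least 2 pairs");
        }
        return pairs - 1;
    }

    /** t = mean(d) / (sd(d) / sqrt(n)), where d[i] = treatment[i] - baseline[i]. */
    static double tStatistic(double[] baseline, double[] treatment) {
        if (baseline.length != treatment.length) {
            throw new IllegalArgumentException("samples must be paired");
        }
        int n = baseline.length;
        int df = degreesOfFreedom(n);
        double mean = 0.0;
        for (int i = 0; i < n; i++) {
            mean += treatment[i] - baseline[i];
        }
        mean /= n;
        double sumSq = 0.0;
        for (int i = 0; i < n; i++) {
            double dev = (treatment[i] - baseline[i]) - mean;
            sumSq += dev * dev;
        }
        double sd = Math.sqrt(sumSq / df); // sample standard deviation of differences
        // If every difference is identical, sd is 0 and the result is infinite or NaN.
        return mean / (sd / Math.sqrt(n));
    }

    public static void main(String[] args) {
        double[] baseline = {0.0, 0.0};
        double[] treatment = {1.0, 3.0};
        // Positive t means the treatment scored higher on average.
        System.out.println("t = " + tStatistic(baseline, treatment)
                + ", df = " + degreesOfFreedom(baseline.length));
    }
}
```

With this sign convention (treatment minus baseline), "is treatment better than baseline" corresponds to the upper tail, which matches the direction the sign test is described as using.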

Lemur Project - 2016-01-07

status: open --> accepted

assigned_to: John Foley --> Stephen Harding

Lemur Project - 2016-01-07

Implemented directly via Apache commons-math3 library.

Issues remain with the boosted Sign Test, where the boosting process somehow results in the number of improved samples being greater than the number of different samples.

Needs more work.
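One way such a mismatch could arise is sketched below. This is only a guess at the failure mode, not a diagnosis of the Galago code: it assumes, purely for illustration, that boosting multiplies each baseline score by a factor, and that "improved" is counted against the boosted baseline while "different" is counted against the raw baseline. All names here are hypothetical. Counting both from the same comparison guarantees improved <= different by construction.

```java
// Hypothetical illustration only; the real boosting logic is not shown in this ticket.
final class SignCountsSketch {

    /** Returns {improved, different}, both measured against the boosted baseline. */
    static int[] consistentCounts(double[] baseline, double[] treatment, double boost) {
        int improved = 0;
        int different = 0;
        for (int i = 0; i < baseline.length; i++) {
            double boosted = baseline[i] * boost;
            if (treatment[i] != boosted) {
                different++;
                if (treatment[i] > boosted) {
                    improved++; // every improved pair is also a different pair
                }
            }
        }
        return new int[] {improved, different};
    }

    /** Returns {improved, different} using mismatched comparisons (assumed bug shape). */
    static int[] mixedCounts(double[] baseline, double[] treatment, double boost) {
        int improved = 0;
        int different = 0;
        for (int i = 0; i < baseline.length; i++) {
            if (treatment[i] > baseline[i] * boost) {
                improved++; // compared against the boosted baseline
            }
            if (treatment[i] != baseline[i]) {
                different++; // compared against the raw baseline
            }
        }
        return new int[] {improved, different};
    }

    public static void main(String[] args) {
        double[] baseline = {0.5, 0.5, 0.4};
        double[] treatment = {0.5, 0.5, 0.3};
        int[] mixed = mixedCounts(baseline, treatment, 0.5);
        int[] consistent = consistentCounts(baseline, treatment, 0.5);
        System.out.println("mixed: improved=" + mixed[0] + " different=" + mixed[1]);
        System.out.println("consistent: improved=" + consistent[0] + " different=" + consistent[1]);
    }
}
```

If the counts are passed to a binomial test as successes and trials, successes greater than trials is an invalid input, so an invariant check of this kind would be a reasonable thing to assert in a test case.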
