grid_search = GridSearchCV(grid_clf, linear_SVC_params, n_jobs=100, score_func=f1_score)

This variation seems to be caused by the value of n_jobs. With n_jobs=10 the result varies between runs; with n_jobs=100 it doesn't!

Can anyone shed light on why this happens? The documentation says that this parameter only sets the number of jobs to run in parallel, so I assumed it affects only the efficiency of the program, not its results.
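One way to narrow this down is to fix every other source of randomness and change only n_jobs. The following is a minimal sketch, not the code from my experiment: it assumes a current scikit-learn (the `scoring` argument and `sklearn.model_selection`, rather than the `score_func` argument in the call above), and the synthetic data, the C grid, and the helper `run_search` are made up for illustration.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, StratifiedKFold
from sklearn.svm import LinearSVC

# Synthetic data stands in for the real dataset (assumption).
X, y = make_classification(n_samples=300, n_features=20, random_state=0)

# Fix the folds and the solver seed so that n_jobs is the only thing that changes.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
param_grid = {"C": [0.01, 0.1, 1.0, 10.0]}  # illustrative grid


def run_search(n_jobs):
    """Run the same grid search with a given degree of parallelism."""
    clf = LinearSVC(tol=1e-4, random_state=0, max_iter=10000)
    search = GridSearchCV(clf, param_grid, scoring="f1", cv=cv, n_jobs=n_jobs)
    search.fit(X, y)
    return search.best_params_, search.cv_results_["mean_test_score"]


params_serial, scores_serial = run_search(n_jobs=1)
params_parallel, scores_parallel = run_search(n_jobs=2)

print("serial:  ", params_serial, scores_serial)
print("parallel:", params_parallel, scores_parallel)
print("same scores:", np.allclose(scores_serial, scores_parallel))
```

If the scores agree once the folds and the seed are pinned, the variation most likely comes from unseeded randomness (fold assignment or the solver) rather than from n_jobs itself.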

I would like to open an issue for this on the project's issue tracker.

On Wed, May 18, 2011 at 7:49 PM, Alexandre Gramfort <alexandre.gramfort@inria.fr> wrote:
> I am using a tolerance of 0.0001 for liblinear, which is the
> default setting. How much further should I reduce it?

It will depend on how big your C is. A good test is to check that the
performance no longer depends on this parameter.

To illustrate this, try running

http://scikit-learn.sourceforge.net/auto_examples/linear_model/plot_logistic_path.html

(I used tol=1e-6) with the default tol and you will see a much less nice path.
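The check described above can be sketched as a small sweep over tol. This is only an illustration under assumptions: it uses a current scikit-learn API, synthetic data in place of the real dataset, and an arbitrary C and list of tolerances.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.svm import LinearSVC

# Synthetic data stands in for the real dataset (assumption).
X, y = make_classification(n_samples=300, n_features=20, random_state=0)

# Same folds for every tolerance, so differences come from tol alone.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

scores_by_tol = {}
for tol in [1e-2, 1e-4, 1e-6, 1e-8]:  # illustrative values
    clf = LinearSVC(C=1.0, tol=tol, random_state=0, max_iter=100000)
    scores = cross_val_score(clf, X, y, scoring="f1", cv=cv)
    scores_by_tol[tol] = scores.mean()
    print(f"tol={tol:g}  mean F1={scores.mean():.4f}")
```

Once the mean score stops changing as tol shrinks, the tolerance is small enough for that value of C; repeat the sweep for the largest C in the grid, since that is where the solver tends to need the tightest tolerance.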

We need a warm restart!

Alex

--
Regards,

Denzil Correa
Ph.D Scholar
Indraprastha Institute of Information Technology, Delhi 
http://www.iiitd.ac.in/