
TST add HalvingSearchCV to common test #20203


Merged: 19 commits into scikit-learn:main, Jun 15, 2021

Conversation

@glemaitre (Member) commented Jun 3, 2021

Follow-up of #20202

This PR implements:

  • avoid setting refit in __init__ by instead adding a private method that is overridden in BaseHalvingSearchCV (see the sketch after this list);
  • enable the common test for this estimator.
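
For reference, a minimal sketch of the pattern described above; the class and method names below are illustrative only and do not reproduce the exact code of this PR.

```python
# Illustrative sketch only: class and method names are hypothetical and do
# not reproduce the exact scikit-learn implementation of this PR.
class BaseSearch:
    def __init__(self, estimator, refit=True):
        # __init__ only stores the parameters it receives, unchanged.
        self.estimator = estimator
        self.refit = refit

    @staticmethod
    def _select_best_index(refit, results):
        # Default behaviour: pick the candidate with the best mean score.
        scores = results["mean_test_score"]
        return scores.index(max(scores))


class BaseHalvingSearch(BaseSearch):
    @staticmethod
    def _select_best_index(refit, results):
        # Successive halving overrides the hook: the best candidate is the
        # one from the last iteration, so there is no need to replace
        # self.refit with a callable in __init__.
        last_iter = max(results["iter"])
        candidates = [i for i, it in enumerate(results["iter"]) if it == last_iter]
        return max(candidates, key=lambda i: results["mean_test_score"][i])
```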

@glemaitre glemaitre marked this pull request as draft June 3, 2021 12:02
@glemaitre glemaitre marked this pull request as ready for review June 8, 2021 12:43
@glemaitre (Member, Author)

@NicolasHug Could you have a look at this PR? We changed __init__ quite a bit and I would like to be sure that we did not overlook anything.

@NicolasHug (Member) left a comment


Are the changes to refit needed?
I personally don't find the static method to be much more readable but maybe there's a good reason?

If we keep these changes, we'll need to remove _refit_callable I think. And we might as well ensure that self.refit is a bool for SH.

@glemaitre (Member, Author)

Are the changes to refit needed?
I personally don't find the static method to be much more readable but maybe there's a good reason?

With the current code, self.refit ends up different from the refit passed to __init__, which looks like it goes against our API contract.
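
For context, a minimal illustration (not the code from this PR) of the contract in question: parameters passed to __init__ must be stored unchanged so that get_params() and clone() round-trip them exactly.

```python
from sklearn.base import BaseEstimator


class MySearch(BaseEstimator):
    def __init__(self, estimator=None, refit=True):
        # Store exactly what was passed; any transformation (e.g. replacing
        # a bool with a callable) must happen later, in fit(), not here.
        self.estimator = estimator
        self.refit = refit


search = MySearch(refit=False)
assert search.get_params()["refit"] is False  # round-trips unchanged
```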

we'll need to remove _refit_callable I think.

I indeed forgot to remove it. Here, the idea was just to move it to a static method to be overridden in successive halving.

And we might as well ensure that self.refit is a bool for SH.

Right, we can do that and add a test. A question regarding this point: is there a reason for not supporting multimetric scoring?

@NicolasHug (Member)

With the current code, self.refit ends up different from the refit passed to __init__, which looks like it goes against our API contract.

Hm, fair. Could we just set self.refit = refit after the call to super().__init__ then? No strong opinion, I'm fine with the proposed changes as well

is there a reason for not supporting multimetric scoring?

I think it's because with multimetric scoring we would not know which scorer to use at each round to select the "survivors" for the next iteration. It sounds like refit could do that, but not really: what if you just want multimetric scoring with no refit?
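
A toy sketch (not scikit-learn code) of the constraint described here: each halving round has to rank candidates by a single scalar score to pick the survivors, which becomes ambiguous as soon as several metrics are reported.

```python
import numpy as np


def select_survivors(scores, n_survivors):
    # Return the indices of the n_survivors best candidates.
    # `scores` must be a 1-D array of scalar scores; with multimetric
    # scoring there would be one column per metric and no obvious way to rank.
    ranked = np.argsort(scores)[::-1]  # best candidates first
    return ranked[:n_survivors]


round_scores = np.array([0.71, 0.65, 0.80, 0.59])
print(select_survivors(round_scores, n_survivors=2))  # [2 0]
```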

@glemaitre (Member, Author)

Hm, fair. Could we just set self.refit = refit after the call to super().__init__ then? No strong opinion, I'm fine with the proposed changes as well

Wouldn't it be better to pass it to super() (as done now), and then add a check in the parameter checks that we only support a single metric together with a boolean refit?

I think it's because with multimetric scoring we would not know which scorer to use at each round to select the "survivors" for the next iteration. It sounds like refit could do that, but not really: what if you just want multimetric scoring with no refit?

In grid-search, we use scoring to specify the multiple metrics and refit to choose which one to use to refit.
Indeed, I was just interested to know if there was a methodological blocker. It seems more like an enhancement we could add at some point if we wanted it. This is enough of an answer for my curiosity :)

@NicolasHug (Member)

and then add a check in the parameter checks that we only support a single metric together with a boolean refit?

yes I think this check will be useful regardless of the strategy!
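
A hedged sketch of the kind of check discussed here; the helper name and error messages are hypothetical, not the code that was merged.

```python
def _check_refit_and_scoring(refit, scoring):
    # Hypothetical helper: successive halving only supports a boolean refit
    # and a single metric.
    if not isinstance(refit, bool):
        raise ValueError(
            f"refit must be a boolean for successive halving, got {refit!r}."
        )
    if isinstance(scoring, (list, tuple, set, dict)):
        raise ValueError(
            "Multimetric scoring is not supported by successive halving; "
            "pass a single scorer (string or callable) instead."
        )
```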

In grid-search, we use scoring to specify the multiple metrics and refit to choose which one to use to refit.
Indeed, I was just interested to know if methodologically there was a blocker

There kinda is IMHO: we do use refit in grid-search, but refit='accuracy' says "use this scorer when refitting the best estimator, at the very end of the search". Here we need something that says "use this scorer to select the k best candidates at each iteration". While it's similar to what refit does, the semantics are different enough for refit not to be a good solution, I think.

@ogrisel (Member) left a comment


The current state of this PR looks good to me. I did not follow the full discussion, but have your concerns been addressed @NicolasHug?

@NicolasHug (Member) left a comment


Thanks @glemaitre, LGTM modulo my two previous comments, but I'm sure they'll be properly addressed (or ignored if not relevant) :)

@glemaitre (Member, Author)

Also, are all of them needed? I would assume that column_or_1d could be enough here, but maybe I'm wrong

Probably check_classification_targets as well. But it is true that the other two can be deferred, since the validation will take place in the underlying classifier.

Let me make the changes.
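
A minimal sketch of the target validation mentioned above, using the two public helpers discussed; how exactly they are wired into the search estimator is an assumption here.

```python
from sklearn.utils.multiclass import check_classification_targets
from sklearn.utils.validation import column_or_1d


def _validate_targets(y):
    # Flatten column vectors to 1-D and reject non-classification targets;
    # further checks are deferred to the underlying classifier.
    y = column_or_1d(y)
    check_classification_targets(y)
    return y
```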

@jeremiedbb (Member) left a comment


LGTM. Thanks @glemaitre !

@ogrisel ogrisel merged commit 6484c4f into scikit-learn:main Jun 15, 2021
@ogrisel (Member) commented Jun 15, 2021

Thanks @glemaitre and everybody else!
