[MRG] MAINT add base class for voting and stacking #15084

glemaitre · 2019-09-24T17:25:41Z

Create a base class for Voting* and Stacking*. Both are ensembles built from multiple learners of different types.
They could share get_params, set_params, and the validation of estimators (and, later, the fitted attributes).

This base class contrasts with ensembles built from a single type of learner, such as boosting (AdaBoost, GBDT), random forests, and bagging.
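As a rough illustration of the shared behaviour described above, here is a minimal pure-Python sketch. It is not the scikit-learn implementation: the class names `HeterogeneousEnsembleSketch`, `VotingSketch`, and `StackingSketch` are hypothetical, and the only assumption carried over from the PR is that `estimators` is a list of `(name, estimator)` tuples.

```python
class HeterogeneousEnsembleSketch:
    """Hypothetical stand-in for a base class shared by Voting* and Stacking*."""

    def __init__(self, estimators):
        # Assumption: estimators is a list of (name, estimator) tuples.
        self.estimators = list(estimators)

    def get_params(self):
        # Expose the list itself plus one entry per named estimator.
        params = {"estimators": self.estimators}
        params.update(dict(self.estimators))
        return params

    def set_params(self, **params):
        # Replace the whole list first, so per-name updates apply to the new list.
        if "estimators" in params:
            self.estimators = list(params.pop("estimators"))
        names = [name for name, _ in self.estimators]
        for name, value in params.items():
            if name not in names:
                raise ValueError(f"Invalid parameter {name!r}")
            self.estimators[names.index(name)] = (name, value)
        return self


class VotingSketch(HeterogeneousEnsembleSketch):
    """Hypothetical subclass; inherits the parameter handling unchanged."""


class StackingSketch(HeterogeneousEnsembleSketch):
    """Hypothetical subclass; inherits the parameter handling unchanged."""
```

Both subclasses get the same name-based parameter handling for free, which is the code the PR proposes to stop duplicating.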

glemaitre · 2019-09-24T17:35:26Z

@thomasjpfan @ogrisel @rth @adrinjalali

The naming of the base class is terrible, but I wanted to have a WIP PR so that we can see what is in common and whether it makes sense to merge code.

NB: the tests will fail because I did not add support for None to drop an estimator (only available in the voting and not in the stacking). This is easily fixed and would ease the deprecation.

WDYT?

sklearn/ensemble/_stacking.py

thomasjpfan

I like this refactoring. The _BaseEnsembleHeterogeneousEstimator class has well-defined boundaries.

sklearn/ensemble/base.py

glemaitre · 2019-10-01T15:30:57Z

Good to be reviewed. I will open a PR to deprecate None support and use 'drop' instead.
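The idea of accepting both `None` and `'drop'` as markers for a disabled estimator can be sketched as below. This is only an illustration under stated assumptions: the helper name `active_estimators` and the error message are hypothetical and not taken from scikit-learn.

```python
def active_estimators(estimators):
    """Return the (name, estimator) pairs that are not disabled.

    Hypothetical helper: an entry is treated as disabled when its
    estimator is None (the older marker) or the string 'drop'.
    """
    kept = [
        (name, est)
        for name, est in estimators
        if est is not None and not (isinstance(est, str) and est == "drop")
    ]
    if not kept:
        # Assumed behaviour for this sketch: at least one estimator must remain.
        raise ValueError("All estimators are dropped; at least one is required.")
    return kept
```

Treating the two markers identically in one place is what would make a later deprecation of `None` a local change.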

doc/whats_new/v0.22.rst

sklearn/ensemble/base.py

sklearn/ensemble/tests/test_stacking.py

sklearn/ensemble/base.py

glemaitre · 2019-10-02T14:08:08Z

Having the init makes it explicit that the estimators parameter is the common denominator for all inherited classes. I am fine keeping it.

On Wed, 2 Oct 2019 at 13:07, Nicolas Hug wrote:

In sklearn/ensemble/base.py:

@@ -178,3 +182,76 @@ def _partition_estimators(n_estimators, n_jobs):
     starts = np.cumsum(n_estimators_per_job)
     return n_jobs, n_estimators_per_job.tolist(), [0] + starts.tolist()
+
+
+class _BaseHeterogeneousEnsemble(MetaEstimatorMixin, _BaseComposition,
+                                 metaclass=ABCMeta):
+    """Base class for ensemble learners based on heterogeneous estimators."""
+    _required_parameters = ['estimators']
+
+    @property
+    def named_estimators(self):
+        return Bunch(**dict(self.estimators))
+
+    @abstractmethod
+    def __init__(self, estimators):
+        self.estimators = estimators

I'm suggesting not having the init method.
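The pattern in the quoted diff, an abstract `__init__` that still assigns `estimators` plus a `named_estimators` property, can be tried out without scikit-learn. The sketch below is an assumption-laden stand-in: `BaseSketch` and `VotingLike` are hypothetical names, and `types.SimpleNamespace` replaces `Bunch` purely to keep the example dependency-free.

```python
from abc import ABC, abstractmethod
from types import SimpleNamespace


class BaseSketch(ABC):
    """Hypothetical analogue of the base class shown in the diff."""

    @property
    def named_estimators(self):
        # SimpleNamespace stands in for Bunch: attribute access by name.
        return SimpleNamespace(**dict(self.estimators))

    @abstractmethod
    def __init__(self, estimators):
        # Abstract, yet callable through super(): it documents that
        # `estimators` is the one parameter every subclass shares.
        self.estimators = estimators


class VotingLike(BaseSketch):
    """Hypothetical subclass adding its own parameter."""

    def __init__(self, estimators, voting="hard"):
        super().__init__(estimators)
        self.voting = voting
```

Because `__init__` is abstract, `BaseSketch` itself cannot be instantiated, while each subclass must define its own constructor. That is the trade-off debated here: the base `__init__` only sets one attribute, but it states the shared contract explicitly.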

NicolasHug

I guess I'm becoming increasingly skeptical about the relevance of inheritance in some cases (like here where all it does is set a single attribute). Makes the code easy to write, but often harder to understand.

But LGTM anyway.

thomasjpfan · 2019-10-02T14:45:57Z

doc/whats_new/v0.22.rst

+- |Fix| Stacking and Voting estimators now ensure that their underlying
+  estimators are either all classifiers or all regressors.
+  We introduced a new base class
+  :class:`ensemble.base._BaseHeterogeneousEnsemble` to raise consistent error


Should we include a private class in the whats new? This can be something like:

Stacking and Voting estimators now raise consistent error messages.

I might have misunderstood what @NicolasHug meant by adding a link.
Did you mean mentioning the class or do you expect something else?

Since we are not generating a new API doc for _BaseHeterogeneousEnsemble, there is nothing to link to: https://76528-843222-gh.circle-artifacts.com/0/doc/whats_new/v0.22.html#sklearn-ensemble

The links referred to the Stacking and Voting estimators, sorry if that wasn't clear. I agree we shouldn't link a private class. (and I'm also fine not linking the estimators... it's just a nit)

Oh OK, that makes sense.

thomasjpfan · 2019-10-04T15:09:49Z

doc/whats_new/v0.22.rst


+- |Fix| Stacking and Voting estimators now ensure that their underlying
+  estimators are either all classifiers or all regressors.
+  We introduced a new base class


Do we need the "new base class" part? It could simply read:

:class:`ensemble.StackingClassifier`, :class:`ensemble.StackingRegressor`, :class:`ensemble.VotingClassifier`, and :class:`ensemble.VotingRegressor` now raise consistent error messages.
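The check behind this changelog entry, that the underlying estimators are all classifiers or all regressors, might look roughly like the sketch below. It assumes estimators advertise their kind through an `_estimator_type` attribute; the function name `validate_estimator_kinds`, the toy classes, and the error wording are hypothetical.

```python
class ToyClassifier:
    # Assumption: the kind is advertised via an `_estimator_type` attribute.
    _estimator_type = "classifier"


class ToyRegressor:
    _estimator_type = "regressor"


def validate_estimator_kinds(estimators, expected_kind):
    """Raise one consistent error if any estimator is of the wrong kind.

    Hypothetical helper: `estimators` is a list of (name, estimator)
    tuples and `expected_kind` is 'classifier' or 'regressor'.
    """
    for name, est in estimators:
        kind = getattr(est, "_estimator_type", None)
        if kind != expected_kind:
            raise ValueError(
                f"The estimator {name!r} should be a {expected_kind}."
            )
    return [name for name, _ in estimators]
```

Routing every ensemble through a single helper of this shape is what would make the error message identical across the four public classes.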

thomasjpfan · 2019-10-05T00:49:36Z

Thank you @glemaitre !

glemaitre added 5 commits September 24, 2019 19:23

MAINT add base class for voting and stacking

b8ba41c

Merge remote-tracking branch 'origin/master' into is/15056

44c1068

iter

d7b8919

iter

cde806b

PEP8

c5c8628

ogrisel reviewed Sep 25, 2019

sklearn/ensemble/_stacking.py (outdated)

thomasjpfan reviewed Sep 25, 2019

sklearn/ensemble/base.py (outdated)

sklearn/ensemble/base.py (outdated)

support None and drop

e2d0535

glemaitre changed the title ~~[WIP] MAINT add base class for voting and stacking~~ [MRG] MAINT add base class for voting and stacking Oct 1, 2019

glemaitre added 2 commits October 1, 2019 17:32

Merge remote-tracking branch 'origin/master' into is/15056

91a9cac

remove parameters from base class

b733971

glemaitre mentioned this pull request Oct 1, 2019

[MRG] Deprecate None in VotingClassifer and VotingRegressor #15090

Closed

iter

1f6bd25

NicolasHug reviewed Oct 1, 2019

sklearn/ensemble/base.py (outdated)

address nicolas comments

0338c86

NicolasHug approved these changes Oct 2, 2019

thomasjpfan approved these changes Oct 2, 2019

fix whats new

5ea1c4b

thomasjpfan approved these changes Oct 4, 2019

Update v0.22.rst

bf8cc38

thomasjpfan merged commit 7dd03e0 into scikit-learn:master Oct 5, 2019
