
MAINT Use check_scalar in BaseGradientBoosting #21632


Merged

Conversation

@genvalen (Contributor) commented Nov 11, 2021

Reference Issues/PRs

This PR requires #21990 to be merged first.
Addresses #20724 and #21927
#DataUmbrella

What does this implement/fix? Explain your changes.

Summary of changes to BaseGradientBoosting:

  • Add tests to ensure GradientBoostingClassifier and GradientBoostingRegressor raise proper errors when invalid arguments are passed in.
  • Use the helper function check_scalar from sklearn.utils to validate the scalar parameters (see the sketch below).
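
For reference, a minimal sketch of the check_scalar pattern this PR applies; the concrete value and bounds are illustrative, and the per-parameter ranges are listed in a comment further down:

```python
import numbers

from sklearn.utils import check_scalar

# learning_rate must be a real number in (0.0, inf): check_scalar raises
# TypeError on a wrong type and ValueError on an out-of-range value.
check_scalar(
    0.1,                           # stands in for self.learning_rate in fit()
    name="learning_rate",
    target_type=numbers.Real,
    min_val=0.0,
    include_boundaries="neither",  # exclusive bound, so the value must be > 0.0
)
```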

Test and validation progress:

In both estimators

  • learning_rate
  • n_estimators
  • min_samples_split
  • min_samples_leaf
  • min_weight_fraction_leaf
  • max_depth
  • min_impurity_decrease
  • subsample
  • max_features
  • ccp_alpha
  • verbose
  • max_leaf_nodes
  • warm_start
  • validation_fraction
  • n_iter_no_change
  • tol

In GradientBoostingRegressor

  • alpha

References

  1. check_scalar docs
  2. PR #20723

Any other comments?

For the unchecked tasks, validation comes from BaseDecisionTree; however, tests have been added for them here.
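
A hedged sketch of the style of test added (the parameter choices and expected error messages are illustrative, not the exact merged test code):

```python
import pytest

from sklearn.ensemble import GradientBoostingRegressor


@pytest.mark.parametrize(
    "params, err_type, err_msg",
    [
        # Out-of-range values: check_scalar raises ValueError.
        ({"learning_rate": -1.0}, ValueError, "learning_rate == -1.0, must be >"),
        ({"n_estimators": 0}, ValueError, "n_estimators == 0, must be >= 1"),
        # Wrong type: check_scalar raises TypeError.
        ({"n_estimators": 1.5}, TypeError, "n_estimators must be an instance of"),
    ],
)
def test_gradient_boosting_raises_on_invalid_params(params, err_type, err_msg):
    X, y = [[1.0], [2.0], [3.0], [4.0]], [0.5, 1.5, 2.5, 3.5]
    # Validation runs inside fit(), so the error surfaces there.
    with pytest.raises(err_type, match=err_msg):
        GradientBoostingRegressor(**params).fit(X, y)
```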

@genvalen (Contributor, Author) commented Dec 13, 2021

Notes for parameter ranges in Trees
(param: type, range; a validation sketch for the int-or-float case follows the list)
learning_rate: float, (0.0, inf)
n_estimators: int, [1, inf)
min_samples_split: int -> [2, inf), float -> (0, 1]
min_samples_leaf: int -> [1, inf), float -> (0, 1]
min_weight_fraction_leaf: float, [0, 0.5]
max_depth: int, if not None, then [1, inf)
min_impurity_decrease: float, [0, inf)
subsample: float, (0, 1]
alpha: float, (0, 1)
max_features: int -> [1, number of features], float -> (0, 1], or string (not checked with check_scalar)
ccp_alpha: float, [0, inf)
verbose: int, [0, inf), or np.bool_
max_leaf_nodes: int, if not None, then [2, inf)
warm_start: int, (-inf, inf), or np.bool_
validation_fraction: float, (0, 1)
n_iter_no_change: int, [1, inf)
tol: float, (0, inf)
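
A minimal sketch of how the int-or-float ranges above map onto check_scalar calls, using min_samples_split as the example; the dispatch helper is illustrative, not the exact merged code:

```python
import numbers

from sklearn.utils import check_scalar


def validate_min_samples_split(value):
    # Illustrative helper: the ranges come from the notes above.
    if isinstance(value, numbers.Integral):
        # int: at least 2 samples are needed to split an internal node.
        check_scalar(value, name="min_samples_split",
                     target_type=numbers.Integral, min_val=2)
    else:
        # float: interpreted as a fraction of the samples, in (0, 1].
        check_scalar(value, name="min_samples_split",
                     target_type=numbers.Real, min_val=0.0, max_val=1.0,
                     include_boundaries="right")
```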

@genvalen (Contributor, Author) commented Jan 8, 2022

Hi @glemaitre: just want to note that for the 7 remaining params in the PR task list (and also max_features), the validation comes from BaseDecisionTree.

I have added tests for these params in BaseGradientBoosting that fail right now, but they should pass once BaseDecisionTree #21990 gets merged in. (They are commented out for now.)

For these remaining params, please let me know if it would be helpful to include the validation explicitly within BaseGradientBoosting, too. Thank you!
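
As a side note, an alternative to commenting the pending tests out would be marking them as expected failures until the dependency lands; a sketch (the test name and reason string are hypothetical):

```python
import pytest


# Hypothetical marker: the test runs but is reported as an expected failure
# until scikit-learn#21990 (BaseDecisionTree validation) is merged.
@pytest.mark.xfail(reason="waiting on #21990", strict=False)
def test_pending_base_decision_tree_validation():
    ...
```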

@genvalen genvalen changed the title [WIP] MAINT Use check_scalar in BaseGradientBoosting [MRG] MAINT Use check_scalar in BaseGradientBoosting Jan 31, 2022
@genvalen genvalen marked this pull request as ready for review January 31, 2022 05:52
@genvalen (Contributor, Author) commented:

Hi @glemaitre and @ogrisel, please review this at your convenience. Thank you!

@glemaitre (Member) left a comment:

LGTM

Co-authored-by: Guillaume Lemaitre <[email protected]>
Comment on lines 375 to 377
max_val=self.n_features_in_,
include_boundaries="both",
)

Suggested change:
-    max_val=self.n_features_in_,
-    include_boundaries="both",
-)
+    include_boundaries="left",
+)
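
For context, include_boundaries controls which of min_val/max_val are inclusive; a sketch of the before/after semantics, assuming a min_val=1 set above the quoted lines (the excerpt does not show it):

```python
from sklearn.utils import check_scalar

# Before the suggestion: both boundaries inclusive, so the check enforces
# 1 <= max_features <= n_features_in_ (5 stands in for self.n_features_in_).
check_scalar(3, name="max_features", target_type=int,
             min_val=1, max_val=5, include_boundaries="both")

# After the suggestion: only the inclusive lower bound is enforced,
# i.e. max_features >= 1.
check_scalar(3, name="max_features", target_type=int,
             min_val=1, include_boundaries="left")
```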

@thomasjpfan thomasjpfan changed the title [MRG] MAINT Use check_scalar in BaseGradientBoosting MAINT Use check_scalar in BaseGradientBoosting Feb 7, 2022
@thomasjpfan (Member) left a comment:

LGTM

@thomasjpfan thomasjpfan merged commit b80138f into scikit-learn:main Feb 7, 2022
glemaitre added a commit to glemaitre/scikit-learn that referenced this pull request Feb 9, 2022
thomasjpfan added a commit to thomasjpfan/scikit-learn that referenced this pull request Mar 1, 2022
@genvalen genvalen deleted the BaseGradientBoosting_add_check_scalar branch March 10, 2023 20:10