ENH improve HGBT predict classes #27844

lorentzenchr · 2023-11-25T15:01:54Z

Reference Issues/PRs

This PR avoids the call to predict_proba when executing predict in HGBT.

What does this implement/fix? Explain your changes.

Any other comments?

github-actions · 2023-11-25T15:03:12Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: b26e3d2. Link to the linter CI: here}

thomasjpfan

Codewise this looks more efficient. Did you find a measurable performance benefit?

In any case, LGTM.

sklearn/ensemble/_hist_gradient_boosting/gradient_boosting.py

betatim

Looks good to me. I assume it is faster because it does less work (we save the detour via self._loss).

One thing that would have helped me understand why this is an equivalent thing to do is more documentation on what _raw_predict returns. Mostly because my editor couldn't help me find the implementation of self._loss.predict_proba. There is a _loss.c in the directory, but no _loss.pyx, so I guess the actual code for this comes from sklearn/_loss/loss.py?? (you see confusion reigns supreme :D)

sklearn/ensemble/_hist_gradient_boosting/gradient_boosting.py

lorentzenchr · 2023-12-08T17:51:29Z

@betatim Think GLMs: You compute a raw prediction in "link space". This is called linear predictor for GLM, just X @ coef. Then you use a one-to-one function (the inverse link function) to map it back to the scale of the target y, e.g. expit(raw) for binary classification.

sklearn/ensemble/_hist_gradient_boosting/gradient_boosting.py

Co-authored-by: Guillaume Lemaitre <[email protected]>

…scikit-learn into hgbt_predict_class

glemaitre · 2024-01-09T09:40:15Z

@jeremiedbb it seems this has not been merged for the 1.4. Would you be inclined in including it in the 1.4 or should we postpone for 1.5?

lorentzenchr · 2024-01-22T06:52:23Z

The whatsnew entry needs to be moved to 1.5, right?

glemaitre · 2024-01-22T10:15:30Z

Indeed. We need to. I'll make the fix.

Co-authored-by: Guillaume Lemaitre <[email protected]>

ENH improve HGBT predict classes

2796346

github-actions bot added the module:ensemble label Nov 25, 2023

lorentzenchr added 3 commits November 25, 2023 16:04

DOC add whatsnew

a60cae9

Merge branch 'main' into hgbt_predict_class

bc671dc

Merge branch 'main' into hgbt_predict_class

bb21781

lorentzenchr added the Quick Review For PRs that are quick to review label Nov 28, 2023

Merge branch 'main' into hgbt_predict_class

43d1df5

thomasjpfan approved these changes Dec 7, 2023

View reviewed changes

thomasjpfan added the Waiting for Second Reviewer First reviewer is done, need a second one! label Dec 7, 2023

betatim reviewed Dec 8, 2023

View reviewed changes

sklearn/ensemble/_hist_gradient_boosting/gradient_boosting.py Outdated Show resolved Hide resolved

betatim approved these changes Dec 8, 2023

View reviewed changes

thomasjpfan reviewed Dec 8, 2023

View reviewed changes

sklearn/ensemble/_hist_gradient_boosting/gradient_boosting.py Outdated Show resolved Hide resolved

CLN adress review comments

c703e2d

Merge branch 'main' into hgbt_predict_class

7def229

glemaitre reviewed Dec 11, 2023

View reviewed changes

sklearn/ensemble/_hist_gradient_boosting/gradient_boosting.py Outdated Show resolved Hide resolved

lorentzenchr and others added 4 commits December 11, 2023 16:22

Update sklearn/ensemble/_hist_gradient_boosting/gradient_boosting.py

99ba80a

Co-authored-by: Guillaume Lemaitre <[email protected]>

Merge branch 'main' into hgbt_predict_class

ac4be97

Merge branch 'hgbt_predict_class' of https://github.com/lorentzenchr/…

14f08d1

…scikit-learn into hgbt_predict_class

Merge branch 'main' into hgbt_predict_class

b26e3d2

thomasjpfan added this to the 1.5 milestone Jan 20, 2024

thomasjpfan merged commit 897c0c5 into scikit-learn:main Jan 20, 2024

glemaitre mentioned this pull request Jan 22, 2024

DOC fix some entries location of the changelog #28217

Merged

lorentzenchr deleted the hgbt_predict_class branch January 23, 2024 20:06

glemaitre added a commit to glemaitre/scikit-learn that referenced this pull request Feb 10, 2024

ENH improve HGBT predict classes (scikit-learn#27844)

c48d7d7

Co-authored-by: Guillaume Lemaitre <[email protected]>

umaannamalai mentioned this pull request May 21, 2024

Fix sklearn ensemble tests. newrelic/newrelic-python-agent#1148

Merged

lorentzenchr added Performance and removed Waiting for Second Reviewer First reviewer is done, need a second one! labels May 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH improve HGBT predict classes #27844

ENH improve HGBT predict classes #27844

Uh oh!

lorentzenchr commented Nov 25, 2023

Uh oh!

github-actions bot commented Nov 25, 2023 •

edited

Loading

Uh oh!

thomasjpfan left a comment

Uh oh!

Uh oh!

betatim left a comment

Uh oh!

Uh oh!

lorentzenchr commented Dec 8, 2023

Uh oh!

Uh oh!

glemaitre commented Jan 9, 2024

Uh oh!

lorentzenchr commented Jan 22, 2024

Uh oh!

glemaitre commented Jan 22, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

ENH improve HGBT predict classes #27844

ENH improve HGBT predict classes #27844

Uh oh!

Conversation

lorentzenchr commented Nov 25, 2023

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

github-actions bot commented Nov 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

betatim left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lorentzenchr commented Dec 8, 2023

Uh oh!

Uh oh!

glemaitre commented Jan 9, 2024

Uh oh!

lorentzenchr commented Jan 22, 2024

Uh oh!

glemaitre commented Jan 22, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

github-actions bot commented Nov 25, 2023 •

edited

Loading