MNT: trees/forests/GBT: deprecate "friedman_mse" criterion
#32708
Conversation
sklearn/tree/_classes.py
Outdated
Training using "absolute_error" is significantly slower
than when using "squared_error".
Unrelated to this PR, but the MAE criterion is now at most 10x slower than the MSE one, and usually more like 5x slower. Is it really significantly slower? It will still fit fairly fast for most tabular datasets (fewer than, let's say, 10M points).
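For reference, a minimal timing sketch one could run to check this claim (the dataset shape, depth, and seed below are illustrative assumptions, not numbers from this PR):

```python
import time

import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.RandomState(0)
X = rng.standard_normal((50_000, 20))
y = rng.standard_normal(50_000)

for criterion in ("squared_error", "absolute_error"):
    tree = DecisionTreeRegressor(criterion=criterion, max_depth=8, random_state=0)
    tic = time.perf_counter()
    tree.fit(X, y)
    # "absolute_error" is expected to be a small constant factor slower.
    print(f"{criterion}: {time.perf_counter() - tic:.2f} s")
```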
I am inclined to remove this sentence. The reason for choosing either squared error or absolute error should not be fit time, but use case / statistically driven.
Let's remove.
Happy to 😄
"friedman_mse" criterion
@pytest.mark.skip("Skip for now")
def test_huber_exact_backward_compat():
I propose to delete this test, as the changes proposed by this PR are not exactly backward compatible, given that the criterion calculations now use a different but equivalent formula.
Another option would be to update it with the current values, but I feel that such a test prevents legitimate changes/improvements that slightly affect any calculation.
I wouldn't skip. What's the difference after the change in this test? What should be the tolerance for it to pass?
What should be the tolerance for it to pass?
rtol=0.1, which doesn't make much sense for a test named test_huber_exact_backward_compat
In this test, the model 100% overfits, so the values checked in the asserts are mostly 0 + some float precision noise I think 😅 , at least the one for which rtol=0.1 would be needed.
So I propose to transform this test into test_huber_overfit, see my commit 4009fd3
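For illustration, a sketch of what such an overfitting test could look like (the dataset, hyperparameters, and tolerance are assumptions; the actual test added in commit 4009fd3 may differ):

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor


def test_huber_overfit():
    # Tiny dataset with unconstrained tree depth: every leaf becomes pure,
    # so the boosting iterations drive the training error down to float noise.
    rng = np.random.RandomState(42)
    X = rng.uniform(size=(10, 2))
    y = rng.uniform(size=10)
    gbt = GradientBoostingRegressor(
        loss="huber",
        n_estimators=200,
        max_depth=None,
        learning_rate=0.5,
        random_state=0,
    )
    gbt.fit(X, y)
    # The training predictions should (nearly) interpolate the targets.
    np.testing.assert_allclose(gbt.predict(X), y, atol=1e-6)
```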
adrinjalali
left a comment
Otherwise LGTM
adrinjalali
left a comment
LGTM. Maybe @lorentzenchr would like to have a look.
Edit: Discussion resolved.
ogrisel
left a comment
Since the concerns raised by #32700 (comment) have been addressed in the follow-up discussion, I think we can finalize the review and merge of this PR.
Besides the following points, it looks good to me.
... )
>>> cross_val_score(estimator, X, y, cv=5, scoring=mean_pinball_loss_95p)
- array([13.6, 9.7, 23.3, 9.5, 10.4])
+ array([14.3, 9.8, 23.9, 9.4, 10.8])
Is this change caused by rounding error discrepancies when switching from criterion="friedman_mse" to criterion="squared_error"? Or is this a consequence of #32707?
It's caused by rounding error discrepancies. #32707 only affects the scale of the impurity I think, see the fix I made temporarily in PR #32699, and here the scale of impurity shouldn't impact the results (trees are controlled by max_depth only).
But since the impurity calculation (criterion.impurity_improvement) is now implemented differently, the rounding errors differ, and when several features have very similar (or identical) splits, the selected split might differ.
This happens for all the losses except loss="absolute_error": since the gradient is -1 or 1 for this loss, I think there are no rounding errors (integers everywhere, and integers small enough to fit in the mantissa of a float64).
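As a toy illustration of this point (unrelated to the actual Cython criterion code), two algebraically equivalent ways of computing the variance reduction of a split generally differ in the last bits of a float64, which is enough to flip ties between otherwise identical candidate splits:

```python
import numpy as np

rng = np.random.default_rng(0)
y = rng.normal(size=1000)
left, right = y[:400], y[400:]


def improvement_two_pass(parent, left, right):
    # Variance computed around an explicit mean (two passes over the data).
    return parent.var() - (left.size * left.var() + right.size * right.var()) / parent.size


def improvement_from_sums(parent, left, right):
    # Same quantity from running sums, E[y^2] - E[y]^2, closer to what a
    # tree criterion accumulates while scanning split positions.
    def var(a):
        return (a**2).sum() / a.size - (a.sum() / a.size) ** 2

    return var(parent) - (left.size * var(left) + right.size * var(right)) / parent.size


a = improvement_two_pass(y, left, right)
b = improvement_from_sums(y, left, right)
print(a, b, a == b)  # equal in exact arithmetic, usually not bit-for-bit in float64
```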
and :class:`ensemble.GradientBoostingRegressor`,
as it had no actual effect.
I think we should rather document this as a fix and rephrase this entry accordingly, since this PR changes the default behavior of the estimators by changing the underlying implementation from the buggy "friedman_mse" implementation to the correct "squared_error" implementation (#32707).
Hum... I wrote both a .fix.rst and a .api.rst, each one focuses on a different aspect. Let me know what you think.
@skip_if_32bit
def test_huber_exact_backward_compat():
    """Test huber GBT backward compat on a simple dataset.

    The results to compare against are taken from scikit-learn v1.2.0.
    """
Please undo the changes to this test. It should still pass. If not, this PR introduces a regression.
This test doesn't pass after the changes in this PR because of rounding errors. As mentioned above, in this test (test_huber_exact_backward_compat) the model 100% overfits, so the values checked in the asserts are mostly 0/<some_int> + some float precision noise. Checking the exact values means checking the exact float precision noise, which prevents almost any change, even a meaningful and valid one, from passing.
If not, this PR introduces a regression
No. This test was just not a meaningful test (as I already found and fixed many since I started working on scikit-learn...).
Typically, this test fails when you change median = _weighted_percentile(y_true, sample_weight, 50) to median = _weighted_percentile(y_true, sample_weight, 50, average=True) in HuberLoss.fit_intercept_only, even though I think we consider average=True the better option for computing the weighted median. At least that's what I assumed when I wrote PR #32100, and it was, in a sense, confirmed by @ogrisel, who asked me to test my logic against _weighted_percentile(..., average=True).
I agree that we will need to open another PR to use average=True in HuberLoss.fit_intercept_only, check that optimizing the HuberLoss on a dataset with symmetrically distributed target data and constant features returns the same as np.median whether the sum of integer sample weights is even or odd, and remove this arbitrary bias.
I'd be happy to do that 😄 We have the same thing for the losses AbsoluteError and PinballLoss.
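A numpy-only illustration of the even/odd point above (this uses the unweighted analogue of the private _weighted_percentile helper, not the helper itself): with an even number of points, the "lower" 50th percentile is biased below np.median, while an averaged definition matches it.

```python
import numpy as np

y = np.array([1.0, 2.0, 3.0, 4.0])  # symmetric data, even number of points

print(np.percentile(y, 50, method="lower"))     # 2.0, the lower middle value
print(np.percentile(y, 50, method="midpoint"))  # 2.5, average of the two middle values
print(np.median(y))                             # 2.5
```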
export_graphviz(clf, out, class_names=[])


def test_friedman_mse_in_graphviz():
Can this test be modified instead of removed? Or is it tested elsewhere?
This test was mostly testing that the string "friedman_mse" was present in appropriate places in the output. We have some much more complete tests in this file (for the squared error criterion). So I think it's fine removing it.
Alternatively, I can keep this test and just ignore the deprecation warnings. And we'll remove it when we remove "friedman_mse" completely.
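If the test were kept, a minimal sketch of that variant could look like the following (the warning category and message are assumptions about how this PR's deprecation is raised):

```python
import pytest


@pytest.mark.filterwarnings("ignore:.*friedman_mse.*:FutureWarning")
def test_friedman_mse_in_graphviz():
    ...  # existing body unchanged, to be removed together with "friedman_mse"
```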
I would rename the test to test_criterion_in_gradient_boosting_graphviz and check that the name of the new criterion ("squared_error") is present in the nodes.
I think this would be redundant with test_graphviz_toy, where we compare against exact outputs (including examples with "squared_error").
Edit: ah no, this test fits GradientBoostingClassifier, ok let's do what you suggest.
Though TBH, I don't think this test makes a lot of sense: why test the export of GB.estimators_[0] and not RF.estimators_[0], ET.estimators_[0], etc.?
I would rather suggest a test (parametrized by the criterion) that fits a decision tree and checks the presence of the criterion name in the exported string. That way we test all criteria, and we avoid importing from sklearn.ensemble in a test from sklearn.tree, which looks like a bad pattern in most cases.
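A sketch of such a parametrized test (the test name, datasets, and parameter values are assumptions, not code from this PR):

```python
import pytest

from sklearn.datasets import load_diabetes, load_iris
from sklearn.tree import DecisionTreeClassifier, DecisionTreeRegressor, export_graphviz


@pytest.mark.parametrize(
    "Tree, loader, criterion",
    [
        (DecisionTreeClassifier, load_iris, "gini"),
        (DecisionTreeClassifier, load_iris, "entropy"),
        (DecisionTreeRegressor, load_diabetes, "squared_error"),
        (DecisionTreeRegressor, load_diabetes, "absolute_error"),
    ],
)
def test_criterion_name_in_graphviz(Tree, loader, criterion):
    X, y = loader(return_X_y=True)
    tree = Tree(criterion=criterion, max_depth=2, random_state=0).fit(X, y)
    dot_data = export_graphviz(tree)  # out_file=None returns the dot source
    # Node labels report the impurity prefixed by the criterion name.
    assert criterion in dot_data
```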
Co-authored-by: Christian Lorentzen <[email protected]>
Co-authored-by: Christian Lorentzen <[email protected]>
The HTTP 403 on
I pushed a few cosmetic fixes. However, I think that the rendering of that example by the CI will not be possible as long as #32961 is not resolved. So here is the output of a local run on my machine:
to be compared with the plots and tables in https://scikit-learn.org/dev/auto_examples/ensemble/plot_gradient_boosting_quantile.html as currently rendered. While we do observe some discrepancies at the third digit for some of the loss values, I am willing to believe that they have the same cause as explained in #32708 (comment), and the qualitative message of the example remains unchanged.
Similar observations and conclusion for:
https://scikit-learn.org/stable/auto_examples/ensemble/plot_gradient_boosting_oob.html
https://scikit-learn.org/stable/auto_examples/ensemble/plot_gradient_boosting_regression.html
https://scikit-learn.org/stable/auto_examples/ensemble/plot_gradient_boosting_regularization.html
Updates to changelog
Co-authored-by: Christian Lorentzen <[email protected]>








Reference Issues/PRs
Towards #32700 (deprecation before complete removal).
Fixes:
criterion="friedman_mse"is buggy for multi-output #32718What does this implement/fix? Explain your changes.
FriedmanMSE(MSE)insklearn/tree/_criterion.pyx"friedman_mse"for trees & forests (if criterion="friedman_mse": criterion="squared_error"+ deprecation warning)criterionparam for gradient boosting
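For readers, a minimal sketch of the deprecation behavior described above (the helper name is hypothetical; the real handling lives in the estimators' parameter validation):

```python
import warnings


def _resolve_criterion(criterion):
    # Hypothetical helper: map the deprecated value to its replacement and warn.
    if criterion == "friedman_mse":
        warnings.warn(
            'criterion="friedman_mse" is deprecated; '
            'using criterion="squared_error" instead.',
            FutureWarning,
        )
        return "squared_error"
    return criterion
```

Passing criterion="squared_error" explicitly avoids the warning.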