FIX scoring != None for RidgeCV should used unscaled y for evaluation #29842

glemaitre · 2024-09-13T16:59:59Z

While discussing with @jeromedockes, we recall to have observed something weird in the RidgeCV code. I check a bit closer and I open this PR to highlight what is the potential problem.

In RidgeCV, when having sample_weight we scale the data using the sqrt(sample_weight):

scikit-learn/sklearn/linear_model/_ridge.py

Lines 2133 to 2136 in 35164b3

    
           if sample_weight is not None: 
        
               X, y, sqrt_sw = _rescale_data(X, y, sample_weight) 
        
           else: 
        
               sqrt_sw = np.ones(n_samples, dtype=X.dtype)

The idea is that the mean squared error can be expressed as:

scikit-learn/sklearn/linear_model/_base.py

Lines 212 to 223 in 35164b3

    
               For many linear models, this enables easy support for sample_weight because 
        
                   (y - X w)' S (y - X w) 
        
               with S = diag(sample_weight) becomes 
        
                   ||y_rescaled - X_rescaled w||_2^2 
        
               when setting 
        
                   y_rescaled = sqrt(S) y 
        
                   X_rescaled = sqrt(S) X

Those "centered" data are used to optimize the ridge loss. Later in the code, we want to compute a score that can be an arbitrary metric via a scorer.

scikit-learn/sklearn/linear_model/_ridge.py

Lines 2158 to 2169 in 35164b3

    
           predictions = y - (c / G_inverse_diag) 
        
           if self.store_cv_results: 
        
               self.cv_results_[:, i] = predictions.ravel() 
        
           score_params = score_params or {} 
        
           alpha_score = self._score( 
        
               predictions=predictions, 
        
               y=y, 
        
               n_y=n_y, 
        
               scorer=scorer, 
        
               score_params=score_params, 
        
           )

The problem here is that predictions is computed efficiently as provided in the GCV paper. But these predictions are in the "scaled" space and it seems incorrect to compute any metric in this space with an arbitrary metric. Instead, we should unscale these predictions and the scaled true targets to compute the metric in the original space.

This is what this PR is intended to. I did not add any non-regression test (I assume that using the MedAE should lead to some failures) because I wanted to be sure that what I'm saying is correct.

@jeromedockes @ogrisel @lorentzenchr Does the above description make sense to you?

Edit: It seems that it relates to #13998 and #15648

Probably, I should check the tests that were written in #15648

github-actions · 2024-09-13T17:01:17Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: e9bd778. Link to the linter CI: here}

ogrisel · 2024-09-16T13:15:07Z

Cross-linking #16298 as it might be related.

glemaitre · 2024-09-17T21:56:02Z

So I added a test that was failing on main but is passing in this PR.
So we should be better. However, I discovered a new bug when dealing with sample_weight when scoring != None and with several target.

I'm going to open another PR to not overload this PR.

sklearn/linear_model/_ridge.py

glemaitre · 2024-09-18T16:43:12Z

I added a new parametrization to check that we support multioutput properly.

doc/whats_new/v1.6.rst

sklearn/linear_model/_ridge.py

sklearn/linear_model/tests/test_ridge.py

Co-authored-by: Christian Lorentzen <[email protected]>

lorentzenchr · 2024-09-18T18:34:11Z

@glemaitre Could you fix the typos in the whatsnew entry?

…scikit-learn#29842) Co-authored-by: Christian Lorentzen <[email protected]>

FIX scoring != None for RidgeCV should used unscaled y for evaluation

29fc71c

github-actions bot added the module:linear_model label Sep 13, 2024

glemaitre marked this pull request as draft September 16, 2024 08:19

glemaitre added 3 commits September 17, 2024 22:55

take into account the intercept

fbbfeac

TST make sure that we are in the original space

843a541

DOC update the changelog

c3215bc

glemaitre marked this pull request as ready for review September 17, 2024 21:52

Merge remote-tracking branch 'origin/main' into issue_ridgecv_scaling

1d1da78

glemaitre mentioned this pull request Sep 17, 2024

FIX RidgeCV works with multioutput and sample-weight non-default score #29877

Merged

thomasjpfan reviewed Sep 18, 2024

View reviewed changes

sklearn/linear_model/_ridge.py Outdated Show resolved Hide resolved

address thomas review + n_target test

af1847d

lorentzenchr approved these changes Sep 18, 2024

View reviewed changes

glemaitre and others added 2 commits September 18, 2024 18:56

Apply suggestions from code review

089ade8

Co-authored-by: Christian Lorentzen <[email protected]>

address christian comment

e9bd778

thomasjpfan approved these changes Sep 18, 2024

View reviewed changes

thomasjpfan enabled auto-merge (squash) September 18, 2024 17:04

thomasjpfan merged commit 69c1d79 into scikit-learn:main Sep 18, 2024
28 checks passed

glemaitre mentioned this pull request Sep 18, 2024

DOC fix typos introduced in #29842 #29886

Merged

lorentzenchr pushed a commit that referenced this pull request Sep 18, 2024

DOC fix typos introduced in #29842 (#29886)

938bce5

kbharat1210 pushed a commit to kbharat1210/scikit-learn that referenced this pull request Sep 25, 2024

FIX scoring != None for RidgeCV should used unscaled y for evaluation (…

be657ec

…scikit-learn#29842) Co-authored-by: Christian Lorentzen <[email protected]>

kbharat1210 pushed a commit to kbharat1210/scikit-learn that referenced this pull request Sep 25, 2024

DOC fix typos introduced in scikit-learn#29842 (scikit-learn#29886)

55cd8a5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

FIX scoring != None for RidgeCV should used unscaled y for evaluation #29842

FIX scoring != None for RidgeCV should used unscaled y for evaluation #29842

Uh oh!

glemaitre commented Sep 13, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Sep 13, 2024 •

edited

Loading

Uh oh!

ogrisel commented Sep 16, 2024

Uh oh!

glemaitre commented Sep 17, 2024

Uh oh!

Uh oh!

glemaitre commented Sep 18, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lorentzenchr commented Sep 18, 2024

Uh oh!

Uh oh!

	if sample_weight is not None:
	X, y, sqrt_sw = _rescale_data(X, y, sample_weight)
	else:
	sqrt_sw = np.ones(n_samples, dtype=X.dtype)

	For many linear models, this enables easy support for sample_weight because

	(y - X w)' S (y - X w)

	with S = diag(sample_weight) becomes

	\|\|y_rescaled - X_rescaled w\|\|_2^2

	when setting

	y_rescaled = sqrt(S) y
	X_rescaled = sqrt(S) X

	predictions = y - (c / G_inverse_diag)
	if self.store_cv_results:
	self.cv_results_[:, i] = predictions.ravel()

	score_params = score_params or {}
	alpha_score = self._score(
	predictions=predictions,
	y=y,
	n_y=n_y,
	scorer=scorer,
	score_params=score_params,
	)

Uh oh!

FIX scoring != None for RidgeCV should used unscaled y for evaluation #29842

FIX scoring != None for RidgeCV should used unscaled y for evaluation #29842

Uh oh!

Conversation

glemaitre commented Sep 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Sep 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

ogrisel commented Sep 16, 2024

Uh oh!

glemaitre commented Sep 17, 2024

Uh oh!

Uh oh!

glemaitre commented Sep 18, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lorentzenchr commented Sep 18, 2024

Uh oh!

Uh oh!

glemaitre commented Sep 13, 2024 •

edited

Loading

github-actions bot commented Sep 13, 2024 •

edited

Loading