[MRG+1] Fix BIC/AIC for Lasso #9022

agramfort · 2017-06-06T18:41:28Z

Reference Issue

taking over #6080

What does this implement/fix? Explain your changes.

AIC/BIC criterion in LassoLarsIC was buggy ...

Any other comments?

Nope

agramfort · 2017-06-06T18:42:24Z

sklearn/linear_model/least_angle.py

-        with np.errstate(divide='ignore'):
-            self.criterion_ = n_samples * np.log(mean_squared_error) + K * df
+        eps64 = np.finfo('float64').eps
+        self.criterion_ = (n_samples * mean_squared_error / (sigma2 + eps64) +


I added the eps64 to avoid the division by zero

OK, it took me a while to understand that the reason that we don't need the log is that the MSE is actually already the log-likelihood.

Maybe add a comment pointing any future reader to eqn 53/54 of https://web.stanford.edu/~hastie/Papers/dflasso.pdf? It's true the paper is already in the docstring though, so feel free to just ignore this comment.

also maybe document somewhere that the criterion is off by a constant and scaled by n, compared to the actual value, but that this doesn't affect relative comparisons? The wikipedia page for AIC has a section calling this "software unreliability"...

agramfort · 2017-06-06T18:43:07Z

sklearn/linear_model/tests/test_least_angle.py

    X = diabetes.data
    y = diabetes.target
-    X = np.c_[X, rng.randn(X.shape[0], 4)]  # add 4 bad features
+    X = np.c_[X, rng.randn(X.shape[0], 5)]  # add 4 bad features


I need this to have enough alpha on the grid so alpha_bic > alpha_aic (otherwise it was equal)

comment is now out of sync with code

comment is out of sync with code

agramfort · 2017-06-06T18:43:25Z

sklearn/linear_model/tests/test_least_angle.py

-    X = y.reshape(-1, 1)
-    lars = linear_model.LassoLarsIC(normalize=False)
-    assert_no_warnings(lars.fit, X, y)
-    assert_true(np.any(np.isinf(lars.criterion_)))


not needed anymore. There is no log anymore

The information criterion calculation is not compatible with the original paper Zou, Hui, Trevor Hastie, and Robert Tibshirani. "On the “degrees of freedom” of the lasso." The Annals of Statistics 35.5 (2007): 2173-2192. APA

agramfort · 2017-06-08T07:38:27Z

good to go on my end. Travis is happy

vene · 2017-06-08T08:48:48Z

doc/whats_new.rst


   - Fixed a memory leak in our LibLinear implementation. :issue:`9024` by
     :user:`Sergei Lebedev <superbobry>`
+   - Fix AIC/BIC criterion computation in LassoLarsIC by `Alexandre Gramfort`_


space before?

GaelVaroquaux · 2017-06-10T15:00:59Z

LGTM.

+1 for merge

agramfort · 2017-06-10T15:40:47Z

@vene you give me the other +1 plz?

vene · 2017-06-10T15:49:47Z

sklearn/linear_model/least_angle.py

            raise ValueError('criterion should be either bic or aic')

        R = y[:, np.newaxis] - np.dot(X, coef_path_)  # residuals
        mean_squared_error = np.mean(R ** 2, axis=0)


here we take the mean of sum of squares, only to multiply by n_samples afterwards. It seems mean_squared_error is not used elsewhere in the local scope, we could save some cycles.

vene · 2017-06-10T15:58:47Z

Hmm I'd like a stronger test but I don't have any good ideas of how to get one... this was subtle until I found the equations in the paper, the wikipedia page was not super helpful...

vene · 2017-06-10T16:04:00Z

LGTM apart from the minor comments. @agramfort , if you're busy but agree with my comments I'll be happy to make the changes myself and merge.

agramfort · 2017-06-11T07:23:55Z

@vene please take over. No time anymore :(

DOC comments and docstring on criterion computation

vene · 2017-06-11T22:29:12Z

My comment have been addressed and @GaelVaroquaux 's +1 should still stand. Travis passes; Appveyor failures seem irrelevant and also present on master. (AFAIK appveyor is just back from an outage.)

Merging. Thanks @agramfort and @mehmetbasbug !

* correcting information criterion calculation in least_angle.py The information criterion calculation is not compatible with the original paper Zou, Hui, Trevor Hastie, and Robert Tibshirani. "On the “degrees of freedom” of the lasso." The Annals of Statistics 35.5 (2007): 2173-2192. APA * FIX : fix AIC/BIC computation in LassoLarsIC * update what's new * fix test * fix test * address comments * DOC comments and docstring on criterion computation

agramfort mentioned this pull request Jun 6, 2017

correcting information criterion calculation in least_angle.py #6080

Closed

agramfort commented Jun 6, 2017

View reviewed changes

agramfort changed the title ~~Fix BIC/AIC for Lasso~~ [MRG] Fix BIC/AIC for Lasso Jun 6, 2017

Mehmet Basbug and others added 3 commits June 7, 2017 09:54

FIX : fix AIC/BIC computation in LassoLarsIC

bf165ff

update what's new

ee9802f

agramfort force-pushed the fix_bic_aic_lasso branch from 581c3d0 to ee9802f Compare June 7, 2017 07:56

agramfort added 2 commits June 7, 2017 14:06

fix test

fb19a5c

fix test

8ba7351

vene reviewed Jun 8, 2017

View reviewed changes

address comments

d50227f

GaelVaroquaux changed the title ~~[MRG] Fix BIC/AIC for Lasso~~ [MRG+1] Fix BIC/AIC for Lasso Jun 10, 2017

vene reviewed Jun 10, 2017

View reviewed changes

vene and others added 2 commits June 11, 2017 11:46

DOC comments and docstring on criterion computation

824f98f

Merge pull request #12 from vene/fixaic

b1addca

DOC comments and docstring on criterion computation

vene merged commit 6a35622 into scikit-learn:master Jun 11, 2017

martin-hahn mentioned this pull request Feb 17, 2018

LassoLarsIC delivers wrong coef_ when the model can fit the data with variance 0 #10641

Closed

flyjuice mentioned this pull request Aug 5, 2019

LassoLarsIC information criteria incorrectly calculated #14566

Closed

Uh oh!

[MRG+1] Fix BIC/AIC for Lasso #9022

[MRG+1] Fix BIC/AIC for Lasso #9022

Uh oh!

Conversation

agramfort commented Jun 6, 2017

Reference Issue

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

agramfort commented Jun 8, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

GaelVaroquaux commented Jun 10, 2017

Uh oh!

agramfort commented Jun 10, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vene commented Jun 10, 2017

Uh oh!

vene commented Jun 10, 2017

Uh oh!

agramfort commented Jun 11, 2017 via email

Uh oh!

vene commented Jun 11, 2017

Uh oh!

Uh oh!