[MRG+1] Fixes #10393 Fixed error when fitting RidgeCV with integers #10397

mabelvj · 2018-01-03T18:27:12Z

[Fix bug]: added support for float alphas in class _RidgeGCV(LinearModel), lines 1050 and 1052.
[Test]: added a test to check that using negative alphas does not raise an error.

amueller · 2018-01-03T19:15:01Z

negative alphas should raise an error, right?

mabelvj · 2018-01-03T20:04:50Z

Yes! Sorry, I got confused and thought it should not raise an error. It's fixed now, testing negative and positive alphas both integers and float.

jnothman

Please use Fixes #10393 in the PR description, rather than something ad hoc like #Fixes issue #10393 so that GitHub knows to close the issue automatically when this is merged.

A first glance:

jnothman · 2018-01-03T22:11:37Z

sklearn/linear_model/ridge.py

        error = scorer is None

        for i, alpha in enumerate(self.alphas):
+            if float(alpha) < 0:


I don't think we need the float cast here...

glemaitre · 2018-01-10T23:14:54Z

sklearn/linear_model/tests/test_ridge.py

+    ridge = RidgeCV(alphas)
+    assert_raises(ValueError, ridge.fit, X, y)
+
+    # Positive alphas


Do we need to test for positive alphas which are float. I think that we already are doing so in all the tests, isn't it?

glemaitre · 2018-01-10T23:16:26Z

sklearn/linear_model/tests/test_ridge.py

+    ridge = RidgeCV(alphas)
+    ridge.fit(X, y)
+
+    # Negative integers


I would separate the tests for negative alphas since that they should raise error.
You can make a test called test_ridgecv_neg_alphas() with a parametrize pytest for the integer and floating type.

glemaitre · 2018-01-10T23:16:42Z

sklearn/linear_model/tests/test_ridge.py

                                      decimal=6)


+def test_ridgecv_alphas():


rename test_ridgecv_int_alphas

glemaitre · 2018-01-10T23:17:19Z

sklearn/linear_model/tests/test_ridge.py



+def test_ridgecv_alphas():
+    # Test that no error is raised when fitting RidgeCV


I would remove this comment since that it is obvious from the renaming from the function

glemaitre · 2018-01-10T23:21:11Z

sklearn/linear_model/tests/test_ridge.py

+
+    # Integers
+    alphas = (1, 10, 100)
+    ridge = RidgeCV(alphas)


put directly a list when instantiating: RidgeCV(alphas=[1, 10, 100]).
You could also make sure that a numpy array with integer is also converted. In this case use a parametrized test

@pytest.mark.parametrize( "alphas", [(np.array([1, 10, 100])), ([1, 10, 100])]) def test_ridge_cv_alphas(alphas): X = ... y = ... ridge = RidgeCV(alphas) ridge.fit(X, y)

glemaitre · 2018-01-10T23:26:04Z

Also I would make the conversion directly from __init__:

https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/linear_model/ridge.py#L886

alphas = np.asarray(alphas, dtype=np.float64)

glemaitre · 2018-01-10T23:28:26Z

sklearn/linear_model/ridge.py

        error = scorer is None

        for i, alpha in enumerate(self.alphas):
+            if alpha < 0:


The checking needs to be done outside from the loop. Otherwise we start to compute some stuff to actually break it at the end.

So something like:

if np.any(alphas < 0): raise ValueError("alphas cannot be negative. Got {} containing some negative value instead.")

glemaitre · 2018-01-10T23:28:52Z

sklearn/linear_model/tests/test_ridge.py

+    # Negative integers
+    alphas = (-1, -10, -100)
+    ridge = RidgeCV(alphas)
+    assert_raises(ValueError, ridge.fit, X, y)


you need to use assert_raises_regex to match the string

mabelvj · 2018-01-30T00:02:02Z

Hi! Is it ok now?

jnothman · 2018-01-30T00:13:36Z

If you think the work is complete, please change WIP in the title to MRG

jnothman · 2018-01-30T00:14:56Z

sklearn/linear_model/tests/test_ridge.py

+    ]
+
+    @pytest.mark.parametrize("alpha_input, alpha_expected", testdata_alpha)
+    def test_conversion(alpha_input, alpha_expected):


this is never executed.

jnothman · 2018-01-30T00:15:10Z

sklearn/linear_model/tests/test_ridge.py

                                      decimal=6)


+def test_ridgecv_alpha_conversion_to_array():


I think you can remove this line and dedent the rest of this function

Great, I'm new to tests in python. Already fixed it.

jnothman

Thanks

jnothman · 2018-01-30T20:50:24Z

sklearn/linear_model/tests/test_ridge.py

+
+@pytest.mark.parametrize("alpha_input, alpha_expected", testdata_alpha)
+def test_conversion(alpha_input, alpha_expected):
+    assert((RidgeCV(alpha_input).get_params()['alphas'] ==


I'm not actually sure what this is trying to test. Is it trying to test that the input is validated and turned into floats before fit? We don't usually do this, because the user may also set them with set_params.

I also don't think this is currently asserting that the alphas are floats, only that they are unchanged or equivalent.

And I think we have common tests which do that. I'm short, I don't think this test adds anything in its current form.

ok, I'll remove it, just put it as a suggestion from the other reviewer. At least I've learned how these tests work.

jnothman · 2018-01-30T20:52:02Z

sklearn/linear_model/ridge.py

          normalize=False, random_state=None, solver='auto', tol=0.001)

    """
+


Please avoid introducing unnecessary and unrelated changes like this. It makes it hard to review your work, and may introduce merge conflicts for other changes in the works.

jnothman · 2018-01-30T20:53:39Z

sklearn/linear_model/ridge.py

                 cv=None, gcv_mode=None,
                 store_cv_values=False):
-        self.alphas = alphas
+        self.alphas = np.asarray(alphas, dtype=np.float64)


Usually we do not alter parameters in __init__, because they can also be set in other ways. We delay all validation until fit (except in old code)

my mistake, an error of the file. changed again

jnothman · 2018-01-30T21:55:15Z

sklearn/linear_model/tests/test_ridge.py

-            alpha_expected).all())
-
-
-def test_ridgecv_int_alphas():


why did you remove this?

jnothman · 2018-01-30T21:56:15Z

sklearn/linear_model/ridge.py

                 cv=None, gcv_mode=None,
                 store_cv_values=False):
-        self.alphas = np.asarray(alphas, dtype=np.float64)
+        self.alphas = np.asarray(alphas)


Oh, I see that this was already done in master. These days we would avoid such validation.

jnothman

flake8 error.

Otherwise LGTM

jnothman · 2018-01-31T01:41:34Z

Please add an entry to the change log under Bug Fixes at doc/whats_new/v0.20.rst. Like the other entries there, please reference this pull request with :issue: and credit yourself (and other contributors if applicable) with :user:

lesteve · 2018-02-13T09:43:03Z

doc/whats_new/v0.20.rst


 - :class:`decomposition.IncrementalPCA` in Python 2 (bug fix)
 - :class:`isotonic.IsotonicRegression` (bug fix)
- :class:`linear_model.ARDRegression` (bug fix)


You seem to have removed some text from v0.20.rst, probably without realising. Please look at your diff and re-add the text you remove.

lesteve · 2018-02-13T09:44:23Z

sklearn/linear_model/tests/test_ridge.py

-DENSE_FILTER = lambda X: X
-SPARSE_FILTER = lambda X: sp.csr_matrix(X)
+
+def DENSE_FILTER(X): return X


Please avoid changes that are not related to your PR. It makes the review less pleasant for everyone involved. Can you put back the lambdas?

I'm sorry, I don't know why this keeps happening. I had already reverted those changes.

mabelvj · 2018-02-15T23:55:02Z

I'm sorry for all the mess, I'm new to open source and did not know how to deal with remote changes and do the pull --rebase, that's why some the parts got removed. I updated the documentation adding my line and then reverted again the issue with lambdas.

qinhanmin2014

LGTM overall, @mabelvj please try to avoid unrelevant changes (there's still some extra blank lines). Also, please try to fill current line before starting a new line.

qinhanmin2014 · 2018-03-03T15:05:21Z

doc/whats_new/v0.20.rst

  overridden when using parameter ``copy_X=True`` and ``check_input=False``.
  :issue:`10581` by :user:`Yacine Mazari <ymazari>`.

+- Fixed a bug in :class:`linear_model.RidgeCV` where using negative integer 


What do you mean by this? negative integer -> integer since the bug is mainly about unexpected error when using integer alpha? (negative integer will be rejected right)

You're right, both integers raise error.

qinhanmin2014 · 2018-03-03T15:05:42Z

doc/whats_new/v0.20.rst


 - Add test :func:`estimator_checks.check_methods_subset_invariance` to check
  that estimators methods are invariant if applied to a data subset.
-  :issue:`10420` by :user:`Jonathan Ohayon <Johayon>`


Try to get rid of this strange diff.

I don't know, the file in the master has already that line. Should I remove it?

If you actually don't change anything and find it hard to get rid of it, you might just keep it. (Hope there won't be some strange things when merging)

qinhanmin2014 · 2018-03-03T15:08:18Z

sklearn/linear_model/ridge.py

                 cv=None, gcv_mode=None,
                 store_cv_values=False):
-        self.alphas = alphas
+        self.alphas = np.asarray(alphas)


Why doing so?

I was suggested to make that changes a few lines above: to add a conversion in the __init__ and then remove the float. It's done in the init of _RidgeGCV.

qinhanmin2014 · 2018-03-03T15:08:33Z

sklearn/linear_model/tests/test_ridge.py

+                        "alphas cannot be negative.",
+                        ridge.fit, X, y)
+
+    # Negative alphas


I think this test is redundant. We don't need too much tests for such a minor issue.

I added that test because the initial error stated: ValueError: Integers to negative integer powers are not allowed. So I had to add a line to raise an error for negative alphas and in the tests I was testing it worked.

You don't persuade me here but I won't focus too much on that.
I just think such a minor thing doesn't deserve so many tests.

qinhanmin2014 · 2018-03-08T13:28:45Z

@mabelvj Thanks for the explanation. I don't think I'll focus too much on these minor things, so please:
(1) resolved the conflict
(2) avoid all unrelevant changes (please double check your diff here)
((3) better try to fill current line before starting a new line and remove some unnecessary blank line)
I think it's very close from being merged.

…negative alphas and added a test

…alphas do not raise error.

…hecking of negative alphas

…lphas, and check alpha conversion to array. Added raise error when any of alphas is negative

… of integer alphas to float

…on of integer alphas to float

…ambdas

qinhanmin2014

LGTM. I've pushed some minor change about the format.

qinhanmin2014 · 2018-03-08T15:32:53Z

Thanks @mabelvj :)

mabelvj changed the title ~~[WIP] Fixes #10393 Fixed error when fitting RidgeCV with negative alpha~~ [WIP] Fixes #10393 Fixed error when fitting RidgeCV with negative alphas Jan 3, 2018

mabelvj changed the title ~~[WIP] Fixes #10393 Fixed error when fitting RidgeCV with negative alphas~~ [WIP] Fixes #10393 Fixed error when fitting RidgeCV with integers Jan 3, 2018

jnothman reviewed Jan 3, 2018

View reviewed changes

glemaitre requested changes Jan 10, 2018

View reviewed changes

glemaitre reviewed Jan 10, 2018

View reviewed changes

mabelvj force-pushed the FIXES_issue_10393_integers_in_RidgeCV_alpha branch from d93e99c to 2f71665 Compare January 17, 2018 15:55

jnothman reviewed Jan 30, 2018

View reviewed changes

mabelvj changed the title ~~[WIP] Fixes #10393 Fixed error when fitting RidgeCV with integers~~ [MRG] Fixes #10393 Fixed error when fitting RidgeCV with integers Jan 30, 2018

jnothman reviewed Jan 30, 2018

View reviewed changes

mabelvj force-pushed the FIXES_issue_10393_integers_in_RidgeCV_alpha branch from b6dc752 to 4444390 Compare January 30, 2018 21:26

jnothman reviewed Jan 30, 2018

View reviewed changes

jnothman approved these changes Jan 31, 2018

View reviewed changes

jnothman changed the title ~~[MRG] Fixes #10393 Fixed error when fitting RidgeCV with integers~~ [MRG+1] Fixes #10393 Fixed error when fitting RidgeCV with integers Feb 8, 2018

mabelvj force-pushed the FIXES_issue_10393_integers_in_RidgeCV_alpha branch 2 times, most recently from f1b47c0 to 4f9d213 Compare February 8, 2018 18:32

lesteve reviewed Feb 13, 2018

View reviewed changes

mabelvj force-pushed the FIXES_issue_10393_integers_in_RidgeCV_alpha branch from 0049219 to 9b4a319 Compare February 15, 2018 23:44

mabelvj force-pushed the FIXES_issue_10393_integers_in_RidgeCV_alpha branch from 9b4a319 to e3a6d72 Compare February 16, 2018 00:11

qinhanmin2014 reviewed Mar 3, 2018

View reviewed changes

qinhanmin2014 mentioned this pull request Mar 3, 2018

integers in RidgeCV alpha #10393

Closed

mabelvj force-pushed the FIXES_issue_10393_integers_in_RidgeCV_alpha branch 2 times, most recently from ec918fa to e3a6d72 Compare March 8, 2018 13:58

mabelvj added 10 commits March 8, 2018 15:30

[WIP] Fixes scikit-learn#10393 Fixed error when fitting RidgeCV with …

9919e65

…negative alphas and added a test

[WIP] Fixes scikit-learn#10393 Negatives alphas raise error. Integer …

06b07ef

…alphas do not raise error.

[WIP] Fixes scikit-learn#10393 Changed float(alpha) to alpha in the c…

0160dbf

…hecking of negative alphas

[WIP] Fixes scikit-learn#10393 Added tests for int alphas, negative a…

8b70b85

…lphas, and check alpha conversion to array. Added raise error when any of alphas is negative

[MRG] Fixes scikit-learn#10393 Fixed parametrized test for conversion…

5bedf02

… of integer alphas to float

[MRG] Fixes scikit-learn#10393 Removed parametrized test for conversi…

6c2cc71

…on of integer alphas to float

[MRG] Fixes scikit-learn#10393 Put back negative alphas test

063cf37

[MRG] Fixes scikit-learn#10393 Corrected flake8 error

6e48d91

[WIP] Fixes scikit-learn#10393 Corrected documentation and restored l…

54a6d00

…ambdas

[MRG] Fixes scikit-learn#10393 corrected line doc v0.20

7ce9ba4

mabelvj force-pushed the FIXES_issue_10393_integers_in_RidgeCV_alpha branch from e3a6d72 to 7ce9ba4 Compare March 8, 2018 14:34

minor format

34b1ebe

qinhanmin2014 approved these changes Mar 8, 2018

View reviewed changes

qinhanmin2014 merged commit 03dd287 into scikit-learn:master Mar 8, 2018



		def test_ridgecv_alphas():
		# Test that no error is raised when fitting RidgeCV

		normalize=False, random_state=None, solver='auto', tol=0.001)

		"""

Uh oh!

[MRG+1] Fixes #10393 Fixed error when fitting RidgeCV with integers #10397

[MRG+1] Fixes #10393 Fixed error when fitting RidgeCV with integers #10397

Uh oh!

Conversation

mabelvj commented Jan 3, 2018 • edited by glemaitre Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

amueller commented Jan 3, 2018

Uh oh!

mabelvj commented Jan 3, 2018

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

glemaitre commented Jan 10, 2018

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mabelvj commented Jan 30, 2018

Uh oh!

jnothman commented Jan 30, 2018

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

jnothman commented Jan 31, 2018

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mabelvj commented Feb 15, 2018

Uh oh!

qinhanmin2014 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mabelvj Mar 8, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

mabelvj commented Jan 3, 2018 •

edited by glemaitre

Loading

mabelvj Mar 8, 2018 •

edited

Loading

mabelvj Mar 8, 2018 •

edited

Loading