[MRG+1] Raise warning in scikit-learn/sklearn/linear_model/cd_fast.pyx for cases when the main loop exits without reaching the desired tolerance #11754
Conversation
n_classes = 2
X = np.ones([n_samples, n_features]) * 1e50
y = np.ones([n_samples, n_classes])
assert_warns(ConvergenceWarning, clf.fit, X, y)
To test this, use tiny data and set max_iter to a very small number; it will make testing faster.
Besides, this is already done in the estimator, e.g.:
Why is it not enough for you? A bug?
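A minimal sketch of the suggested test (the values are illustrative and may need tuning), assuming assert_warns from sklearn.utils.testing:

```python
import numpy as np

from sklearn.exceptions import ConvergenceWarning
from sklearn.linear_model import Lasso
from sklearn.utils.testing import assert_warns


def test_convergence_warning_tiny_problem():
    # Tiny data and a single iteration: coordinate descent cannot
    # reach a very strict tolerance, so the warning should fire.
    rng = np.random.RandomState(0)
    X = rng.randn(5, 3)
    y = rng.randn(5)
    clf = Lasso(alpha=0.01, max_iter=1, tol=1e-12)
    assert_warns(ConvergenceWarning, clf.fit, X, y)
```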
For the cases identified in #10813, the estimator raises no warning. Essentially, numerical issues cause the duality gap and the tolerance to both be equal to zero; as such, the warning won't be raised in the estimator.
sklearn/linear_model/cd_fast.pyx
Outdated
else:
    with gil:
        warnings.warn("Objective did not converge."
                      " You might want to increase the number of iterations.",
Can we include the desired and the achieved tolerance in the warning message?
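For example, something along these lines in the Cython snippet above (a sketch; the exact wording is up for discussion):

```python
# Continuing the .pyx context above: format the achieved duality gap
# and the requested tolerance into the message.
with gil:
    warnings.warn("Objective did not converge. You might want to "
                  "increase the number of iterations. Duality gap: "
                  "{}, tolerance: {}".format(gap, tol),
                  ConvergenceWarning)
```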
clf = Lasso(precompute=True)
n_samples = 15500
n_features = 500
X = np.ones([n_samples, n_features]) * 1e50
In np.ones(shape), shape is typically a tuple, not a list.
sklearn/linear_model/cd_fast.pyx
Outdated
@@ -302,6 +303,13 @@ def enet_coordinate_descent(np.ndarray[floating, ndim=1] w,
            if gap < tol:
                # return if we reached desired tolerance
                break
            else:
Shouldn't this be an else clause of the for loop, not of the if statement?
Please add a test that no warning is raised if the optimisation reaches convergence.
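For reference, Python's (and Cython's) for-else runs the else block only when the loop finishes without hitting a break, which is exactly the "exhausted max_iter without converging" case. A minimal runnable sketch:

```python
import warnings

from sklearn.exceptions import ConvergenceWarning

max_iter, tol = 5, 1e-12
gap = 1.0  # stand-in duality gap that never gets below tol here

for n_iter in range(max_iter):
    gap *= 0.5  # stand-in for one coordinate-descent pass
    if gap < tol:
        break  # converged: the else clause below is skipped
else:
    # Runs only when the loop exhausted max_iter without break-ing.
    warnings.warn("Objective did not converge. You might want to "
                  "increase the number of iterations.",
                  ConvergenceWarning)
```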
Since the function returns gap, can't we do this outside of the function?
Issue #10813 suggested raising the warning when the max number of iterations is reached and the desired tolerance has yet to be achieved. It's not obvious to me how to test for that outside of the function because it doesn't return max_iter.
Should this be changed to look for instances where gap and tolerance are both equal to zero - indicating possible numerical issues?
max_iter is passed in, so surely it is available to the calling test.
Please rename this PR to describe what it is actually changing. It is not doing what the title says.
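For illustration, a caller-side check along those lines could look like the sketch below; check_convergence and the literal argument values are assumptions made up for this sketch, not the actual private API:

```python
import warnings

from sklearn.exceptions import ConvergenceWarning


def check_convergence(gap, tol, n_iter, max_iter):
    # Hypothetical helper: warn when the solver used its whole
    # iteration budget without bringing the duality gap below the
    # requested tolerance (note: "not gap < tol" also catches the
    # degenerate gap == tol == 0.0 case).
    if n_iter >= max_iter and not gap < tol:
        warnings.warn("Objective did not converge. Duality gap: %e, "
                      "tolerance: %e" % (gap, tol), ConvergenceWarning)


# Example: the caller knows max_iter, so it can apply the check to
# whatever the Cython solver returned.
check_convergence(gap=0.0, tol=0.0, n_iter=1000, max_iter=1000)
```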
Actually, this check is already implemented in a factorized way for a number of enet_coordinate_descent* functions here, similarly to @jnothman's suggestion in https://github.com/scikit-learn/scikit-learn/pull/11754/files#r208071463.
What is missing is to find the other places where these functions are used and implement a similar check, namely just:
sklearn/covariance/graph_lasso_.py
225: coefs, _, _, _ = cd_fast.enet_coordinate_descent_gram(
The strict inequality in those functions misses the cases raised in the original issue. Essentially, numerical issues cause the duality gap and the tolerance to both be equal to zero; as such, the warning won't be raised in the estimator. Should the check in those functions be changed?
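To make the failure mode concrete, here is a sketch; dual_gap_ > eps paraphrases the factorized estimator-side check rather than quoting it verbatim:

```python
import warnings

from sklearn.exceptions import ConvergenceWarning

# Degenerate values as in #10813: numerical issues drive both the
# duality gap and the scaled tolerance to exactly zero.
dual_gap_, eps = 0.0, 0.0

if dual_gap_ > eps:   # strict comparison: stays silent when both are 0.0
    warnings.warn("Objective did not converge.", ConvergenceWarning)

if dual_gap_ >= eps:  # a non-strict comparison would fire in this case
    warnings.warn("Objective did not converge.", ConvergenceWarning)
```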
Rebased. Now the ConvergenceWarning comes from the Cython code for those who use the Cython functions directly.
@rth can you have a look?
LGTM, 👍 for merge
But a caveat: if this PR ends up cranking up the number of warnings (I've tried to check, but it's really hard to assess), I would push for reverting it.
Thanks!
We do have a bunch of warnings in the examples now (cf. https://circleci.com/gh/scikit-learn/scikit-learn/48650?utm_campaign=vcs-integration-link&utm_medium=referral&utm_source=github-build-link).
@@ -794,5 +817,11 @@ def enet_coordinate_descent_multi_task(floating[::1, :] W, floating l1_reg,
            if gap < tol:
                # return if we reached desired tolerance
                break
    else:
After this warning message was printed, the for-loop goes on if max_iter is not reached, right?
And if max_iter is reached before the condition in 767 happens, then it won't converge but never warn?
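To answer the control-flow question, below is a simplified, runnable paraphrase of the multi-task loop; run_one_cd_pass and compute_dual_gap are stand-in helpers invented for the sketch, and the guard paraphrases the condition around line 767 of cd_fast.pyx:

```python
import warnings

from sklearn.exceptions import ConvergenceWarning

max_iter, tol, d_w_tol = 3, 1e-12, 1e-4


def run_one_cd_pass():
    # Stand-in sweep: pretend the coefficients keep moving, so the
    # gap check is skipped until the very last iteration.
    return 1.0, 1.0  # (w_max, d_w_max)


def compute_dual_gap():
    return 1.0  # stand-in: the gap stays above tol


for n_iter in range(max_iter):
    w_max, d_w_max = run_one_cd_pass()
    # Only compute the (expensive) dual gap when the coefficients
    # stopped moving much, or on the very last iteration.
    if w_max == 0.0 or d_w_max / w_max < d_w_tol or n_iter == max_iter - 1:
        gap = compute_dual_gap()
        if gap < tol:
            break  # converged: the for-else warning below is skipped
else:
    # Because the guard always holds on the last iteration, the loop
    # cannot exhaust max_iter without evaluating the gap; when that
    # final gap is still >= tol, execution ends up here and warns.
    warnings.warn("Objective did not converge.", ConvergenceWarning)
```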
…r cases when the main loop exits without reaching the desired tolerance (scikit-learn#11754)
…t.pyx for cases when the main loop exits without reaching the desired tolerance (scikit-learn#11754)" This reverts commit 97a1b0e.
Reference Issues/PRs
Fixes #10813.

What does this implement/fix? Explain your changes.
This pull request adds ConvergenceWarnings to the enet_coordinate_descent* solvers found in scikit-learn/sklearn/linear_model/cd_fast.pyx for cases when the main loop exits without reaching the desired tolerance.

Any other comments?
Tests have been included in both sklearn/linear_model/tests/test_coordinate_descent.py and sklearn/linear_model/tests/test_sparse_coordinate_descent.py.