Fix for spectral clustering error when using 'amg' solver #13707

whitews · 2019-04-24T13:10:32Z

Reference Issues/PRs

Fixes #13393. See also PR #12316

What does this implement/fix? Explain your changes.

Fixes LinAlgError when using spectral clustering with the amg solver

Any other comments?

This PR is derived from the previous PR #12316 submitted by Andrew Knyazev (lobpcg). In that PR, Andrew fixed issue #13393 and also added a new label assignment option 'clusterQR'. It was requested that the PR be split to separate the fix and the new label assignment functionality. This PR contains Andrew's fix for the AMG bug.


jnothman

Thanks for this @whitews

I'm not sure what chance it has to get into 0.21, but just in case:
Please add an entry to the change log at doc/whats_new/v0.21.rst. Like the other entries there, please reference this pull request with :issue: and credit yourself (and other contributors if applicable) with :user:

sklearn/manifold/spectral_embedding_.py

whitews · 2019-04-24T14:02:30Z

Updated the change log. Let me know if anything else is needed.

sklearn/manifold/spectral_embedding_.py

sklearn/manifold/tests/test_spectral_embedding.py

whitews · 2019-04-25T02:01:15Z

Don't understand the codecov/patch failure, all local tests are passing for me. What does this failure mean?

jnothman · 2019-04-25T03:48:30Z

Codecov checks for lines of code that are never accessed during tests.

whitews · 2019-04-25T14:00:11Z

@jnothman This seems pretty clean now, everything is passing.

sklearn/manifold/spectral_embedding_.py

lobpcg · 2019-04-26T19:36:58Z

I think the mathematically proper fix is to change, in sklearn/manifold/spectral_embedding_.py, the present

        laplacian = _set_diag(laplacian, 1 + 1e-5, norm_laplacian)

        # noinspection PyUnboundLocalVariable
        ml = smoothed_aggregation_solver(check_array(laplacian, 'csr'))

into

        laplacian = _set_diag(laplacian, 1, norm_laplacian)

        # noinspection PyUnboundLocalVariable
        ml = smoothed_aggregation_solver(check_array(laplacian + 1e-5 * sparse.eye(laplacian.shape[0]), 'csr'))

so that the LOBPCG solver is still called on the unchanged Laplacian, but only the AMG preconditioner is fed with the shifted Laplacian. I have updated my #13393 to highlight this.

I am unsure how memory allocation would work in my suggestion above. Maybe, to save memory, one could shift the Laplacian in place, build the preconditioner, and then shift it back:

        laplacian = _set_diag(laplacian, 1, norm_laplacian)
        # shift so that the matrix passed to the AMG solver is non-singular
        laplacian = laplacian + 1e-5 * sparse.eye(laplacian.shape[0])
        # noinspection PyUnboundLocalVariable
        ml = smoothed_aggregation_solver(check_array(laplacian, 'csr'))
        # undo the shift so that LOBPCG sees the original Laplacian
        laplacian = laplacian - 1e-5 * sparse.eye(laplacian.shape[0])
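
For readers following along, here is a minimal sketch of the idea, not the code in this PR. It runs LOBPCG on the unshifted Laplacian while building the preconditioner from the shifted one. To stay self-contained it assumes a toy ring graph and swaps the pyamg AMG hierarchy for a sparse LU factorization (`scipy.sparse.linalg.splu`); the names `L`, `L_shifted` and `M` are illustrative only.

```python
import numpy as np
from scipy import sparse
from scipy.sparse.csgraph import laplacian as graph_laplacian
from scipy.sparse.linalg import LinearOperator, lobpcg, splu

rng = np.random.RandomState(0)

# Toy connected graph (an assumption for illustration): a ring of n nodes.
n = 200
rows = np.arange(n)
cols = (rows + 1) % n
adjacency = sparse.coo_matrix((np.ones(n), (rows, cols)), shape=(n, n))
adjacency = (adjacency + adjacency.T).tocsr()

# Normalized graph Laplacian: positive semi-definite, hence singular.
L = graph_laplacian(adjacency, normed=True).tocsr()

# Only the preconditioner sees the shifted, strictly positive definite matrix.
shift = 1e-5
L_shifted = (L + shift * sparse.eye(n, format="csr")).tocsc()
factor = splu(L_shifted)  # stand-in for the AMG hierarchy, NOT what the PR uses
M = LinearOperator((n, n), matvec=factor.solve, matmat=factor.solve,
                   dtype=L.dtype)

# LOBPCG itself is called on the unchanged Laplacian L.
k = 3
X = rng.standard_normal((n, k))
eigenvalues, eigenvectors = lobpcg(L, X, M=M, tol=1e-6, maxiter=100,
                                   largest=False)
print(np.sort(eigenvalues))
```

Because the shifted matrix is a separate object, no un-shift step is needed, at the cost of holding a second sparse matrix in memory. In the PR the role of `M` is played by the pyamg hierarchy via `ml.aspreconditioner()`.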

lobpcg · 2019-04-26T21:59:46Z

sklearn/manifold/tests/test_spectral_embedding.py

+    centers = np.eye(n_clusters, n_features)
+    S, true_labels = make_blobs(n_samples=n_samples, centers=centers,
+                                cluster_std=1., random_state=42)
+


Maybe check norm_laplacian = False and norm_laplacian = True separately. The latter is the default and is the only option currently checked.
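
As a rough illustration of why both settings matter, the sketch below checks that the graph Laplacian is singular with and without normalization, and that a 1e-5 diagonal shift makes it positive definite in both cases. It is not the test in this PR: it assumes a toy path graph and uses `scipy.sparse.csgraph.laplacian` in place of scikit-learn's internal helpers.

```python
import numpy as np
from scipy import sparse
from scipy.sparse.csgraph import laplacian as graph_laplacian

# Toy path graph (an assumption for illustration); end nodes have degree 1,
# so the normalized and unnormalized Laplacians genuinely differ.
n = 50
idx = np.arange(n - 1)
adjacency = sparse.coo_matrix((np.ones(n - 1), (idx, idx + 1)), shape=(n, n))
adjacency = (adjacency + adjacency.T).tocsr()

shift = 1e-5
smallest = {}
for normed in (False, True):
    lap = graph_laplacian(adjacency, normed=normed).toarray()
    # eigvalsh returns eigenvalues in ascending order.
    eigvals = np.linalg.eigvalsh(lap)
    shifted_eigvals = np.linalg.eigvalsh(lap + shift * np.eye(n))
    smallest[normed] = (eigvals[0], shifted_eigvals[0])
    print(f"normed={normed}: smallest eigenvalue {eigvals[0]:.2e}, "
          f"after shift {shifted_eigvals[0]:.2e}")
```

In an actual test suite the same loop would more naturally be written with `pytest.mark.parametrize` over the two `norm_laplacian` values.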

jnothman · 2019-05-22T10:58:37Z

@whitews are you continuing with this?

whitews · 2019-05-23T17:27:01Z

@jnothman Yes, I've updated the PR. Have the tests changed? Getting a 404 response for a deb package in the np_atlas Azure build.

jnothman · 2019-05-23T22:15:48Z

The tests haven't changed, but the world they operate in has. We will merge a fix soon.

jnothman · 2019-05-27T01:25:07Z

Yes, but I've been low on time to review this and other pull requests. I hope one of us can get to it soon.

lobpcg · 2019-07-23T15:17:20Z

@jnothman is there a plan to finish this one?

jnothman

Thanks @whitews.

Btw, I've confirmed that the test fails on master, which is good.

doc/whats_new/v0.21.rst

sklearn/manifold/spectral_embedding_.py

Co-Authored-By: Joel Nothman <[email protected]>

jnothman · 2019-08-02T02:06:46Z

Please merge the master from upstream to avoid the Circle CI failure.

whitews · 2019-08-02T21:15:16Z

@jnothman I think this is ready. Are there any outstanding requests?

jnothman

Otherwise LGTM. Awaiting another review. Thanks @whitews and @lobpcg.

doc/whats_new/v0.22.rst

lobpcg · 2019-08-13T20:13:49Z

https://scikit-learn.org/stable/modules/generated/sklearn.cluster.spectral_clustering.html says that using the amg eigen solver may lead to instabilities. I think this note can now be removed.

ogrisel

LGTM, I pushed some cosmetic commits, will merge when CI is green.

jnothman · 2019-08-30T02:44:08Z

Great! Thanks @whitews and @lobpcg

whitews added 3 commits April 24, 2019 08:50

change AMG tolerance default & laplacian shift (fixes scikit-learn#13393)

a0f1f7b

add spectral clustering test for AMG solver

6e5ecf6

update docs with edits from Andrew Knyazev (& some fixed)

d61cf3b

whitews changed the title ~~Spec clust amg fix~~ Fix for spectral clustering error when using 'amg' solver Apr 24, 2019

jnothman reviewed Apr 24, 2019

sklearn/manifold/spectral_embedding_.py (outdated)

sklearn/manifold/spectral_embedding_.py (outdated)

whitews added 2 commits April 24, 2019 09:39

revert tolerance value changes, not needed for AMG solver fix

b0c4356

update v0.21 changelog noting scikit-learn#13393 fix

0c8390b

jnothman reviewed Apr 24, 2019

whitews added 5 commits April 24, 2019 19:36

simplify diag correction in spectral_embedding

f64decb

revert the reversion: increased tolerances are required

cf126b2

use importorskip instead of try/except clause for availability of pyamg

d9fc5ee

reference issue in amg solver failure test

1f544ea

clarify random seed change for spectral embedding amg failure test

5a3a058

whitews added 2 commits April 25, 2019 08:40

Merge branch 'master' into spec-clust-amg-fix

bba21b0

leave original tolerance for 'lobpcg' eigen solver

346cff0

lobpcg reviewed Apr 26, 2019

sklearn/manifold/spectral_embedding_.py (outdated)

lobpcg mentioned this pull request Apr 26, 2019

[MRG] clusterQR method added to spectral segmentation #12316

Closed

lobpcg reviewed Apr 26, 2019

whitews added 4 commits May 23, 2019 12:59

implement original shift code from lobpcg, add comment

98b17ec

Merge branch 'master' into spec-clust-amg-fix

302850c

fix long line

2eed15e

only shift laplacian for the solver, then un-shift back to original

783e6e4

jnothman closed this Jul 25, 2019

jnothman reopened this Jul 25, 2019

Merge branch 'master' into spec-clust-amg-fix

65bf8e9

jnothman reviewed Jul 25, 2019

doc/whats_new/v0.21.rst (outdated)

sklearn/manifold/spectral_embedding_.py (outdated)

sklearn/manifold/spectral_embedding_.py (outdated)

whitews and others added 5 commits August 1, 2019 21:49

Update sklearn/manifold/spectral_embedding_.py

4a2e3df

Co-Authored-By: Joel Nothman <[email protected]>

remove noinspection comment

0362c39

removing spectral clustering bug text

f46f91b

add spectral clustering fix contribution

61d54de

fix markup in last commit

c48fbe0

Merge branch 'master' of https://github.com/scikit-learn/scikit-learn …

95f2a99

…into spec-clust-amg-fix

jnothman approved these changes Aug 4, 2019

doc/whats_new/v0.22.rst (outdated)

doc/whats_new/v0.22.rst

mention SpectralEmbedding & SpectralClustering classes in release notes

a452c95

thomasjpfan added the Waiting for Reviewer label Aug 5, 2019

This was referenced Aug 13, 2019

Fix plot_coin_segamentation speed issue #13383 #13652

Closed

Amg arpack workaround fix #14647

Merged

ogrisel added 3 commits August 29, 2019 12:19

Merge remote-tracking branch 'origin/master' into spec-clust-amg-fix

e601a8c

Update AMG docstring and improve codestyle

de645ba

Stricter check in pyamg test

0501603

ogrisel approved these changes Aug 29, 2019

ogrisel merged commit 372092c into scikit-learn:master Aug 29, 2019

massich mentioned this pull request Sep 6, 2019

ENH propagate eigen_tol to all eigen solver #11968

Closed

This was referenced Jan 3, 2020

test_spectral_embedding_amg_solver_failure random failure #16011

Closed

[MRG+1] Better non-regression test for spectral embedding AMG solver issue #16014

Merged

lobpcg deleted the spec-clust-amg-fix branch October 26, 2021 02:13
