[Closed] adding clusterQR to spectral clustering, and LOBPCG as an SVD solver to PCA and Truncated PCA #12291
…ors computed, and fixed a small bug
Update from original
This pull request introduces 2 alerts when merging 73a54d0 into 877e3f3 - view on LGTM.com new alerts:
Comment posted by LGTM.com
This pull request introduces 1 alert when merging 2771657 into e6359e2 - view on LGTM.com new alerts:
Comment posted by LGTM.com
I apologize for unintentionally putting 2 unrelated issues (clusterQR for spectral clustering, and LOBPCG as an SVD solver for PCA and Truncated PCA) in a single pull request. This is my first pull request :-) I am also unsure whether I am following the best, or even proper, practices and protocols, given my multiple attempts to clear all checks... I am open to good advice, please.
Could you please split this PR so that the lobpcg_svd method, along with its use as an additional solver in PCA / TruncatedSVD, is one contribution and the clusterQR method is in its own PR (possibly on top of the first one)? Also please add unittest tests for the Finally, can you tell when is
@ogrisel Thanks for your comment! Please see below:
How do I do it, technically speaking, please? I push my local changes to my fork, and then my pull request pulls in all the changes made...
I am working on it now, to be added here soon, after I fix all the current errors (only one left!)
I'll try. It may not be so easy, because better performance is expected for large-size cases. Do you know of examples doing large-scale SVD?
git checkout -b clusterQR # alias the current state of your local branch as a new branch for your clusterQR code and everything mixed in with it
git commit # save any ongoing work in that branch so you do not lose it
git checkout master
git fetch https://github.com/scikit-learn/scikit-learn.git master # make sure you get the up-to-date master
git reset --hard FETCH_HEAD # reset your local master to the scikit-learn master just fetched (a fetch by URL does not update origin/master), to start clean
git checkout -b lobpcg-svd # create a new local branch dedicated to lobpcg-svd
# report only the changes related to lobpcg to here (without anything related to clusterQR)
# add some tests
git commit # note: please use a descriptive commit message
git push https://github.com/lobpcg/scikit-learn.git lobpcg-svd # push your new branch to your fork
# create a new pull request from the lobpcg-svd branch of your fork to the scikit-learn master branch
If you need you can always do
If you need more help with git and github, please have a look at the online resources: https://help.github.com/articles/git-and-github-learning-resources/
It depends on what you call large scale. In any case, the examples should ideally take less than 30s to run, because they are all executed each time we build the online documentation. Have a look at the existing examples:
git grep "PCA" examples/
git grep "TruncatedSVD" examples/
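A small, fast LOBPCG-based example could look something like the sketch below. It is only an illustration, assuming SciPy's `scipy.sparse.linalg.lobpcg` is applied to the Gram matrix `A.T @ A` to get the leading singular triplets; the helper name `lobpcg_svd_sketch` and its parameters are hypothetical and are not the `lobpcg_svd` code from this PR.

```python
import numpy as np
from scipy.sparse.linalg import lobpcg


def lobpcg_svd_sketch(A, k, seed=0, tol=1e-8, maxiter=200):
    """Top-k singular triplets of A via LOBPCG on the Gram matrix A.T @ A.

    Hypothetical helper for illustration only (not the PR's lobpcg_svd).
    Assumes the k leading singular values are nonzero.
    """
    rng = np.random.default_rng(seed)
    n_features = A.shape[1]
    gram = A.T @ A  # dense Gram matrix; acceptable for a small demo only
    X0 = rng.standard_normal((n_features, k))  # random initial block
    eigvals, V = lobpcg(gram, X0, largest=True, tol=tol, maxiter=maxiter)
    # Sort explicitly so the result does not depend on the solver's ordering.
    order = np.argsort(eigvals)[::-1]
    eigvals, V = eigvals[order], V[:, order]
    s = np.sqrt(np.clip(eigvals, 0.0, None))  # singular values of A
    U = (A @ V) / s  # left singular vectors, one column per singular value
    return U, s, V.T
```

Calling `U, s, Vt = lobpcg_svd_sketch(A, 3)` on a matrix with a few hundred rows should finish quickly, so an example of this size would stay well inside the documentation build budget; the advantage over dense solvers is only expected to show on much larger inputs.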
clusterQR method added to spectral segmentation in PR #12316
Reference Issues/PRs
Fixes #12164, #12079, #12080
What does this implement/fix? Explain your changes.
#12164 adds the clusterQR method alongside 'kmeans' and 'discretize' as a label-assignment option in spectral clustering
#12079 adds LOBPCG as an SVD solver in PCA
#12080 adds LOBPCG as an SVD solver in TruncatedSVD (Truncated PCA)
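For readers unfamiliar with clusterQR, the idea can be sketched as follows: take the n_samples x k matrix of eigenvectors from the spectral embedding, use a column-pivoted QR factorization to pick k representative samples, rotate the embedding by the orthogonal polar factor of that k x k block, and assign each sample to the coordinate with the largest magnitude. This is a minimal sketch under those assumptions; `cluster_qr_sketch` is a hypothetical name and the code is not copied from this PR.

```python
import numpy as np
from scipy.linalg import qr, svd


def cluster_qr_sketch(vectors):
    """Assign each row of `vectors` (n_samples, k) to one of k clusters.

    Hypothetical illustration of the clusterQR idea, not the PR's code.
    """
    k = vectors.shape[1]
    # Column-pivoted QR of the transposed embedding: the first k pivots
    # index k well-separated representative samples.
    _, _, piv = qr(vectors.T, pivoting=True)
    # Orthogonal polar factor of the selected k x k block, via its SVD.
    ut, _, v = svd(vectors[piv[:k], :].T)
    # Rotate the embedding so each cluster aligns with one coordinate axis.
    rotated = np.abs(vectors @ (ut @ v.conj()))
    # The label is the axis with the largest magnitude.
    return rotated.argmax(axis=1)
```

Unlike 'kmeans', this assignment has no random initialization and no iterations, so repeated runs on the same eigenvectors give the same labels.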
Any other comments?
Currently closed and moved to #12319