DOC Rework plot_document_clustering.py example #23528
…nto doc_clustering
Can you please merge the |
…nto doc_clustering
The CircleCI hosting job is triggered but there is no "check the rendered doc" direct link to the artifact yet, see #23534 (comment) for more details.
The doc CI is broken because the k-means++ init on very sparse data can select initial centroids that never get updated, and this causes problems in the silhouette clustering evaluation. The example build log is also very verbose and takes time to render in Firefox when scrolling. Here is a PR to make the log size much more manageable: #23557.
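To make the failure mode above concrete, here is a toy sketch in plain Python. It is not scikit-learn's implementation: `sq_dist`, `lloyd`, and the data are hypothetical and only illustrate one possible reading of the comment, namely that an isolated document picked as an initial centroid keeps a cluster to itself, so its centroid never moves.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def lloyd(points, init_centroids, n_iter=10):
    """Plain Lloyd iterations starting from the given initial centroids."""
    centroids = [list(c) for c in init_centroids]
    labels = []
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: sq_dist(p, centroids[k]))
            for p in points
        ]
        # Update step: each centroid becomes the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, label in zip(points, labels) if label == k]
            if members:
                centroids[k] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, labels


# Three overlapping "documents" and one that shares no feature with them.
points = [
    [1.0, 1.0, 0.0],
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 5.0],  # isolated document
]

# Assumption: the seeding happened to pick the isolated document.
centroids, labels = lloyd(points, [points[0], points[3]])

print(labels)        # [0, 0, 0, 1]
print(centroids[1])  # [0.0, 0.0, 5.0]
```

The second centroid ends exactly where it started and its cluster has a single member, which is the kind of degenerate cluster that makes per-cluster evaluation scores hard to interpret.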
A couple of comments.
Thank you for this contribution, @ArturoAmorQ! 🙂
Here are some comments and suggestions.
In a nutshell, I suggest that:
- sentences can sometimes be shorter and their meaning more accurate
- Sphinx references can be adapted: :func:`~*` should be changed to :class:`~*` when referencing classes
- cross-references can be made when content already exists in the documentation
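As a rough illustration of the Sphinx role point, the sketch below flags `:func:` roles whose target looks like a class. The helper `suspicious_func_roles` and the sample sentence are hypothetical, and the CapWords check is only a naming heuristic, not something Sphinx itself enforces.

```python
import re

# Matches a Sphinx :func: role and captures the dotted target, with or
# without the leading "~" that shortens the rendered link text.
FUNC_ROLE = re.compile(r":func:`~?([\w.]+)`")


def suspicious_func_roles(text):
    """Return :func: targets whose last component is CapWords (likely a class)."""
    hits = []
    for target in FUNC_ROLE.findall(text):
        last = target.rsplit(".", 1)[-1]
        if last[:1].isupper():
            hits.append(target)
    return hits


sample = (
    "We cluster with :func:`~sklearn.cluster.KMeans` and score with "
    ":func:`~sklearn.metrics.silhouette_score`."
)
print(suspicious_func_roles(sample))  # ['sklearn.cluster.KMeans']
```

Here `KMeans` is a class, so the first role would become :class:`~sklearn.cluster.KMeans`, while `silhouette_score` is a function and keeps `:func:`.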
Co-authored-by: Julien Jerphanion <[email protected]>
Co-authored-by: Olivier Grisel <[email protected]>
…nto doc_clustering
This looks great! Just a final batch of suggestions.
Thank you very much @ArturoAmorQ!
Co-authored-by: Olivier Grisel <[email protected]>
Co-authored-by: Julien Jerphanion <[email protected]>
…into doc_clustering
There seems to be a problem with |
…nto doc_clustering
I merged |
It seems to work indeed.
Merged! Thank you very much @ArturoAmorQ! I think the linked example for adjustment for chance could also benefit from a tutorialization:
Thank you, @ArturoAmorQ!
Co-authored-by: Olivier Grisel <[email protected]> Co-authored-by: Guillaume Lemaitre <[email protected]> Co-authored-by: Julien Jerphanion <[email protected]>
Reference Issues/PRs
Related to #22928 and #23266
What does this implement/fix? Explain your changes.
This is the third release of the revamped examples to serve as a tutorial series on text analysis.
Any other comments?
Side effect: Implements notebook style as intended in #22406