[MRG+1] Remove the MLComp text categorization example #8264

rth · 2017-02-01T23:12:53Z

This PR fixes issue #8229 (document classification example should use 'latin-1' encoding) by removing the example (as suggested by @lesteve ).

This also raises a deprecation warning when load_mlcomp is used to load the 20 newsgoups example, where fetch_20newsgroups should preferably be used instead. However, as both datasets are not strictly identical, maybe that's not the best solution.

The MLComp text categorization example is mostly redundant with the other text categorization example while being less complete, and incites user to use a more complex way of loading the 20 newsgoups dataset via load_mlcomp instead of fetch_20newsgroups.

lesteve · 2017-02-02T13:41:35Z

This also raises a deprecation warning when load_mlcomp is used to load the 20 newsgoups example, where fetch_20newsgroups should preferably be used instead. However, as both datasets are not strictly identical, maybe that's not the best solution.

Chatting with @ogrisel he agrees that deprecating load_mlcomp is the way to go. According to http://mlcomp.org/, they are going to shut down the website in March 2017.

lesteve · 2017-02-02T13:44:19Z

sklearn/datasets/mlcomp.py

@@ -68,6 +68,11 @@ def load_mlcomp(name_or_id, set_="raw", mlcomp_root=None, **kwargs):
    if not os.path.exists(mlcomp_root):
        raise ValueError("Could not find folder: " + mlcomp_root)

+    if name_or_id in ['20news-18828', '20news-19997', '20news-bydate']:
+        raise DeprecationWarning("please consider using "


Deprecate the whole function with a @deprecated decorator. Also mention which version it will be deprecated in (0.19) and which version it will be removed (0.21). Look either at the contributing guidelines or other deprecation messages in the scikit-learn code.

rth · 2017-02-02T14:14:03Z

Thanks for the review @lesteve . I addressed your comments.

lesteve · 2017-02-02T15:22:18Z

The MLComp text categorization example is mostly redundant with the other text categorization example while being less complete

Can I get a second opinion on this? @raghavrv or @jnothman for example.

jnothman

Perhaps note somewhere in docs (deprecation message or docstring) that the site itself is closing down. Otherwise LGTM

rth · 2017-02-03T18:38:44Z

Thanks for the review @jnothman ! I addressed your comment.
CI is failing due to travis-ci/travis-ci#7264, will try to trigger a new build a bit later..

lesteve · 2017-02-07T10:01:07Z

The Travis problem has been fixed, I have restarted the build.

lesteve · 2017-02-08T08:28:49Z

Looks good, merging, thanks a lot!

) and deprecate load_mlcomp.

rth added 2 commits February 1, 2017 23:46

Removing the mlcomp_sparse_document_classification.py

21c7810

Add deprecation warning

03d95ba

lesteve reviewed Feb 2, 2017

View reviewed changes

Addressing review comments

b1ddebb

jnothman reviewed Feb 3, 2017

View reviewed changes

jnothman changed the title ~~[MRG] Remove the MLComp text categorization example~~ [MRG+1] Remove the MLComp text categorization example Feb 3, 2017

rth force-pushed the mlp_example branch from 5b77781 to 5c3f908 Compare February 4, 2017 13:35

Mentioned the that the website is closing in the warning message

4662111

rth force-pushed the mlp_example branch from 5c3f908 to 4662111 Compare February 4, 2017 17:11

lesteve merged commit 542c02b into scikit-learn:master Feb 8, 2017

rth deleted the mlp_example branch February 8, 2017 09:50

sergeyf pushed a commit to sergeyf/scikit-learn that referenced this pull request Feb 28, 2017

[MRG+1] Remove the MLComp text categorization example (scikit-learn#8264

a85943c

) and deprecate load_mlcomp.

Przemo10 mentioned this pull request Mar 17, 2017

update fork (#1) #8606

Closed

Sundrique pushed a commit to Sundrique/scikit-learn that referenced this pull request Jun 14, 2017

[MRG+1] Remove the MLComp text categorization example (scikit-learn#8264

ee79716

) and deprecate load_mlcomp.

NelleV pushed a commit to NelleV/scikit-learn that referenced this pull request Aug 11, 2017

[MRG+1] Remove the MLComp text categorization example (scikit-learn#8264

96bc0fb

) and deprecate load_mlcomp.

paulha pushed a commit to paulha/scikit-learn that referenced this pull request Aug 19, 2017

[MRG+1] Remove the MLComp text categorization example (scikit-learn#8264

45a93ac

) and deprecate load_mlcomp.

maskani-moh pushed a commit to maskani-moh/scikit-learn that referenced this pull request Nov 15, 2017

[MRG+1] Remove the MLComp text categorization example (scikit-learn#8264

7a88825

) and deprecate load_mlcomp.

lemonlaug pushed a commit to lemonlaug/scikit-learn that referenced this pull request Jan 6, 2021

[MRG+1] Remove the MLComp text categorization example (scikit-learn#8264

45fa802

) and deprecate load_mlcomp.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[MRG+1] Remove the MLComp text categorization example #8264

[MRG+1] Remove the MLComp text categorization example #8264

Uh oh!

rth commented Feb 1, 2017

Uh oh!

lesteve commented Feb 2, 2017 •

edited

Loading

Uh oh!

lesteve Feb 2, 2017

Uh oh!

rth commented Feb 2, 2017

Uh oh!

lesteve commented Feb 2, 2017 •

edited

Loading

Uh oh!

jnothman left a comment

Uh oh!

rth commented Feb 3, 2017

Uh oh!

lesteve commented Feb 7, 2017

Uh oh!

lesteve commented Feb 8, 2017

Uh oh!

Uh oh!

Uh oh!

[MRG+1] Remove the MLComp text categorization example #8264

[MRG+1] Remove the MLComp text categorization example #8264

Uh oh!

Conversation

rth commented Feb 1, 2017

Uh oh!

lesteve commented Feb 2, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lesteve Feb 2, 2017

Choose a reason for hiding this comment

Uh oh!

rth commented Feb 2, 2017

Uh oh!

lesteve commented Feb 2, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

rth commented Feb 3, 2017

Uh oh!

lesteve commented Feb 7, 2017

Uh oh!

lesteve commented Feb 8, 2017

Uh oh!

Uh oh!

lesteve commented Feb 2, 2017 •

edited

Loading

lesteve commented Feb 2, 2017 •

edited

Loading