[MRG + 1] FIX Calling fit_transform instead of transform in Pipeline's fit_predict #7585

gszpak · 2016-10-05T18:16:08Z

Reference Issue

This PR fixes issue #7558.

What does this implement/fix? Explain your changes.

As discussed in #7558, each transformer in pipeline should call fit_transform instead of transform in fit_predict. test_fix_predict_on_pipeline is also fixed.

…redict (scikit-learn#7558)

amueller · 2016-10-05T18:37:34Z

thanks, lgtm :)

amueller

pep8 ;)

amueller · 2016-10-05T18:42:20Z

sklearn/tests/test_pipeline.py


    # first compute the transform and clustering step separately
    scaled = scaler.fit_transform(iris.data)
    separate_pred = km.fit_predict(scaled)

    # use a pipeline to do the transform and clustering in one step
-    pipe = Pipeline([('scaler', scaler), ('Kmeans', km)])
+    pipe = Pipeline([('scaler', scaler_for_pipeline), ('Kmeans', km_for_pipeline)])


this line's too long now :)

amueller · 2016-10-05T19:42:19Z

thanks!

jnothman · 2016-10-05T23:24:26Z

This is actually fixing a regression, and an error on my part :( Tagging with 0.18.1 for backport. Almost LGTM.

jnothman · 2016-10-05T23:25:10Z

sklearn/tests/test_pipeline.py

@@ -277,14 +277,19 @@ def test_fit_predict_on_pipeline():
    # transform and clustering steps separately
    iris = load_iris()
    scaler = StandardScaler()
+    scaler_for_pipeline = StandardScaler()


Please add a comment that this is necessary since Pipeline does not clone the estimators.

…_predict_on_pipeline

jnothman · 2016-10-06T21:19:36Z

LGTM, thanks

…t_predict (scikit-learn#7585) * BUGFIX Calling fit_transform instead of transform in Pipeline's fit_predict (scikit-learn#7558) * PEP8 fixes in test_fit_predict_on_pipeline * Added comment explaining separate estimators for pipeline in test_fit_predict_on_pipeline

BUGFIX Calling fit_transform instead of transform in Pipeline's fit_p…

a7c2870

…redict (scikit-learn#7558)

amueller requested changes Oct 5, 2016

View reviewed changes

amueller changed the title ~~[MRG] FIX Calling fit_transform instead of transform in Pipeline's fit_predict~~ [MRG + 1] FIX Calling fit_transform instead of transform in Pipeline's fit_predict Oct 5, 2016

PEP8 fixes in test_fit_predict_on_pipeline

f8433c3

amueller approved these changes Oct 5, 2016

View reviewed changes

jnothman added this to the 0.18.1 milestone Oct 5, 2016

jnothman added the Bug label Oct 5, 2016

jnothman requested changes Oct 5, 2016

View reviewed changes

Added comment explaining separate estimators for pipeline in test_fit…

e386c18

…_predict_on_pipeline

jnothman approved these changes Oct 6, 2016

View reviewed changes

jnothman merged commit 0a1f6cd into scikit-learn:master Oct 6, 2016

gszpak deleted the fix_pipeline_fit_predict branch October 7, 2016 09:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[MRG + 1] FIX Calling fit_transform instead of transform in Pipeline's fit_predict #7585

[MRG + 1] FIX Calling fit_transform instead of transform in Pipeline's fit_predict #7585

Uh oh!

gszpak commented Oct 5, 2016

Uh oh!

amueller commented Oct 5, 2016

Uh oh!

amueller left a comment

Uh oh!

amueller Oct 5, 2016

Uh oh!

amueller commented Oct 5, 2016

Uh oh!

jnothman commented Oct 5, 2016

Uh oh!

jnothman Oct 5, 2016

Uh oh!

jnothman commented Oct 6, 2016

Uh oh!

Uh oh!

Uh oh!

[MRG + 1] FIX Calling fit_transform instead of transform in Pipeline's fit_predict #7585

[MRG + 1] FIX Calling fit_transform instead of transform in Pipeline's fit_predict #7585

Uh oh!

Conversation

gszpak commented Oct 5, 2016

Reference Issue

What does this implement/fix? Explain your changes.

Uh oh!

amueller commented Oct 5, 2016

Uh oh!

amueller left a comment

Choose a reason for hiding this comment

Uh oh!

amueller Oct 5, 2016

Choose a reason for hiding this comment

Uh oh!

amueller commented Oct 5, 2016

Uh oh!

jnothman commented Oct 5, 2016

Uh oh!

jnothman Oct 5, 2016

Choose a reason for hiding this comment

Uh oh!

jnothman commented Oct 6, 2016

Uh oh!

Uh oh!