Fix column_transformer to use fitparams like Pipeline #21311

jschmidtml · 2021-10-12T15:16:24Z

Reference Issues/PRs

Fixes #19465.

What does this implement/fix? Explain your changes.

This fix implements fitparams in the same way as Pipelines to meet parity

Any other comments?

amueller · 2021-10-12T22:54:40Z

sklearn/tests/test_columntransformer.py

+fit_params = {'standard_scaler__sample_weight': df['equal_sample_weight']}
+ct_wEqualWeight = ct.fit_transform(X=df[['x1']], y=df['y'], **fit_params)
+
+assert_array_equal(sc1_xWeight, ct_xWeight, err_msg= "These should be equal")


pep8: there shouldn't be a space after "=" here. But you can also just remove the err_msg I think.

amueller · 2021-10-12T22:56:24Z

sklearn/compose/_column_transformer.py

        return "(%d of %d) Processing %s" % (idx, total, name)

-    def _fit_transform(self, X, y, func, fitted=False, column_as_strings=False):
+    def _check_fit_params(self, **fit_params):


Is this copied from pipeline or somewhere else?

yes, it came from pipeline. I tried to make it work in the same way.

amueller · 2021-10-12T22:56:52Z

Thanks for the PR, looks good overall, I think :)

This makes sense to unblock this use-case but really we need to fix how to do this properly, cc @adrinjalali ;)

adrinjalali · 2021-10-13T07:59:27Z

Thanks for the PR. But I would rather prioritize sample props and #21284 for the next release, in which case we wouldn't need this solution.

amueller · 2021-10-22T18:56:51Z

@adrinjalali that makes sense. How would the code look like for a simple pipeline with a column transformer to pass sample_weight everywhere?

adrinjalali · 2021-10-23T18:21:21Z

So taking an example from here:

from sklearn.compose import ColumnTransformer
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.preprocessing import OneHotEncoder
column_trans = ColumnTransformer(
    [('categories', OneHotEncoder(dtype='int').fit_requests(sample_weight=True), ['city']),
     ('title_bow', CountVectorizer().fit_requests(sample_weight=True), 'title')],
    remainder='drop', verbose_feature_names_out=False)

And if you put this column transformer in a pipeline, you don't need to do anything extra, since the consumer has already requested the metadata. Then if you call pipeline.fit(X, y, sample_weight=my_weights), it will forward them all to where they're requested.

jschmidtml · 2021-10-26T19:14:02Z

Sorry, just getting back to this. That will work, thank you!

adrinjalali · 2024-03-07T10:01:46Z

ColumnTransformer now supports metadata routing.

Fix column_transformer to use fitparams like Pipeline

29d021d

github-actions bot added the module:compose label Oct 12, 2021

amueller reviewed Oct 12, 2021

View reviewed changes

adrinjalali closed this Mar 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix column_transformer to use fitparams like Pipeline #21311

Fix column_transformer to use fitparams like Pipeline #21311

Uh oh!

jschmidtml commented Oct 12, 2021

Uh oh!

amueller Oct 12, 2021

Uh oh!

amueller Oct 12, 2021

Uh oh!

jschmidtml Oct 13, 2021

Uh oh!

amueller commented Oct 12, 2021 •

edited

Loading

Uh oh!

adrinjalali commented Oct 13, 2021

Uh oh!

amueller commented Oct 22, 2021

Uh oh!

adrinjalali commented Oct 23, 2021

Uh oh!

jschmidtml commented Oct 26, 2021

Uh oh!

adrinjalali commented Mar 7, 2024

Uh oh!

Uh oh!

Uh oh!

Fix column_transformer to use fitparams like Pipeline #21311

Fix column_transformer to use fitparams like Pipeline #21311

Uh oh!

Conversation

jschmidtml commented Oct 12, 2021

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

amueller Oct 12, 2021

Choose a reason for hiding this comment

Uh oh!

amueller Oct 12, 2021

Choose a reason for hiding this comment

Uh oh!

jschmidtml Oct 13, 2021

Choose a reason for hiding this comment

Uh oh!

amueller commented Oct 12, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adrinjalali commented Oct 13, 2021

Uh oh!

amueller commented Oct 22, 2021

Uh oh!

adrinjalali commented Oct 23, 2021

Uh oh!

jschmidtml commented Oct 26, 2021

Uh oh!

adrinjalali commented Mar 7, 2024

Uh oh!

Uh oh!

amueller commented Oct 12, 2021 •

edited

Loading