Ability to cache FeatureUnion transformers #9008

Open · jnothman opened this issue Jun 6, 2017 · 10 comments

jnothman (Member) commented Jun 6, 2017

It seems reasonable to support a memory parameter for FeatureUnion, like the one recently added to Pipeline (#7990). It is valuable because parameters of some constituent transformers can be searched over while others are left unchanged; the unchanged ones should not need to be re-fit from scratch.
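
For context, a minimal sketch of the Pipeline behaviour this proposal would mirror (the dataset and grid below are illustrative, not from the issue):

```python
# Sketch of the existing Pipeline(memory=...) caching from #7990 that
# this proposal would mirror for FeatureUnion. Assumes scikit-learn >= 0.19.
from tempfile import mkdtemp

from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline

X, y = load_iris(return_X_y=True)

pipe = Pipeline(
    [("pca", PCA(n_components=2)), ("clf", LogisticRegression())],
    memory=mkdtemp(),  # fitted transformers are cached on disk
)

# Only clf__C varies, so PCA is fit once per CV split and the cached
# fit is reused for the remaining parameter candidates.
GridSearchCV(pipe, {"clf__C": [0.1, 1.0, 10.0]}, cv=3).fit(X, y)
```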

jnothman changed the title from "cache FeatureUnion transformers" to "Ability to cache FeatureUnion transformers" on Jun 6, 2017
lsorber commented Jun 6, 2017

Couldn't this effect be obtained by wrapping the FeatureUnion transformers in cached Pipelines? That is, assuming the full Pipeline would be cached as suggested in #9007.
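
A minimal sketch of that workaround with the current API (the transformers are illustrative). One caveat, if I read the code correctly: Pipeline caches only its non-final steps, so an identity FunctionTransformer is appended to make the wrapped transformer cacheable:

```python
from tempfile import mkdtemp

from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectKBest
from sklearn.pipeline import FeatureUnion, Pipeline
from sklearn.preprocessing import FunctionTransformer

cachedir = mkdtemp()

def cached(name, transformer):
    # FunctionTransformer() with the default func is the identity; it sits
    # last so the real transformer becomes a cached, non-final step.
    return Pipeline(
        [(name, transformer), ("identity", FunctionTransformer())],
        memory=cachedir,
    )

union = FeatureUnion([
    ("pca", cached("pca", PCA(n_components=2))),
    ("kbest", cached("kbest", SelectKBest(k=1))),
])

X, y = load_iris(return_X_y=True)
union.fit(X, y)
union.fit(X, y)  # identical params: the inner fits are read from the cache
```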

jnothman (Member, Author) commented Jun 6, 2017 via email

GaelVaroquaux (Member) commented Jun 6, 2017 via email

GaelVaroquaux (Member) commented Jun 6, 2017 via email

caioaao (Contributor) commented Jun 7, 2017

Can this be assigned to me? I'm really interested in this, as it should be very useful for the cases described in #8960.

jnothman (Member, Author) commented Jun 7, 2017

We can't use GitHub assignment: it only allows assigning to team members. But as far as I'm concerned, you're welcome to contribute a patch.

psinger commented Jul 18, 2018

Has there been any update on this? It seems to me that FeatureUnion is not cached at all within a Pipeline.

jnothman (Member, Author) commented Jul 19, 2018 via email

psinger commented Jul 20, 2018

@jnothman At first, for testing purposes, I was running only a single FeatureUnion within a pipeline, and it did not get cached. Apparently the pipeline needs more than one step, even if the FeatureUnion itself consists of multiple steps.

Anyway, it was more of a gut feeling after following the discussion in this thread. I have some FeatureUnion operations with BOW vectorizers inside and can't see any speed improvement across consecutive executions after enabling the cache. If I am correct, the main reason is that transforms are not cached, only fits. And I am not 100% sure it works properly for FeatureUnion.

By and large, I don't have clear tests for this, so I will get back to this thread once I have more insight into the topic.
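
One concrete way to check is to pass a verbose joblib.Memory, so that any recomputation of a cached fit is logged; a quiet second fit then indicates a cache hit (sketch, assuming a recent joblib):

```python
from tempfile import mkdtemp

from joblib import Memory
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

# verbose=1 logs whenever the cached transformer fit is recomputed.
memory = Memory(location=mkdtemp(), verbose=1)

pipe = Pipeline(
    [("pca", PCA(n_components=2)), ("clf", LogisticRegression())],
    memory=memory,
)

X, y = load_iris(return_X_y=True)
pipe.fit(X, y)  # logs the PCA fit: cache miss
pipe.fit(X, y)  # no PCA log line: the fit came from the cache
```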

nxorable (Contributor) commented

In my experience, this enhancement would apply to ColumnTransformer as well.
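
For what it's worth, the same Pipeline-wrapping workaround can be sketched for ColumnTransformer, which likewise lacks a memory parameter (the transformers and column indices below are illustrative):

```python
from tempfile import mkdtemp

import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import FunctionTransformer, StandardScaler

cachedir = mkdtemp()

# The identity FunctionTransformer keeps the scaler in a non-final,
# and therefore cached, position of the inner Pipeline.
ct = ColumnTransformer([
    (
        "scale",
        Pipeline(
            [("scaler", StandardScaler()), ("identity", FunctionTransformer())],
            memory=cachedir,
        ),
        [0, 1],  # columns handled by this branch
    ),
])

X = np.random.RandomState(0).rand(20, 3)
ct.fit_transform(X)
ct.fit_transform(X)  # the scaler fit is loaded from the cache
```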
