FIX Adjust tags in RFE to allow nans by default #21807

thomasjpfan · 2021-11-28T04:15:01Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

This PR adjusts RFE to allow nans by default (allow_nan=True) and only overrides allow_nan with the underlying estimator's value if that estimator has tags.

Any other comments?

I agree with #21743 (comment) for meta-estimators in general. I think meta-estimators should default to allow_nan=True, unless proven otherwise.

bmreiniger · 2023-10-05T18:14:13Z

Can we bring this back to life? It's somewhat niche, but came up again organically with a SelectFromModel with a pipeline estimator (where we wanted to apply some processing for the selection, but not for the resulting output).

adrinjalali · 2023-10-05T20:17:35Z

@thomasjpfan wanna give this an update? happy to review.

github-actions · 2023-10-13T14:41:54Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: cb96116.

thomasjpfan · 2023-10-13T16:38:38Z

This one is a little trickier. Consider the following two pipelines:

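from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# The imputer fills in nans before they reach the later steps.
pipe1 = make_pipeline(
    SimpleImputer(),
    StandardScaler(),
    LogisticRegression(),
)
# Without an imputer, nans reach LogisticRegression.
pipe2 = make_pipeline(
    StandardScaler(),
    LogisticRegression(),
)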

In both cases, the pipeline's first step accepts nans, but an imputer replaces the nans in its output, whereas a scaler passes them through. This means pipe1 can accept nans, but pipe2 cannot, because the nans reach LogisticRegression. It is not possible to know if a whole pipeline can accept nans, because the tags do not describe whether an estimator outputs nans.
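As a rough illustration of the difference between the two pipelines, the sketch below fits both on data containing a single nan. The iris dataset and the helper name `fits_with_nans` are assumptions made for this example and are not part of the comment above.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Illustrative data (an assumption): iris with one missing value injected.
X, y = load_iris(return_X_y=True)
X[0, 0] = np.nan

pipe1 = make_pipeline(SimpleImputer(), StandardScaler(), LogisticRegression())
pipe2 = make_pipeline(StandardScaler(), LogisticRegression())


def fits_with_nans(pipe):
    """Hypothetical helper: return True if the pipeline fits on X with nans."""
    try:
        pipe.fit(X, y)
    except ValueError:
        # Raised by the step that cannot handle nans in its input.
        return False
    return True


pipe1_ok = fits_with_nans(pipe1)
pipe2_ok = fits_with_nans(pipe2)
print("pipe1 fits with nans:", pipe1_ok)
print("pipe2 fits with nans:", pipe2_ok)
```

The failure in pipe2 comes from the final step rather than the first one, which is why the first step's tags alone cannot answer the question for the whole pipeline.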

adrinjalali

LGTM. I think it's okay to delay the errors / warnings to the step which actually has to deal with it anyway.

ogrisel

Let's also test for RFECV but otherwise, LGTM.

sklearn/feature_selection/tests/test_rfe.py

doc/whats_new/v1.4.rst

Co-authored-by: Olivier Grisel <[email protected]>

thomasjpfan added 2 commits November 27, 2021 23:05

FIX Corrects tags for pipeline and rfe

6ab6e7f

DOC Grammar

6e7aae5

github-actions bot added module:feature_selection module:pipeline module:utils labels Nov 28, 2021

ENH Adds whats new

f0ce4f4

thomasjpfan added 4 commits October 13, 2023 09:44

Merge remote-tracking branch 'upstream/main' into tags-meta-estimators

e697785

Fixes errors with merge

c8ce54e

DOC Move whats new

c795fd7

TST Update test

b5d81bf

thomasjpfan added 3 commits October 13, 2023 10:43

CLN Simplify logic

dbfce87

CLN Simplify logic

915946b

Revert "CLN Simplify logic"

6b08c8f

This reverts commit dbfce87.

thomasjpfan marked this pull request as draft October 13, 2023 15:05

thomasjpfan changed the title ~~FIX Corrects tags for pipeline and RFE~~ FIX Adjust tags in RFE to allow nans by default Oct 13, 2023

thomasjpfan added 2 commits October 13, 2023 11:46

Only change RFE

9e94d58

DOC Remove whats new item

66f11ab

thomasjpfan marked this pull request as ready for review October 13, 2023 16:38

adrinjalali approved these changes Oct 17, 2023

adrinjalali added Quick Review and Waiting for Second Reviewer labels Oct 17, 2023

ogrisel approved these changes Oct 20, 2023

Parametrize non-regression test to also cover RFECV.

f74eeca

ogrisel reviewed Oct 20, 2023

Update changelog accordingly.

cb96116

ogrisel enabled auto-merge (squash) October 20, 2023 15:10

ogrisel merged commit 3ff6c82 into scikit-learn:main Oct 20, 2023

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Oct 31, 2023

FIX Adjust tags in RFE to allow nans by default (scikit-learn#21807)

1d8e018

Co-authored-by: Olivier Grisel <[email protected]>

REDVM pushed a commit to REDVM/scikit-learn that referenced this pull request Nov 16, 2023

FIX Adjust tags in RFE to allow nans by default (scikit-learn#21807)

acc0aac

Co-authored-by: Olivier Grisel <[email protected]>
