FIX Fix ranking for scipy >= 1.10. #24483

cmarmo · 2022-09-20T20:07:33Z

Reference Issues/PRs

Addresses test_grid_search_failing_classifier failure reported in #24424 and #24446.

What does this implement/fix? Explain your changes.

When the input contains NaN, the new default of scipy.stats.rankdata in scipy >= 1.10 returns all NaN.
To keep the previous behaviour, NaNs are set to the minimum value in the array before ranking.
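
For illustration, the idea can be sketched as follows. This is a minimal sketch, not the code merged in this PR: the helper name `rank_mean_scores` is hypothetical, the all-NaN branch is an assumption about how that edge case could be handled, and `np.nanmin` follows the wording of the follow-up PR #24543. It assumes the scores contain no +/- np.inf.

```python
import numpy as np
from scipy.stats import rankdata


def rank_mean_scores(array_means):
    """Rank scores in descending order (best score gets rank 1), NaNs last.

    Hypothetical helper for illustration only.
    """
    array_means = np.asarray(array_means, dtype=float)
    if np.isnan(array_means).all():
        # Assumption: if every score is NaN, all candidates tie for rank 1.
        return np.ones_like(array_means, dtype=np.int32)
    # Replace NaN by a value strictly below every non-NaN score, so that
    # rankdata never sees a NaN and behaves the same on any scipy version.
    min_score = np.nanmin(array_means) - 1
    filled = np.nan_to_num(array_means, copy=True, nan=min_score)
    # Negate so that the highest score receives rank 1; ties share the
    # lowest rank of their group (method="min").
    return rankdata(-filled, method="min").astype(np.int32, copy=False)


ranks = rank_mean_scores([0.8, np.nan, 0.9, np.nan, 0.5])
print(ranks)  # [2 4 1 4 3]
```

Because the NaNs are replaced on a copy, the original array of means is left unchanged, and the two NaN entries share the last rank.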

Any other comments?

@ogrisel, @lesteve please let me know if this is too naive a fix.

sklearn/model_selection/_search.py

Micky774

I'm assuming we do not reasonably expect array_means to contain +/- np.inf, in which case this looks good! As a sanity check, and to learn a new tool, I wrote a small Hypothesis test, which failed to find a falsifying example:

Hypothesis Test Script

import numpy as np
from hypothesis import assume, given, settings
from hypothesis import strategies as st
from hypothesis.extra import numpy as hnp
from scipy.stats import rankdata


@given(
    hnp.arrays(
        hnp.floating_dtypes(),
        st.tuples(
            st.integers(min_value=1, max_value=1000),
            st.integers(min_value=1, max_value=1000),
        ),
    )
)
@settings(max_examples=500)
def test_rank_func(x):
    assume(not np.any(np.isinf(x)))
    y = x.copy()
    rank_result_y = rankdata(-y, method="min")
    rank_result_y[np.isnan(rank_result_y)] = len(rank_result_y)

    min_x = x.min() - 1
    np.nan_to_num(x, copy=False, nan=min_x)
    rank_result_x = rankdata(-x, method="min")

    np.testing.assert_allclose(rank_result_x, rank_result_y)

sklearn/model_selection/_search.py

Co-authored-by: Meekail Zain <[email protected]>

sklearn/model_selection/_search.py

betatim · 2022-09-21T12:09:47Z

This PR might also fix #20678

ogrisel

Thanks for the fix @cmarmo. This LGTM. Let's just check that the failure in our [scipy-dev] run goes away as expected when triggering it.

ogrisel · 2022-09-23T15:25:37Z

test_grid_search_failing_classifier is fixed in the [scipy-dev] run. Merging.

Fix ranking for scipy >= 1.10.

aa2172d

github-actions bot added the module:model_selection label Sep 20, 2022

TomDLT reviewed Sep 20, 2022

sklearn/model_selection/_search.py (outdated)

Remove for loop.

a21429e

Micky774 approved these changes Sep 21, 2022

sklearn/model_selection/_search.py (outdated)

Update sklearn/model_selection/_search.py

937b21c

Co-authored-by: Meekail Zain <[email protected]>

betatim reviewed Sep 21, 2022

sklearn/model_selection/_search.py (outdated)

cmarmo added 2 commits September 21, 2022 08:45

Merge branch 'main' into scipy-dev-grid-search-failing

669369e

Copy the array of means to keep results unchanged.

0b31282

Micky774 added the Waiting for Reviewer label Sep 22, 2022

betatim approved these changes Sep 23, 2022

Trigger [scipy-dev]

c22ccaa

ogrisel approved these changes Sep 23, 2022

ogrisel merged commit 0bf2479 into scikit-learn:main Sep 23, 2022

cmarmo deleted the scipy-dev-grid-search-failing branch September 23, 2022 17:57

glemaitre mentioned this pull request Sep 29, 2022

MAINT use nanmin to replace nan by finite values in ranking of SearchCV #24543

Merged
