[MRG+1] Changing default model for IterativeImputer to BayesianRidge #13038
Conversation
Please update the documentation.
Whoops, forgot. Thanks.
I assume this does not also affect documentation in the user guide.
We probably should have had a test that would have broken with this change, i.e. checking the class of each predictor. We should make sure there is such a test at least for custom predictors to ensure no regressions....
Sorry, I don't follow. What would the test ensure about custom predictors?
I meant like #13039, but also for predictor=None |
I think you mean this?
Yeah, that sort of thing. What I really mean is that historically we should have tested the effect of sample_posterior, but now it is more straightforward.
LGTM
I would refactor this test together with the previous one; it would require just one more statement.
Otherwise LGTM
sklearn/tests/test_impute.py
@@ -572,6 +572,24 @@ def test_iterative_imputer_predictors(predictor):
    assert len(set(hashes)) == len(hashes)


def test_iterative_imputer_bayesianridge_default():
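(The body of the new test is not visible in the hunk above. Purely as illustration, and not the author's actual code, a check of the BayesianRidge default might look like the following sketch, assuming the development-branch predictor and n_iter parameter names used elsewhere in this thread:)

def test_iterative_imputer_bayesianridge_default():
    # Hypothetical sketch: with predictor=None, every fitted predictor in the
    # imputation sequence should be a BayesianRidge instance.
    rng = np.random.RandomState(0)
    X = sparse_random_matrix(100, 10, density=0.10, random_state=rng).toarray()
    imputer = IterativeImputer(missing_values=0, n_iter=1, random_state=rng)
    imputer.fit_transform(X)
    for triplet in imputer.imputation_sequence_:
        assert isinstance(triplet.predictor, BayesianRidge)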
Would it be better to just add this case to the previous test?
I meant:
@pytest.mark.parametrize(
    "predictor",
    [None, DummyRegressor(), BayesianRidge(), ARDRegression(), RidgeCV()]
)
def test_iterative_imputer_predictors(predictor):
    rng = np.random.RandomState(0)
    n = 100
    d = 10
    X = sparse_random_matrix(n, d, density=0.10, random_state=rng).toarray()
    imputer = IterativeImputer(missing_values=0,
                               n_iter=1,
                               predictor=predictor,
                               random_state=rng)
    imputer.fit_transform(X)

    # check that types are correct for predictors
    hashes = []
    for triplet in imputer.imputation_sequence_:
        expected_type = (type(predictor) if predictor is not None
                         else type(BayesianRidge()))
        assert isinstance(triplet.predictor, expected_type)
        hashes.append(id(triplet.predictor))

    # check that each predictor is unique
    assert len(set(hashes)) == len(hashes)
Good idea. Done.
Merging when CI turns green.
Thanks for the change.
As discussed in #13026, it turns out that having RidgeCV as the default is problematic for reproducibility. This default should be gentler. Note, this may not pass tests until a more stable example is merged via #13026.
Paging @jnothman
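For context, a minimal sketch of the behavioural change under discussion. It uses the development-branch API visible in this thread (the predictor and n_iter parameters); the imports and data setup here are assumptions, and parameter names may differ in released versions:

import numpy as np
from sklearn.impute import IterativeImputer
from sklearn.linear_model import BayesianRidge

rng = np.random.RandomState(0)
X = rng.rand(100, 10)
X[rng.rand(100, 10) < 0.1] = 0  # treat zeros as missing values

# With predictor=None the imputer now falls back to BayesianRidge (previously
# RidgeCV), so repeated runs with a fixed random_state are reproducible.
imputer = IterativeImputer(missing_values=0, n_iter=1, random_state=0)
X_filled = imputer.fit_transform(X)

# Equivalent to passing the new default explicitly:
imputer_explicit = IterativeImputer(missing_values=0, n_iter=1,
                                    predictor=BayesianRidge(),
                                    random_state=0)
X_filled_explicit = imputer_explicit.fit_transform(X)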