[MRG + 1] #10336 adding fit_predict to mixture models #11281

haoranShu · 2018-06-15T17:15:25Z

Reference Issues

What does this implement/fix? Explain your changes.

Added a fit_predict method to all Gaussian mixture models, and added tests for both BayesianGaussianMixture and GaussianMixture.

Any other comments?

fit now calls fit_predict, which does the actual computation. This way, the log_resp computed while fitting can be reused directly to produce the predicted labels.
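A minimal sketch of the delegation pattern described above, assuming a toy one-dimensional, two-cluster estimator. `ToyCentroidClusterer` and its helper `_assign` are hypothetical names for illustration only; this is not the scikit-learn implementation, which works with `log_resp` from the EM E-step rather than nearest-centroid assignment.

```python
class ToyCentroidClusterer:
    """Hypothetical 1-D, two-cluster estimator (not scikit-learn code)."""

    def __init__(self, n_iter=10):
        self.n_iter = n_iter

    @staticmethod
    def _assign(X, centers):
        # Label each point with the index of its nearest center.
        c0, c1 = centers
        return [0 if abs(x - c0) <= abs(x - c1) else 1 for x in X]

    def fit_predict(self, X):
        # All the real work happens here: alternate assignment and update.
        centers = [min(X), max(X)]
        for _ in range(self.n_iter):
            labels = self._assign(X, centers)
            for k in (0, 1):
                members = [x for x, lab in zip(X, labels) if lab == k]
                if members:
                    centers[k] = sum(members) / len(members)
        self.centers_ = centers
        # One final assignment with the final centers, so the returned
        # labels agree with what predict() gives on the same data.
        return self._assign(X, centers)

    def fit(self, X):
        # fit simply delegates and discards the labels.
        self.fit_predict(X)
        return self

    def predict(self, X):
        return self._assign(X, self.centers_)


X = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
labels_direct = ToyCentroidClusterer().fit_predict(X)
labels_two_step = ToyCentroidClusterer().fit(X).predict(X)
```

Keeping the computation in `fit_predict` means the labels come from quantities already available at the end of fitting, and `fit` stays a thin wrapper.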

amueller · 2018-06-15T17:19:49Z

sklearn/mixture/base.py

        times until the change of likelihood or lower bound is less than
-        `tol`, otherwise, a `ConvergenceWarning` is raised.
+        `tol`, otherwise, a `ConvergenceWarning` is raised. After fitting, it
+        predicts the most probable label for the input data points.


This docstring doesn't seem correct.

You are right. I'm changing it back to original.

amueller · 2018-06-15T17:20:37Z

sklearn/mixture/base.py

        """Estimate model parameters with the EM algorithm.

-        The method fit the model `n_init` times and set the parameters with
+        The method first fit the model `n_init` times and set the parameters with


should probably be "fits", right?

amueller · 2018-06-15T17:51:33Z

sklearn/mixture/tests/test_bayesian_mixture.py

+        assert_array_equal(Y_pred1, Y_pred2)
+
+
+def test_bayesian_mixture_predict_predict_proba():


If this was copied from the other test, can you maybe say so?

I don't get how this relates to the issue... What am I missing?

Sorry, just saw this! This was copied from the other test; I will add a comment saying so. We added this test because, to test fit_predict, we intended to check two things: 1. it is equivalent to fit().predict(); 2. its output is correct. There was no test for the correctness of predict() for BGMM, so we added one to confirm that fit_predict actually yields the correct output.

I see, great!
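The first of the two checks discussed in this thread, that fit_predict matches fit().predict(), can be sketched roughly as follows. This is an illustration rather than the test added in the PR: the helper name `check_fit_predict_equivalence`, the synthetic two-blob data, and the seed are all assumptions, and it presumes a scikit-learn version (0.20 or later) in which the mixture models expose `fit_predict`.

```python
import numpy as np
from numpy.testing import assert_array_equal
from sklearn.mixture import BayesianGaussianMixture, GaussianMixture


def check_fit_predict_equivalence(estimator_cls, seed=0):
    """Hypothetical helper: compare fit().predict() with fit_predict()."""
    rng = np.random.RandomState(seed)
    # Two well-separated 2-D blobs (synthetic data, assumed for the sketch).
    X = np.vstack([rng.normal(-5.0, 1.0, size=(50, 2)),
                   rng.normal(5.0, 1.0, size=(50, 2))])

    # Two fresh estimators seeded with the same integer, so both fits
    # start from the same initialization.
    labels_two_step = estimator_cls(
        n_components=2, random_state=seed).fit(X).predict(X)
    labels_direct = estimator_cls(
        n_components=2, random_state=seed).fit_predict(X)

    assert_array_equal(labels_two_step, labels_direct)
    return labels_direct


results = {cls.__name__: check_fit_predict_equivalence(cls)
           for cls in (GaussianMixture, BayesianGaussianMixture)}
```

The second check, correctness of the labels themselves, would additionally compare the predictions against known ground truth, which is what the copied predict test covers.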

amueller · 2018-06-15T17:52:36Z

looks good. Is that jet in your avatar? ;)

jnothman

Otherwise LGTM.

It looks like those tests could be refactored though

jnothman · 2018-06-17T01:40:48Z

sklearn/mixture/tests/test_bayesian_mixture.py

+        assert_array_equal(Y_pred1, Y_pred2)
+
+
+def test_bayesian_mixture_predict_predict_proba():


I don't get how this relates to the issue... What am I missing?

jnothman · 2018-06-17T01:42:57Z

sklearn/mixture/tests/test_gaussian_mixture.py

+        X = rand_data.X[covar_type]
+        Y = rand_data.Y
+        g = GaussianMixture(n_components=rand_data.n_components,
+                            random_state=rng, weights_init=rand_data.weights,


I think we should be passing random_state=0 rather than passing an object which will be changed with each iteration

I think it doesn't matter, because the only thing we need is that the GMM does not change within one iteration. Using a different random_state for each COVARIANCE_TYPE arguably makes the test more robust, I guess?
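The difference under discussion can be seen with NumPy alone. The sketch below is an assumption-laden illustration, not code from the PR: `draws_per_iteration` is a hypothetical helper that mimics, in simplified form, how an integer seed yields a fresh generator each time while a shared `RandomState` object keeps advancing.

```python
import numpy as np


def draws_per_iteration(random_state, n_iter=3):
    """Hypothetical helper: one random draw per loop iteration."""
    draws = []
    for _ in range(n_iter):
        if isinstance(random_state, int):
            # An int seed gives a fresh, identically seeded generator.
            rng = np.random.RandomState(random_state)
        else:
            # A RandomState instance is reused, so its state carries over.
            rng = random_state
        draws.append(float(rng.random_sample()))
    return draws


with_int = draws_per_iteration(0)
with_obj = draws_per_iteration(np.random.RandomState(0))
```

With an integer, every iteration sees the same draw; with a shared object, each iteration sees the next value in the stream. Either is defensible in a test, as long as estimators that are meant to be compared within one iteration are seeded consistently.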

jnothman · 2018-06-20T02:14:52Z

You have flake8 failures

haoranShu · 2018-06-20T02:30:50Z

Just fixed some flake8 problems. There is one left, at line 456 of tests/test_bayesian_mixture.py, which I do not know how to solve.

jnothman · 2018-06-20T05:35:58Z

sklearn/mixture/tests/test_bayesian_mixture.py

+            Y = rand_data.Y
+            bgmm = BayesianGaussianMixture(n_components=rand_data.n_components,
+                                           random_state=rng,
+                                           weight_concentration_prior_type=prior_type,


you could either leave this flake8 issue unsolved, or do:

    bgmm = BayesianGaussianMixture(
        n_components=rand_data.n_components, random_state=rng,
        weight_concentration_prior_type=prior_type,
        covariance_type=covar_type)

jnothman · 2018-06-20T05:36:31Z

sklearn/mixture/tests/test_bayesian_mixture.py

+        assert_array_equal(Y_pred1, Y_pred2)
+
+
+def test_bayesian_mixture_predict_predict_proba():


I see, great!

jnothman · 2018-06-20T05:36:57Z

Please add an entry to the change log at doc/whats_new/v0.20.rst. Like the other entries there, please reference this pull request with :issue: and credit yourself (and other contributors, if applicable) with :user:.

I'm not sure if it's better listed under API changes or as an enhancement.

jnothman · 2018-06-20T05:37:19Z

sklearn/mixture/base.py

+        times until the change of likelihood or lower bound is less than
+        `tol`, otherwise, a `ConvergenceWarning` is raised. After fitting, it
+        predicts the most probable label for the input data points.
+


Perhaps this deserves .. versionadded:: 0.20

Sorry I missed an email and just saw this! Will do soon.

I am putting it under API changes, because there seems to be no enhancement in terms of efficiency or accuracy.

jnothman · 2018-07-03T02:54:43Z

Thanks @haoranShu

舒浩然 and others added 5 commits June 15, 2018 12:10

added fit_predict with tests

a59b240

added fit_predict test for bgmm

fe7d4fe

style

0b331ef

Adding test for predict proba for BGMM

8584b9d

removing unneeded variables

5c8fd78

amueller reviewed Jun 15, 2018

recovered fit() docstring

f25f9b0

amueller reviewed Jun 15, 2018

amueller changed the title ~~#10336 adding fit_predict to mixture models~~ [MRG + 1] #10336 adding fit_predict to mixture models Jun 15, 2018

amueller approved these changes Jun 15, 2018

jnothman reviewed Jun 17, 2018

added comment to copied test function

0118652

fixed flake8 issues

afcbcaa

jnothman approved these changes Jun 20, 2018

jnothman reviewed Jun 20, 2018

舒浩然 and others added 2 commits July 2, 2018 20:37

added log in whats_new

178d1ce

PEP*

e73b965

jnothman merged commit c303ed8 into scikit-learn:master Jul 3, 2018


4 participants