[MRG+2] Added sample weight support to confusion matrix. #4001
Conversation
Implementation for #3450. Any feedback is appreciated.

cm = confusion_matrix(y_true, y_pred, sample_weight=weights)
assert_array_almost_equal(cm, [[4.1, 0.8, 0.1],
I'd rather not have this hard-coded. More precisely, we can use:

assert_array_almost_equal(cm,
    .1 * confusion_matrix(y_true[:25], y_pred[:25]) +
    .2 * confusion_matrix(y_true[25:50], y_pred[25:50]) +
    .3 * confusion_matrix(y_true[50:], y_pred[50:]))
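A self-contained version of that check might look like the sketch below. The random data, the 25-sample chunks, and the 0.1/0.2/0.3 weights are illustrative stand-ins for the PR's actual test fixtures; labels is passed explicitly so the per-chunk matrices have the same shape as the full one.

    import numpy as np
    from numpy.testing import assert_array_almost_equal
    from sklearn.metrics import confusion_matrix

    rng = np.random.RandomState(0)
    y_true = rng.randint(0, 3, size=75)
    y_pred = rng.randint(0, 3, size=75)
    labels = [0, 1, 2]
    # Piecewise-constant weights: 0.1 for samples 0-24, 0.2 for 25-49, 0.3 for 50-74.
    weights = np.repeat([0.1, 0.2, 0.3], 25)

    cm = confusion_matrix(y_true, y_pred, labels=labels, sample_weight=weights)
    assert_array_almost_equal(
        cm,
        0.1 * confusion_matrix(y_true[:25], y_pred[:25], labels=labels) +
        0.2 * confusion_matrix(y_true[25:50], y_pred[25:50], labels=labels) +
        0.3 * confusion_matrix(y_true[50:], y_pred[50:], labels=labels))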
That's definitely better. I was influenced by the tests around it which had hard-coded values as well. Thank you for the feedback! Will push a commit shortly.
Are you still here? Can you rebase?
Hi! Just got the notification. I can't get to it right now, but I'll rebase within 48 hours.
Sure!
Added a sample_weight parameter to confusion_matrix, which is used when building the confusion matrix. If not provided, it defaults to np.ones() with size equal to the number of samples. Tests were added in test_classification.py instead of test_common.py.
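A minimal usage sketch of the new parameter (the toy labels and weights below are made up for illustration, with expected output shown in comments): unit weights reproduce the unweighted matrix, while non-uniform weights scale each cell by the weights of the samples that fall into it.

    import numpy as np
    from sklearn.metrics import confusion_matrix

    y_true = [0, 0, 1, 1, 2]
    y_pred = [0, 1, 1, 1, 2]

    confusion_matrix(y_true, y_pred)
    # array([[1, 1, 0],
    #        [0, 2, 0],
    #        [0, 0, 1]])

    # Unit weights are equivalent to the default (no sample_weight).
    confusion_matrix(y_true, y_pred, sample_weight=np.ones(5))
    # array([[1., 1., 0.],
    #        [0., 2., 0.],
    #        [0., 0., 1.]])

    # Non-uniform weights: each sample contributes its weight to its cell.
    confusion_matrix(y_true, y_pred, sample_weight=[0.5, 2.0, 1.0, 1.0, 3.0])
    # array([[0.5, 2. , 0. ],
    #        [0. , 2. , 0. ],
    #        [0. , 0. , 3. ]])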
Replaced the hard-coded result matrix with a computed expected value.
Should be OK now. If there's anything else you need, just let me know!
# confusion_matrix with sample_weight is in
# test_classification.py
"hamming_loss",
"matthews_corrcoef_score",
You can remove "hamming_loss" and "matthews_corrcoef_score"; sample_weight support was added for them recently.
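As a quick sanity check, hamming_loss accepts sample_weight in current scikit-learn; treat this as an assumption for the exact version targeted by this PR.

    from sklearn.metrics import hamming_loss

    # Mismatches at positions 1 and 3.
    hamming_loss([0, 1, 1, 0], [0, 0, 1, 1])
    # 0.5  (2 mismatches out of 4 samples)

    hamming_loss([0, 1, 1, 0], [0, 0, 1, 1], sample_weight=[1, 3, 1, 1])
    # 0.666...  (mismatch weight 3 + 1 over total weight 6)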
Just that minor comment. Will merge after that.
Sorry about that! Please let me know if anything else is needed. Thanks!
if sample_weight is None:
    sample_weight = np.ones(y_true.shape[0], dtype=np.int)
else:
    sample_weight = np.asarray(sample_weight)
Nitpick: can you add a check here to verify that sample_weight is the same size as y_true and y_pred, using check_consistent_length?
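For illustration, a minimal sketch of that suggestion. The helper name _weights_or_default is hypothetical and only paraphrases the PR's logic; check_consistent_length is the real sklearn.utils helper, which raises a ValueError when its arguments have different numbers of samples.

    import numpy as np
    from sklearn.utils import check_consistent_length

    def _weights_or_default(y_true, y_pred, sample_weight=None):
        # Hypothetical helper: default to unit weights, then make sure all
        # three arrays have the same number of samples.
        y_true = np.asarray(y_true)
        if sample_weight is None:
            sample_weight = np.ones(y_true.shape[0])
        else:
            sample_weight = np.asarray(sample_weight)
        check_consistent_length(y_true, y_pred, sample_weight)
        return sample_weight

    _weights_or_default([0, 1, 2], [0, 2, 2])                   # array([1., 1., 1.])
    _weights_or_default([0, 1, 2], [0, 2, 2], [0.5, 1.0, 2.0])  # array([0.5, 1. , 2. ])
    # _weights_or_default([0, 1, 2], [0, 2, 2], [0.5, 1.0])     # raises ValueError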
@MechCoder sure, seems like a good addition. I added the check after the highlighted code so that, if the preceding code changes in the future, the check still catches any problems.
It would be good to have some tests for this behavior, but I'm a little short on time over the next few weeks, so it's probably better to merge this now; I'll open another PR later to improve the tests. What do you think?
Merged as 01b5b7b after doing a cosmit, adding an error check for inconsistent lengths, and updating whatsnew. Sorry for the year-long wait @DanielSidhion!