Fix roc #6693

msoelch · 2016-04-21T09:47:00Z

Reference Issue

What does this implement/fix? Explain your changes.

It sets the tolerance threshold to 0.0 so that only identical thresholds are ignored.
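For readers who want to see the effect, here is a minimal sketch. It assumes the deduplication step compares consecutive differences of the sorted scores against zero with np.isclose, which is how the thread describes it. The helper name distinct_threshold_count and the sample scores are illustrative only and are not taken from the scikit-learn source.

```python
import numpy as np


def distinct_threshold_count(y_score, rtol=1e-05, atol=1e-08):
    """Count the thresholds that survive tolerance-based deduplication.

    Hypothetical helper for illustration; not part of scikit-learn.
    """
    sorted_scores = np.sort(np.asarray(y_score, dtype=float))[::-1]
    # A step between neighbouring scores counts as "no change" when it is
    # within the tolerance of zero.
    close = np.isclose(np.diff(sorted_scores), 0, rtol=rtol, atol=atol)
    # One threshold per surviving step, plus the last score.
    return int(np.count_nonzero(~close)) + 1


# Four clearly different scores, but all gaps are below NumPy's default atol.
scores = np.array([4e-9, 3e-9, 2e-9, 1e-9])

default_count = distinct_threshold_count(scores)
exact_count = distinct_threshold_count(scores, rtol=0.0, atol=0.0)
```

With NumPy's default tolerances the four scores collapse into a single threshold, while zero tolerances keep all four.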

Any other comments?

This is a quick fix at the cost of time efficiency. One can think of several fixes, all of which have different drawbacks:

affine rescaling of the scores to [0, 1]
scale the tolerance w.r.t. the std in y_score
add additional arguments to roc_curve for the user to set a_tol and r_tol
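As a rough illustration of the first option, the sketch below rescales scores to [0, 1] with a min-max transform. The function name rescale_to_unit_interval and the sample data are assumptions made for this example. Because the transform is monotone, the ranking of the scores is unchanged, but gaps that were below an absolute tolerance of 1e-8 become large.

```python
import numpy as np


def rescale_to_unit_interval(y_score):
    """Affinely map scores onto [0, 1] (illustrative helper, not sklearn API)."""
    y_score = np.asarray(y_score, dtype=float)
    lo, hi = y_score.min(), y_score.max()
    if hi == lo:
        # All scores identical: there is nothing to spread out.
        return np.zeros_like(y_score)
    return (y_score - lo) / (hi - lo)


scores = np.array([4e-9, 3e-9, 2e-9, 1e-9])
rescaled = rescale_to_unit_interval(scores)

# Gaps between neighbouring scores before and after rescaling.
gaps_before = np.abs(np.diff(np.sort(scores)))
gaps_after = np.abs(np.diff(np.sort(rescaled)))
```

The second option (scaling the tolerance by the standard deviation of y_score) would instead leave the scores alone and shrink the tolerance.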

raghavrv · 2016-04-21T12:27:36Z

cc: @jnothman

jnothman · 2016-04-21T12:47:23Z

So, this is effectively a reversion of @jblackburne's #3268, to which @arjoly objected, and which I and @ogrisel agreed was acceptable. You're right that there are solutions with dynamic tolerance. It seems like at least the fixed tolerance should be configurable.

msoelch · 2016-04-21T13:30:06Z

So what to do? Should I add function parameters to the top-most roc_curve function? That's a significant interface change, and I didn't want to start out that boldly, but it's the only one of the listed options that does not obscure other use cases.

As you already stated in the corresponding issue, the default should be 0.0, like I implemented it in this PR, the reason being that the user does not even know that some potential thresholds are just discarded because they don't fulfill a more or less arbitrary tolerance threshold.

msoelch · 2016-04-21T13:56:56Z

I just checked the duplicate PR. I don't think that @arjoly's (understandable!) arguments apply in this case. In my real problem where this bug occurred, the reduced number of thresholds suddenly yields artifacts in the ROC curve.

I don't want to get into the details of my actual problem, but here you can see what happens. The input gradient norm line (light blue) should look like it does in this file: correct (https://github.com/scikit-learn/scikit-learn/files/230022/on_roc.pdf)
However, this can only be achieved by scaling the scores (identical in both plots!) by a factor of 10. If I don't do this, it looks like this: incorrect (https://github.com/scikit-learn/scikit-learn/files/230018/on_roc.pdf)
You cannot see it here, but I also did a scatter plot of the incorrect curve and found that the vertical part consists of a lot of points, while the diagonal part consists of no points except for its two endpoints.

And here, it's not just plotting artifacts: the evaluation of my classifier is just plain wrong (from close to perfect to suboptimal...).

jnothman · 2016-04-21T14:05:21Z

I don't think there's a big problem with adding new parameters. It's better
than having the mechanism hidden away and biting people at random. Making
this issue more obvious is also an option: warn if isclose is going to
conflate neighboring scores, and give a suggested solution to the user.
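A warning along those lines might look roughly like the sketch below. The function name warn_if_scores_conflated, the message wording, and the default tolerances (NumPy's np.isclose defaults) are assumptions for illustration, not an existing scikit-learn API.

```python
import warnings

import numpy as np


def warn_if_scores_conflated(y_score, rtol=1e-05, atol=1e-08):
    """Warn when distinct neighbouring scores fall within the tolerance.

    Hypothetical helper; returns the number of conflated neighbour pairs.
    """
    gaps = np.diff(np.sort(np.asarray(y_score, dtype=float)))
    # Conflated means: not exactly equal, yet treated as equal by isclose.
    conflated = (gaps != 0) & np.isclose(gaps, 0, rtol=rtol, atol=atol)
    n_conflated = int(np.count_nonzero(conflated))
    if n_conflated:
        warnings.warn(
            f"{n_conflated} pair(s) of distinct neighbouring scores are closer "
            "than the tolerance and would be merged into one threshold; "
            "consider rescaling y_score.",
            UserWarning,
        )
    return n_conflated


# Well-separated scores with one exact duplicate: nothing is conflated.
n_ok = warn_if_scores_conflated([0.1, 0.2, 0.2])
```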


msoelch · 2016-04-21T14:29:14Z

OK, will do later.

jblackburne · 2016-04-22T19:20:05Z

Hi, I caused all this with my previous PR. Sorry.

Looking back, I could have solved my problem more easily by just rounding my y_score before passing it into roc_curve(). It wasn't really fair to ask sklearn to solve what was essentially a problem in my client code.

So instead of making the threshold dropping more clever or adding kwargs that almost nobody will ever use, I propose just dropping the isclose altogether. This would shift the responsibility for controlling roundoff back to the client where it belongs. I could even put together a PR to do that if you like.
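For concreteness, client-side rounding could look like the sketch below. The sample scores and the choice of six decimals are made up for illustration; the right precision depends on the application. The rounded array is what would then be passed to roc_curve.

```python
import numpy as np

# Scores that differ only by round-off noise (illustrative data).
y_score = np.array([0.7, 0.7 + 1e-12, 0.2, 0.2 - 1e-12])

# Round in client code so that noise-level differences become exact ties.
y_score_rounded = np.round(y_score, decimals=6)

n_distinct_before = np.unique(y_score).size
n_distinct_after = np.unique(y_score_rounded).size
```

After rounding, the four noisy values reduce to two distinct scores, so exact-equality deduplication is enough.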

msoelch · 2016-04-22T19:49:38Z

Thanks for the clarification.

Upon reconsideration, I don't think extra arguments are the best way. Way too much needs to be changed for little improvement, and documenting it properly is very hard.

I second @jblackburne's point: it should be left to the user to check y_score (which was the very first thing I did when I ran into the error). However, I don't think a new PR is necessary, because using isclose to drop exact duplicates is fine, and that is what the current state of this PR achieves (it sets the tolerances to 0).

jblackburne · 2016-04-23T00:03:12Z

Well, once atol and rtol are zero, isclose is a no-op. That will be confusing for readers of the code. Also note that we can delete the isclose function from utils/fixes.py if we get rid of it here.

EDIT: That's inaccurate, it's not a no-op. What I mean is that where and diff eliminate the exactly equal scores by themselves without needing logical_not and isclose.
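The redundancy can be checked with a small sketch. The array names are illustrative and the scores are assumed to be already sorted in decreasing order; appending the last index to form the threshold indices is an assumption about how the surrounding code uses the result.

```python
import numpy as np

# Scores sorted in decreasing order, with exact duplicates (illustrative data).
sorted_scores = np.array([0.9, 0.9, 0.5, 0.5, 0.5, 0.1])

# Variant with zero tolerances: isclose reduces to an exact equality test.
idx_with_isclose = np.where(
    np.logical_not(np.isclose(np.diff(sorted_scores), 0, rtol=0, atol=0))
)[0]

# Variant without isclose: where() already selects the nonzero differences.
idx_plain = np.where(np.diff(sorted_scores))[0]

# Last index of each run of equal scores, plus the final element.
threshold_idxs = np.r_[idx_plain, sorted_scores.size - 1]
thresholds = sorted_scores[threshold_idxs]
```

Both variants select the same indices, so dropping logical_not and isclose does not change which thresholds are kept.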

msoelch · 2016-04-23T08:22:20Z

That's true, it's redundant. If @jnothman agrees, I will adjust this PR accordingly by simply deleting isclose entirely.

jnothman · 2016-04-24T12:14:22Z

Okay. Let's revert, assuming @ogrisel's happy with that. :/

jnothman · 2016-04-24T12:14:57Z

And thanks for the input @jblackburne, even as I'm sorry we believed you the first time :P

amueller · 2016-10-11T01:43:29Z

fixed in #7353.

msoelch added 2 commits April 20, 2016 22:50

fix ROC for low variance scores

b122492

adjust docu after changes in ROC

dd23495

jnothman mentioned this pull request Apr 25, 2016

Bug in metrics.roc_auc_score #3864

Closed

jblackburne mentioned this pull request Sep 7, 2016

[MRG + 1] Remove np.isclose() from ROC curve calculation #7353

Merged

amueller closed this Oct 11, 2016
