Float arrays' comparisons in tests #4400

artsobolev · 2015-03-17T10:35:55Z

A recent issue indicated a flaw in many of sklearn's tests: there are many places where arrays are compared using assert_array_equal which does not take float's lack of precision into account.

Sometimes, though, we might expect a tested functionality to return exactly the same value — when checking, say, predict. It seems legitimate to use strict comparison in those cases.

Even though apparently this is not a problem at the moment (at least no one filed a bunch of bug reports like the one I mentioned), we might want to do something with it. Some of the possible fixes are:

Redefine assert_array_equal to use approximate comparison in case of floating data type. Might break guarantees like "predict returns the same values that were passed in y".
Replace assert_array_equal with assert_array_almost_equal when appropriate. This is a huge body of work, there are at least 229 tests that compare float arrays using assert_array_equal.
Ignore it until somebody files an issue. Tests pass right now, so we're good :-)

The text was updated successfully, but these errors were encountered:

amueller · 2015-03-17T14:28:26Z

The "are" link is actually "almost" ;)

We should use assert_almost_equal for floats, and assert_equal for ints.
That means that for classification and clustering, we expect the exact same outcome, but for regression and embeddings we don't.

I am very certain that 2. is the way to go. And 229 lines are not that bad. I am quite sure that we are in not too bad a shape, and most uses of assert_array_equal are actually on ints.

artsobolev · 2015-03-17T14:32:47Z

@amueller I got 229 not by greping source code, but by redefining assert_array_equal to raise an exception when called on float arguments (both arguments should be float numpy arrays). So 229 (# of failed tests) is a lower bound, since there could easily be more than one assert_array_equal in a test.

amueller · 2015-03-17T14:36:34Z

Ah. I grepped and got ~700.
Still, doable. I replaced input validation in all classes not so recently and had to edit basically all files.
While there may be many lines to edit, they are mostly concentrated in a few tests.

ogrisel · 2015-03-18T08:59:31Z

I share @amueller's position on that matter.

Attempt to deal with scikit-learn#4400

Deals with scikit-learn#4400

amueller added Easy Well-defined and straightforward way to resolve Need Contributor labels Oct 27, 2016

chenhe95 mentioned this issue Dec 2, 2016

[WIP] Fix gradient boosting overflow #7959

Closed

amueller added the Sprint label Mar 3, 2017

venthur added a commit to flix-tech/scikit-learn that referenced this issue Sep 14, 2017

Fix float array comparisons for naive_bayes.

575dc5a

Attempt to deal with scikit-learn#4400

venthur mentioned this issue Sep 14, 2017

[MRG] Fix float array comparisons for naive_bayes. #9765

Closed

venthur added a commit to flix-tech/scikit-learn that referenced this issue Sep 14, 2017

Fix float array comparisons in test_dummy.

e82f250

Deals with scikit-learn#4400

This was referenced Sep 14, 2017

[MRG] Fix float array comparisons in test_dummy. #9766

Closed

[MRG+1] Replace assert_array_equal with -assert_array_almost_equal where necessary. #9774

Merged

jnothman closed this as completed in #9774 Sep 18, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Float arrays' comparisons in tests #4400

Float arrays' comparisons in tests #4400

artsobolev commented Mar 17, 2015

amueller commented Mar 17, 2015

Uh oh!

artsobolev commented Mar 17, 2015

Uh oh!

amueller commented Mar 17, 2015

Uh oh!

ogrisel commented Mar 18, 2015

Uh oh!

Uh oh!

Float arrays' comparisons in tests #4400

Float arrays' comparisons in tests #4400

Comments

artsobolev commented Mar 17, 2015

amueller commented Mar 17, 2015

Uh oh!

artsobolev commented Mar 17, 2015

Uh oh!

amueller commented Mar 17, 2015

Uh oh!

ogrisel commented Mar 18, 2015

Uh oh!