BUG: Error when using `nanmax()` and `nanmin()` on an object array with an axis specified. #9008

WarrenWeckesser · 2017-04-27T15:10:36Z

This was the underlying cause of a problem reported on stackoverflow: http://stackoverflow.com/questions/43659827/numpy-error-when-specifying-axis-in-nanmax-while-nansum-works-an-the-same-case

The error is raised when applying nanmax() or nanmin() to an array with object data type and specifying an axis.

Here's the example from my answer:

In [2]: import numpy as np

In [3]: np.__version__
Out[3]: '1.13.0.dev0+bca7922'

In [4]: a = np.array([[1.0, 2.0], [3.0, 4.0]], dtype=object)

In [5]: np.nanmax(a, axis=0)
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-5-a020f98a2536> in <module>()
----> 1 np.nanmax(a, axis=0)

/Users/warren/miniconda3numpy/lib/python3.5/site-packages/numpy-1.13.0.dev0+bca7922-py3.5-macosx-10.6-x86_64.egg/numpy/lib/nanfunctions.py in nanmax(a, axis, out, keepdims)
    343         # Fast, but not safe for subclasses of ndarray
    344         res = np.fmax.reduce(a, axis=axis, out=out, **kwargs)
--> 345         if np.isnan(res).any():
    346             warnings.warn("All-NaN slice encountered", RuntimeWarning, stacklevel=2)
    347     else:

TypeError: ufunc 'isnan' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''

There is no error when the axis is not given:

In [6]: np.nanmax(a)
Out[6]: 4.0

nansum() handles the axis without an error:

In [7]: np.nansum(a, axis=0)
Out[7]: array([4.0, 6.0], dtype=object)

There are several other issues involving the nan-functions and object arrays, but I couldn't tell if this issue is a duplicate. Sorry for the noise if it is.

The text was updated successfully, but these errors were encountered:

Satyadev592 · 2017-04-27T15:30:08Z

http://stackoverflow.com/questions/43659827/numpy-error-when-specifying-axis-in-nanmax-while-nansum-works-an-the-same-case/43660147#43660147

There is an answer with three downvotes that explains why this is not a bug. Happy realisation :)

Nansum and Nanmax are not coded in the same fashion , a detailed explanation is available on the post. It's not an axis issue to begin with...

eric-wieser · 2017-04-27T15:31:46Z

So, ideally this breaks down into:

We need a O->? loop for the np.isnan ufunc (ENH: Add a np.isnan loop for the object dtype (and possible isfinite, ...) #9009)
We need to correct the OO->? loop for fmin and fmax (BUG: np.fmin behaves differently on object arrays #8975)

As a quick workaround, we should just not allow object arrays to take that special case branch

Satyadev592 · 2017-04-27T15:33:31Z

Yes, explicity type cast to float.
Nanmax does in the beginning of it's source code , the following :
res = np.fmax.reduce(a, axis=axis, out=out, **kwargs)
if np.isnan(res).any():

The np.isnan() fails on object type columns.

eric-wieser · 2017-04-27T15:35:46Z

@Satyadev592: Not the right fix. np.fmax.reduce is broken on object columns too.

This is essentially a different manifestation of the problem at #8974, and the quick fix to numpy would be as described there. This is absolutely a bug in numpy.

eric-wieser · 2017-04-27T15:46:22Z

@Satyadev592:

Yes, fmax is broken on object columns as well. But the assumption that axis is the reason for this is wrong. It's purely the data type of the column. The code works with row wise axis just because it coerces the data type to float (purely because of the dataset here)

The code example at the top of this page directly contradicts that claim, does it not?

The problem manifests itself only when all of the following are true: axis is specified, dtype is object, and input is 2d or higher

Satyadev592 · 2017-04-27T17:11:22Z

Take a numpy array with only string values and try the nanmax on either axis. It still pops an error. The error this time is something to do with the fmax, dig a bit deeper , it's got something to do with numpy not having good support for the object datatype. If this is a bug , then a bug for each numpy function should be created for lack of support for object datatypes. This particular problem makes you think axis is a reason while it has nothing to do with the problem.

Also shifting an explicit casting to float inside source code is probably not a good idea. Numpy does not seem to be designed for object types [Well established i think]

Also , the documentation for np.nanmax does not seem to indicate support for object data type (general norm from whatever i have gathered about numpy) :

eric-wieser · 2017-04-27T17:31:34Z

Take a numpy array with only string values

Strings are an entirely different beast, and have little to do with the object dtype. The problem there is that flexible types (np.bytes_, np.void, np.unicode_) are not supported by the ufunc system, due to having variable sizes. object arrays are perfectly supported by ufuncs - it just so happens that we don't have an implementation of isnan for them.

If this is a bug , then a bug for each numpy function should be created for lack of support for object datatypes.

Absolutely. If you can find numpy functions that fail with object arrays, where it would make sense for them not to, then either:

Their documentation should be updated to warn about this
They should be fixed so that they do.

ahaldane · 2017-04-28T02:10:17Z

This issue would be fixed by #6320 (I just tested the example).

[1]: a = np.array([[1.0, 2.0], [3.0, 4.0]], dtype=object)
[2]: np.nanmax(a, axis=0)
array([3.0, 4.0], dtype=object)
[3]: x = np.array([1, np.nan])
[4]: np.fmin.reduce(x)
1.0
[5]: np.fmin.reduce(x.astype(object))
1.0

Fixes numpygh-8974 and numpygh-9008

mattip · 2020-03-17T13:41:40Z

Closing, all the examples work correctly (including the last one np.fmin.reduce(x.astype(object)) which returns nan). Please reopen if I misunderstood.

eric-wieser · 2020-03-17T13:56:39Z

The bug here is that the case you mention should not return nan

eric-wieser · 2020-03-17T13:58:08Z

However that's covered in gh-8975, so fine to close this

eric-wieser · 2020-03-17T14:00:02Z

Closed by gh-9013

eric-wieser added 00 - Bug component: numpy.lib labels Apr 27, 2017

eric-wieser mentioned this issue Apr 27, 2017

ENH: Add a np.isnan loop for the object dtype (and possible isfinite, ...) #9009

Open

eric-wieser mentioned this issue Apr 27, 2017

BUG: Fix np.lib.nanfunctions on object arrays #9013

Merged

eric-wieser added a commit to eric-wieser/numpy that referenced this issue Apr 29, 2017

BUG: Fix incorrect behavior of nanfunctions on object arrays

a77709c

Fixes numpygh-8974 and numpygh-9008

mherkazandjian pushed a commit to mherkazandjian/numpy that referenced this issue May 30, 2017

BUG: Fix incorrect behavior of nanfunctions on object arrays

8a01c7c

Fixes numpygh-8974 and numpygh-9008

mattip closed this as completed Mar 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG: Error when using `nanmax()` and `nanmin()` on an object array with an axis specified. #9008

BUG: Error when using `nanmax()` and `nanmin()` on an object array with an axis specified. #9008

WarrenWeckesser commented Apr 27, 2017

Satyadev592 commented Apr 27, 2017 •

edited

Loading

Uh oh!

eric-wieser commented Apr 27, 2017 •

edited

Loading

Uh oh!

Satyadev592 commented Apr 27, 2017 •

edited

Loading

Uh oh!

eric-wieser commented Apr 27, 2017 •

edited

Loading

Uh oh!

eric-wieser commented Apr 27, 2017 •

edited

Loading

Uh oh!

Satyadev592 commented Apr 27, 2017 •

edited by eric-wieser

Loading

Uh oh!

eric-wieser commented Apr 27, 2017

Uh oh!

ahaldane commented Apr 28, 2017

Uh oh!

mattip commented Mar 17, 2020

Uh oh!

eric-wieser commented Mar 17, 2020

Uh oh!

eric-wieser commented Mar 17, 2020

Uh oh!

eric-wieser commented Mar 17, 2020

Uh oh!

Uh oh!

BUG: Error when using nanmax() and nanmin() on an object array with an axis specified. #9008

BUG: Error when using nanmax() and nanmin() on an object array with an axis specified. #9008

Comments

WarrenWeckesser commented Apr 27, 2017

Satyadev592 commented Apr 27, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eric-wieser commented Apr 27, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Satyadev592 commented Apr 27, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eric-wieser commented Apr 27, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eric-wieser commented Apr 27, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Satyadev592 commented Apr 27, 2017 • edited by eric-wieser Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eric-wieser commented Apr 27, 2017

Uh oh!

ahaldane commented Apr 28, 2017

Uh oh!

mattip commented Mar 17, 2020

Uh oh!

eric-wieser commented Mar 17, 2020

Uh oh!

eric-wieser commented Mar 17, 2020

Uh oh!

eric-wieser commented Mar 17, 2020

Uh oh!

BUG: Error when using `nanmax()` and `nanmin()` on an object array with an axis specified. #9008

BUG: Error when using `nanmax()` and `nanmin()` on an object array with an axis specified. #9008

Satyadev592 commented Apr 27, 2017 •

edited

Loading

eric-wieser commented Apr 27, 2017 •

edited

Loading

Satyadev592 commented Apr 27, 2017 •

edited

Loading

eric-wieser commented Apr 27, 2017 •

edited

Loading

eric-wieser commented Apr 27, 2017 •

edited

Loading

Satyadev592 commented Apr 27, 2017 •

edited by eric-wieser

Loading