DOC: np.append docs should explain how appending different dtypes works #26291

Fleyderer · 2024-04-16T13:45:12Z

Describe the issue:

When trying to append empty list to existing array, numpy array changes its dtype. This is absolutely not obvious behaviour and I've barely found this problem.

It may sound like minor problem, but in my case it was a reason of not deleting about 50k images by its indices, when I had to.

Reproduce the code example:

a = np.array([1, 2], dtype=int)
b = []
np.append(a, b) # array([1., 2.])

Error message:

No response

Python and NumPy Versions:

Python 3.11.5
Numpy 1.26.4

Runtime Environment:

No response

Context for the issue:

No response

rkern · 2024-04-16T14:23:09Z

It's intended behavior, but it should be documented better.

tuhinsharma121 · 2024-04-16T18:24:02Z

@rkern Can I work on a PR to document this behaviour? In that case can you assign this issue to me?

Fleyderer · 2024-04-16T18:36:11Z

What is "intended" in changing dtype of array, which values are not changed¿

rkern · 2024-04-16T20:45:27Z

Both arguments are converted to ndarrays through the usual method of np.asarray(). The default dtype for empty lists is float64. Once the arguments are converted to ndarrays, then their dtypes are compared to figure out the common dtype they can both be safely coerced to; int64 and float64 can both go to float64.

>>> np.asarray([]).dtype
dtype('float64')

If you want to work around this, even in the case of empty lists, be sure to coerce both of your arguments to ndarrays explicitly with dtype=int ahead of time.

>>> a = np.array([1, 2], dtype=int)
>>> b = np.array([], dtype=int)
>>> np.append(a, b)
array([1, 2])

seberg · 2024-04-17T07:12:10Z

For what it's worth, I would agree that for append casting to the first array would probably have been a nicer choice for NumPy, but that doesn't mean I think we should change it (at least not without a long warning which may be tedious).

There was discussion (maybe even a PR?) to add a dtype= argument, which would allow you to write np.append(arr, ..., dtype=arr.dtype).
np.concatenate already has that option and is really basically what np.append uses internally.
That could be reactivated.

(For the empty list case, the dtype= argument may still be awkward due to unsafe casting of the "inferred" float64.)

Improving the docs is also good of course.

Fleyderer added the 00 - Bug label Apr 16, 2024

rkern added 04 - Documentation and removed 00 - Bug labels Apr 16, 2024

ngoldbaum changed the title ~~BUG: np.append does change array dtype~~ DOC: np.append docs should explain how appending different dtypes works Apr 16, 2024

tuhinsharma121 mentioned this issue Apr 18, 2024

DOC: add explanation of dtype to parameter values for np.append #26303

Merged

mattip closed this as completed in #26303 Apr 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

DOC: np.append docs should explain how appending different dtypes works #26291

DOC: np.append docs should explain how appending different dtypes works #26291

Fleyderer commented Apr 16, 2024

rkern commented Apr 16, 2024

Uh oh!

tuhinsharma121 commented Apr 16, 2024

Uh oh!

Fleyderer commented Apr 16, 2024

Uh oh!

rkern commented Apr 16, 2024

Uh oh!

seberg commented Apr 17, 2024

Uh oh!

Uh oh!

DOC: np.append docs should explain how appending different dtypes works #26291

DOC: np.append docs should explain how appending different dtypes works #26291

Comments

Fleyderer commented Apr 16, 2024

Describe the issue:

Reproduce the code example:

Error message:

Python and NumPy Versions:

Runtime Environment:

Context for the issue:

rkern commented Apr 16, 2024

Uh oh!

tuhinsharma121 commented Apr 16, 2024

Uh oh!

Fleyderer commented Apr 16, 2024

Uh oh!

rkern commented Apr 16, 2024

Uh oh!

seberg commented Apr 17, 2024

Uh oh!