
NaNs are converted to int with slicing #4592

Closed

sinhrks opened this issue Apr 6, 2014 · 12 comments

sinhrks commented Apr 6, 2014

Related to #1578: NaNs are still converted to -maxint (INT64_MIN) when assigned via slicing.

>>> np.__version__
'1.9.0.dev-6857173'
>>> i = np.array([1, 2, 3, 4, 5])
>>> i[3] = np.nan
ValueError: cannot convert float NaN to integer
>>> i[0:2] = np.nan
>>> i
array([-9223372036854775808, -9223372036854775808,                    3,
                          4,                    5])
gerritholl (Contributor) commented

Not only when slicing:

In [251]: np.array([np.nan, 0.0]).astype(np.int32)
Out[251]: array([-2147483648,           0], dtype=int32)

charris (Member) commented Jan 10, 2017

@gerritholl What would you expect?

gerritholl (Contributor) commented

@charris I'm not sure what I'd expect, but I would want a warning or error configurable with seterr/errstate.
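
Until such a setting exists, one can guard on the user side. A minimal sketch (the check-and-raise policy here is just an illustration, not anything NumPy does):

import numpy as np

a = np.array([np.nan, 0.0])
# Check for NaN explicitly before the lossy cast, since the cast itself
# neither warns nor raises on these versions.
if np.isnan(a).any():
    raise ValueError("array contains NaN; refusing cast to int32")
b = a.astype(np.int32)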

yungyuc added a commit to yungyuc/solvcon that referenced this issue Nov 27, 2017
A slice of an integer ndarray allows setting numpy.nan while ndarray.__setitem__() disallows it. See numpy/numpy#4592.
yungyuc added a commit to solvcon/solvcon that referenced this issue Nov 27, 2017
Avoid a segfault caused by a numpy bug: numpy/numpy#4592
figiel commented Nov 23, 2018

Another way to stumble on this issue:

>>> np.intp(np.floor(np.nan))
-9223372036854775808

FYI on ARMv8:

>>> np.intp(np.floor(np.nan))
0

tylerjereddy (Contributor) commented

ARMv8 hardware is used in our CI matrix, so if there is a fix or some other action to be taken, we should be able to detect regressions once a test is added.
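
A regression test could look roughly like this; a sketch that assumes the eventual fix pins the result to INT64_MIN on all platforms (the hardware-dependent results above show this is not currently guaranteed):

import numpy as np

def test_nan_to_int_cast_is_consistent():
    # Hypothetical expectation: NaN -> int64 always yields INT64_MIN,
    # matching the x86 behaviour rather than the ARMv8 result above.
    result = np.array([np.nan]).astype(np.int64)[0]
    assert result == np.iinfo(np.int64).min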

mhvk (Contributor) commented Nov 23, 2018

@figiel's example seems very surprising. Looking a little further:

np.intp(np.nan)
# ValueError: cannot convert float NaN to integer
np.intp(np.float64(np.nan))
# -9223372036854775808
type(np.nan)
# float

So there is an odd type dependence. Possibly, in the original issue at the top, the difference is that in the slice case the constant np.nan gets converted to an int before the assignment is attempted.

rth (Contributor) commented Sep 3, 2019

Related to #6109

siddhesh (Contributor) commented Oct 9, 2019

It seems the only case where NumPy needs a consistent definition for this otherwise undefined conversion is NaN -> NaT. I've got tests running for a patch that fixes this on aarch64, since x86 just happens to perform the conversion to INT64_MIN.
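
For context, NaT is stored as the int64 sentinel INT64_MIN, which is why this conversion needs a fixed result there. A small illustration (the NaT output assumes a platform where NaN casts to INT64_MIN, as on x86):

import numpy as np

# NaT shares the int64 sentinel INT64_MIN, so a NaN -> INT64_MIN cast
# round-trips to NaT when going float -> datetime64.
print(np.iinfo(np.int64).min)                      # -9223372036854775808
print(np.array([np.nan]).astype("datetime64[s]"))  # ['NaT'] on such platforms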

javidcf (Contributor) commented Jan 30, 2020

This still happens in 1.18.1. As an additional comment, note that advanced indexing (not just single-element indexing) also produces an error, as summarized below. As I see it, the semantics of assigning NaN to an integer array should be consistently defined: either always convert to the minimum value or always raise an error. The current behaviour can be quite surprising.
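
To summarize the inconsistency as observed on 1.18.1 (behaviour is version-dependent, so the outcomes are noted as comments rather than run):

import numpy as np

i = np.array([1, 2, 3, 4, 5])
# i[3] = np.nan        # ValueError (single-element indexing)
# i[[0, 1]] = np.nan   # ValueError (advanced indexing)
# i[0:2] = np.nan      # silently wrote INT64_MIN (basic slicing)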

WarrenWeckesser (Member) commented

Update: The original issue was that i[3] = np.nan generated an error, but i[0:2] = np.nan did not. I don't know when, but this appears to have been fixed:

In [1]: import numpy as np

In [2]: np.__version__
Out[2]: '1.22.0.dev0+1696.g5cc7ef066'

In [3]: i = np.array([1, 2, 3, 4, 5])

In [4]: i[0:2] = np.nan
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-4-6feac306eca4> in <module>
----> 1 i[0:2] = np.nan

ValueError: cannot convert float NaN to integer

However (probably because of the type dependence that @mhvk pointed out above), this does not produce an error:

In [5]: i[0:2] = np.array([np.nan, np.nan])

In [6]: i
Out[6]: 
array([-9223372036854775808, -9223372036854775808,                    3,
                          4,                    5])
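
One way to make that silent path explicit on the user side; a minimal sketch (the zero fill value is an arbitrary choice for illustration):

import numpy as np

i = np.array([1, 2, 3, 4, 5])
src = np.array([np.nan, np.nan])
# Replace NaNs with an explicit fill value before assigning into the
# integer array, instead of relying on the silent cast.
i[0:2] = np.nan_to_num(src, nan=0).astype(i.dtype)
print(i)   # [0 0 3 4 5]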

seberg (Member) commented Nov 8, 2021

Also xref gh-17495; this is almost a duplicate of gh-6109, although with more of a focus on the differences caused by casting vs. element setting.

The main reason is that we mix casting (float64 to int64) with setting a single element from a scalar. The first tries not to error and fails to warn (this is a bug, see gh-14412). It gets even worse: switching to the casting behaviour for certain edge cases broke pandas, which IIRC is why it is stricter now up there (making it less strict would have had more effect on pandas, and it was even more random before).

The one problem here (currently): item setting doesn't know about cast safety, etc., so it cannot imitate normal casting. It effectively applies its own notion of cast safety that the casting machinery doesn't know about: assignments are always unsafe, but it chooses to error on particularly nonsensical conversions.
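
The two code paths can be seen side by side; a sketch (the INT64_MIN result in the cast case is platform-dependent, as noted earlier in the thread):

import numpy as np

i = np.array([1, 2, 3], dtype=np.int64)

# Path 1: setting a single element from a scalar rejects NaN outright.
try:
    i[0] = np.nan
except ValueError as exc:
    print(exc)            # cannot convert float NaN to integer

# Path 2: a cast takes the (unsafe) casting route and, on most platforms,
# silently produces INT64_MIN instead of raising.
print(np.array([np.nan]).astype(np.int64))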

seberg (Member) commented Jun 14, 2022

NumPy will now give a warning on the main branch (settable using np.errstate). We could do more, or sanitize the output, though. I have left gh-14412 open to track that possibility (please comment there if you feel it is important).
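
A sketch of what that looks like, assuming a NumPy version that includes the change (the warning landed after this comment, in the 1.24 release):

import numpy as np

a = np.array([np.nan, 0.0])
# By default the invalid cast now emits a RuntimeWarning; np.errstate can
# escalate it to an exception (or silence it entirely).
with np.errstate(invalid="raise"):
    a.astype(np.int64)    # raises FloatingPointError under this errstate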

Otherwise, closing this issue, since I think the warning is a good step in the right direction (and I am not sure whether more will be feasible, especially in the foreseeable future).

seberg closed this as completed Jun 14, 2022