DEP: Deprecate conversion of out-of-bound Python integers #22385
Conversation
Any conversion from a Python integer (or subclass) that is stored into a NumPy dtype but does not fit should raise an error in the future. Note that casts between NumPy types (or assignments of them) are explicitly not affected by this. There are certain use cases for allowing such casts, even if they are technically undefined behavior in C. They just work out well in practice in many cases, since e.g. -1 is all 1's in binary representation (two's-complement representation).
This is a temporary solution until such a time where we can have loop specific identities which can handle this more gracefully.
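To illustrate the distinction being drawn here, a minimal sketch (assuming a recent NumPy): conversion of an out-of-bound Python int is what gets deprecated, while casts between NumPy types keep the C-style two's-complement wraparound.

```python
import numpy as np

# Casts between NumPy types are explicitly NOT affected by the deprecation:
# -1 as int8 is all 1's in two's complement, which reinterprets as 255.
assert np.array(-1, dtype=np.int8).astype(np.uint8) == 255

# And the reverse cast reinterprets the same bit pattern back to -1.
assert np.array(255, dtype=np.uint8).astype(np.int8) == -1
```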
This wraps the fill value into an array; the default fill value for all integers is 999999, which doesn't fit many integer dtypes. Note that this might still subtly change the behavior in other code paths where we cannot avoid this. Plus, the DeprecationWarning may show up (and in fact be an "in the future will use the default fill value" warning).
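For context, a small sketch of why the fill value is a problem (assuming a recent NumPy): the masked-array default fill value for integers does not fit into small integer dtypes, and going through the array-casting path wraps instead of erroring.

```python
import numpy as np

# The default masked-array fill value for integer dtypes is 999999.
assert np.ma.default_fill_value(np.dtype(np.int64)) == 999999

# That value does not fit in int8; the NumPy-to-NumPy cast path
# (which wrapping the fill value into an array routes through)
# wraps C-style instead of raising: 999999 % 256 == 63.
fv = np.array(999999).astype(np.int8)
assert fv == 63
```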
Force-pushed from df872b4 to 7a951f9 (Compare)
# TODO: This is probably a mess, but should best preserve behavior?
vals = tuple(
    np.array(_recursive_fill_value(dtype[name], f))
    for name in dtype.names)
Did you reach the conclusion that this is correct based on a test failure? I don't see an added test, but the lines are not marked by coverage so I assume they are tested.
Yes, there were a few test failures (mainly in the recarray code that handles a lot of masked arrays with structs).
They go away with this change, which mirrors the non-struct path (which always wraps into array()).
...
It took me a while to figure out that this seems like the easy way out. I don't particularly love it, but it seems to work. It is a bit unclear whether downstream might notice these changes.
+1
if (ufuncimpl == NULL) {
    return NULL;
}
There was a request from Numba to expose the ufunc loop used for a dtype. They did not explicitly ask for reductions, but I imagine that will be the next request. Will it be sufficient to expose reducelike_promote_and_resolve, or will we need more logic?
Good point, I have to think about the API. I would probably consider a single function/entrypoint, but with reduction=True as a kwarg?
There are some weirder things in how we do reduction promotion and type resolution right now, but I imagine that is fine.
Such new API might also just always live in a NEP 50 future.
I don't think it matters much for this PR. To me the decision here seems mainly whether we prefer this churn, over the churn of having a special (internal for now) API for Python integers. (And maybe the timing of both)
I would prefer this churn, and not have a special API for Python integers.
Hmm, that is kind of worrisome. But doesn't #21875 also have a number of SciPy failures?
Yes, but they are a fully distinct set of failures. Merging this has no influence on those failures (besides maybe printing slightly differently or giving two warnings for the same thing). The failures here are things like …
"NumPy will stop allowing conversion of out-of-bound "
"Python integers to integer arrays. The conversion "
"of %.100R to %S will fail in the future.",
obj, descr) < 0) {
Maybe add the astype() workaround, i.e. np.array(-1, np.uint8) will raise here, but np.array(-1).astype(np.uint8) will do the right thing? I am not sure how to express that this deep in the code path...
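The suggested workaround pair, sketched out (on NumPy 2.x the deprecated conversion has already become an OverflowError, so the example guards for both eras):

```python
import numpy as np

# Direct conversion of an out-of-bound Python int is deprecated
# (and raises OverflowError on NumPy 2.x):
try:
    np.array(-1, dtype=np.uint8)
except OverflowError:
    pass  # expected on newer NumPy

# The explicit cast keeps working, via C-style wraparound:
assert np.array(-1).astype(np.uint8) == 255
```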
Good idea, also added a release note. If this becomes too painful for downstream, we could consider allowing np.uint8(-1), making it explicitly work for negative values. (I would still restrict this to the range of the corresponding signed integer, so -128 for uint8.)
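A hypothetical sketch of that fallback rule (coerce_uint8 is an illustrative helper, not NumPy API): negative Python ints are accepted only down to the signed counterpart's minimum and wrap two's-complement style; everything else out of range is rejected.

```python
import numpy as np

def coerce_uint8(value):
    # Hypothetical helper sketching the proposal above: allow negatives
    # down to int8's minimum (-128), wrapping them modulo 256, while
    # still rejecting values that fit in neither range.
    uinfo = np.iinfo(np.uint8)
    sinfo = np.iinfo(np.int8)
    if sinfo.min <= value < 0:
        # e.g. -1 -> 255, -128 -> 128 (two's-complement wraparound)
        return np.uint8(value % (uinfo.max + 1))
    if uinfo.min <= value <= uinfo.max:
        return np.uint8(value)
    raise OverflowError(f"{value} out of the allowed range for uint8")
```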
we might need that as a fallback
LGTM. Only a small nit in the release note. Anyone else want to take a look?
Co-authored-by: Matti Picus <matti.picus@gmail.com>
I will update it to remove the special parsing path. They will always use the normal parsing paths instead (might need some test adjustments). It won't be much simpler, but we won't have the "special" path at least.
After the change in numpy/numpy#22385, numpy raises a deprecation warning with calls such as np.int8(5000) and np.uint32(-1). This change avoids such calls in the tests.
Fixing the SciPy tests wasn't too bad: scipy/scipy#17209
A few important notes:
I tested SciPy; it has a fair number of failures, but they seem manageable.
Overall, I am not sure about this, but we need a decision on whether we do this or stick with the original gh-21875 proposal. The other PR would be a bit simpler, but I am no longer quite convinced that coupling the two is worth the simplification there. My worry is that the MA code may be strange, but downstream might also have similarly hard-to-analyze issues with the change.