BUG: Make `mask_invalid` consistent with `mask_where` if `copy` is set to `False` #22046

cmarmo · 2022-07-26T01:43:03Z

This pull requests makes mask_invalid consistent with mask_where when copy is set to False.
Fixes #19332.

I'd rather make the two functions consistent than change the documentation.
Also from the documentation

Only applies to arrays with a dtype where NaNs or infs make sense

I added a test checking that an error is thrown when isinfinite is not applicable.

Thanks for considering it.

…e. Add test for type erroring.

bsipocz · 2022-09-07T17:00:24Z

numpy/ma/core.py

+    try:
+        return masked_where(~(np.isfinite(getdata(a))), a, copy=copy)
+    except TypeError:
+        raise


Was there a particular reason for this try/except, or is it a debug remnant?

Suggested change

try:

return masked_where(~(np.isfinite(getdata(a))), a, copy=copy)

except TypeError:

raise

return masked_where(~(np.isfinite(getdata(a))), a, copy=copy)

Indeed, I can't recall why I put the try/except. Suggestion applied. Thanks!

seberg · 2022-09-07T17:02:25Z

numpy/ma/tests/test_core.py

+        with pytest.raises(TypeError,
+                           match="not supported for the input types"):
+            np.ma.masked_invalid(a)
+


We were looking at this in the triage meeting as well. I guess the test is great (I honestly still need to parse it fully).
But we are missing an additional test for the successful path that was fixed I think: Checking that the array was indeed modified in-place with copy=False?

I have added a new test in test_masked_array_no_copy() ... just to explain myself: I didn't add it in the first place because mask_invalid becomes a straight call of masked_where which is already tested. But I guess the more tests we have the better?
Thanks!

Yeah, thanks. It is true that masked_where is tested, but it is also just nice to see the fix in action in the PR and we mainly have integration tests anyway.

A test too much cannot hurt :).

I find it odd that we consider inf invalid by default, but that is not a change.

Thanks @cmarmo, lets get this in!

seberg · 2022-09-22T07:56:12Z

I suppose we should do this just to align things anyway...

Although, now I actually suspect that this may have been intentional (Chesterton's fence greets): The function always replaces the mask, but does not always copy the data, which is a pattern to the madness that does make sense, the docs are probably just fuzzy on that intention (if it was the intention).
Sorry, long shot, but @ahaldane do you happen to have a gut feeling on this?

InessaPawson · 2022-10-05T16:28:18Z

@mhvk Do you have any thoughts on this?

mhvk · 2022-10-05T17:03:17Z

Not really. Overall, to me this PR makes sense: do as the doc states and just call masked_where. I've never understood why if data is kept, masks are not, though clearly it was designed that way.

seberg · 2022-10-05T17:09:50Z

Thanks @cmarmo and Marten, lets give this a shot then!

charris · 2022-10-05T17:12:42Z

Should probably add a release note for this.

seberg · 2022-10-05T17:19:36Z

Hmmm, maybe better. @cmarmo are you interested in having a look at that, the instructions are in numpy/doc/release/upcoming_changes/README.txt (you basically add a file in that folder with a 22046.change.rst as name).

Otherwise, hopefully I will remember to follow up and do it :).

bsipocz · 2022-10-05T17:35:03Z

Hmm, if the label is used consistently, a script could help to look for missing release notes before release time?

cmarmo · 2022-10-05T21:36:00Z

@cmarmo are you interested in having a look at that, the instructions are in numpy/doc/release/upcoming_changes/README.txt (you basically add a file in that folder with a 22046.change.rst as name).

I can take care of that if it can wait until the week-end. :)

This pull request add the changelog for #22046.

This is the minimal solution to fix numpygh-22826 with as little change as possible. We should fix `getdata()` but I don't want to do that in a bug-fix release really. IMO the alternative is to revert numpygh-22046 which would also revert the behavior noticed in numpygh-22720 (which seems less harmful though). Closes numpygh-22826

cmarmo added 2 commits July 25, 2022 15:41

Make mask_invalid consistent with mask_where when copy is set to Fals…

44c8da9

…e. Add test for type erroring.

Fix lint.

2d73e10

cmarmo changed the title ~~Make mask_invalid consistent with mask_where if copy is set to False~~ BUG: Make mask_invalid consistent with mask_where if copy is set to False Aug 1, 2022

github-actions bot added the 00 - Bug label Aug 1, 2022

bsipocz reviewed Sep 7, 2022

View reviewed changes

seberg reviewed Sep 7, 2022

View reviewed changes

Remove try statement. Add test.

4213779

cmarmo force-pushed the masked-invalid branch from 071082b to 4213779 Compare September 7, 2022 18:26

seberg merged commit 02b68f1 into numpy:main Oct 5, 2022

bsipocz added the 56 - Needs Release Note. Needs an entry in doc/release/upcoming_changes label Oct 5, 2022

cmarmo deleted the masked-invalid branch October 8, 2022 07:01

cmarmo mentioned this pull request Oct 8, 2022

DOC: Add changelog for masked_invalid change. #22406

Merged

seberg pushed a commit that referenced this pull request Oct 14, 2022

DOC: Add changelog for masked_invalid change. (#22406)

6d56ebb

This pull request add the changelog for #22046.

This was referenced Dec 2, 2022

TST: Add dev numpy to devdeps, fix compat with numpy 1.24 astropy/specreduce#153

Closed

BUG: Handle invalid masked_invalid mask astropy/specreduce#151

Closed

seberg mentioned this pull request Dec 19, 2022

BUG: Regression in interaction between numpy.ma and pandas with 1.24.0 #22826

Closed

This was referenced Dec 19, 2022

BUG: masked_invalid does not accept pandas.Series #22829

Closed

[Bug]: Some matplotlib functions do not work anymore with pandas and numpy 1.24.0 matplotlib/matplotlib#24773

Closed

seberg mentioned this pull request Dec 19, 2022

BUG: Do not use getdata() in np.ma.masked_invalid #22838

Merged

charris mentioned this pull request Dec 19, 2022

BUG: Do not use getdata() in np.ma.masked_invalid #22839

Merged

LoicGrobol mentioned this pull request Dec 20, 2022

BUG: ufunc 'isfinite' not supported for datetime64 data on version 1.24.0 #22842

Closed

vegardkv mentioned this pull request Jan 2, 2023

BUG: Changed behavior for masked_invalid in 1.24 for Fortran ordered arrays #22912

Closed

seberg mentioned this pull request Jan 4, 2023

ENH: Adding __array_ufunc__ capability to MaskedArrays (again) #22914

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: Make `mask_invalid` consistent with `mask_where` if `copy` is set to `False` #22046

BUG: Make `mask_invalid` consistent with `mask_where` if `copy` is set to `False` #22046

cmarmo commented Jul 26, 2022

bsipocz Sep 7, 2022

cmarmo Sep 7, 2022

seberg Sep 7, 2022

cmarmo Sep 7, 2022

seberg Sep 22, 2022

seberg commented Sep 22, 2022

InessaPawson commented Oct 5, 2022

mhvk commented Oct 5, 2022

seberg commented Oct 5, 2022

charris commented Oct 5, 2022

seberg commented Oct 5, 2022

bsipocz commented Oct 5, 2022

cmarmo commented Oct 5, 2022

BUG: Make mask_invalid consistent with mask_where if copy is set to False #22046

BUG: Make mask_invalid consistent with mask_where if copy is set to False #22046

Conversation

cmarmo commented Jul 26, 2022

bsipocz Sep 7, 2022

Choose a reason for hiding this comment

cmarmo Sep 7, 2022

Choose a reason for hiding this comment

seberg Sep 7, 2022

Choose a reason for hiding this comment

cmarmo Sep 7, 2022

Choose a reason for hiding this comment

seberg Sep 22, 2022

Choose a reason for hiding this comment

seberg commented Sep 22, 2022

InessaPawson commented Oct 5, 2022

mhvk commented Oct 5, 2022

seberg commented Oct 5, 2022

charris commented Oct 5, 2022

seberg commented Oct 5, 2022

bsipocz commented Oct 5, 2022

cmarmo commented Oct 5, 2022

BUG: Make `mask_invalid` consistent with `mask_where` if `copy` is set to `False` #22046

BUG: Make `mask_invalid` consistent with `mask_where` if `copy` is set to `False` #22046