np.logical_xor.accumulate fails on 1.24 on mac #22841

knutdrand · 2022-12-20T15:10:51Z

Describe the issue:

np.logical_xor.accumulate gives the wrong result on the newest version (1.24). I suspect the same applies to bitwise_xor.

Reproduce the code example:

import numpy as np

a = np.array([False, False,  True, False, False, False, False,  True, False, False,  True, False, False,  True, False, False, False, False, False, False])
np.logical_xor.accumulate(a)
# Returns on 1.24:
# array([False, False,  True,  True, False, False, False,  True,  True,
#        False,  True,  True, False,  True,  True, False, False, False,
#        False, False])
# Should be [False, False,  True,  True,  True,  True,  True, False, ...]
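As a cross-check that does not go through the ufunc loops, the expected running XOR can be computed with the standard library alone. This is only a reference sketch; the name `expected` is illustrative.

```python
from itertools import accumulate
from operator import xor

a = [False, False, True, False, False, False, False, True, False, False,
     True, False, False, True, False, False, False, False, False, False]

# Running XOR: element i is the XOR of a[0] through a[i].
expected = list(accumulate(a, xor))
print(expected[:8])
# [False, False, True, True, True, True, True, False]
```

On an unaffected NumPy build, `np.logical_xor.accumulate(np.array(a)).tolist()` should equal `expected`.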

Error message:

No response

NumPy/Python version information:

numpy 1.24
python 3.8
macOS 12.5.1

Context for the issue:

No response

seberg · 2022-12-20T15:22:56Z

Again ping @seiko2plus and maybe @Developer-Ecosystem-Engineering. This is probably the same issue as gh-22840 (only slightly different here because of the reduction), presumably introduced by gh-22167.

Viewing the result as uint8 (I am not sure how significant this is without digging in):

>>> a = np.array([False, False,  True, False, False, False, False,  True, False, False,  True, False, False,  True, False, False, False, False, False, False])
>>> np.logical_xor.accumulate(a).view(np.uint8)
array([  0,   0, 254, 254,   0,   0,   0, 254, 254,   0, 254, 254,   0,
       254, 254,   0,   0,   0,   0,   0], dtype=uint8)
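NumPy stores canonical booleans as the bytes 0 and 1, so the 254 values above are not normal bool payloads. A minimal sketch for spotting such bytes in a result follows; the helper name `noncanonical_bool_bytes` is hypothetical and not part of NumPy.

```python
import numpy as np

def noncanonical_bool_bytes(arr):
    """Return the distinct raw byte values of a bool array other than 0 and 1."""
    raw = np.asarray(arr, dtype=bool).view(np.uint8)
    return sorted(set(raw.tolist()) - {0, 1})

# A healthy bool array only contains the bytes 0 and 1.
healthy = np.array([False, True, True])
print(noncanonical_bool_bytes(healthy))  # []

# Reinterpreting raw bytes as bool reproduces the kind of payload seen above.
suspect = np.array([0, 254, 1], dtype=np.uint8).view(np.bool_)
print(noncanonical_bool_bytes(suspect))  # [254]
```

Applied to the output of `np.logical_xor.accumulate(a)`, an empty list would be expected on an unaffected build.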

Developer-Ecosystem-Engineering · 2022-12-20T22:07:34Z

Hi @seberg,

This appears related to #21483. BOOL_logical_xor was aliased to BOOL_not_equal, which was subsequently changed.

seiko2plus · 2022-12-21T01:29:45Z

Memory overlaps! The comparison loops for all data types lack overlap checks:

numpy/numpy/core/src/umath/fast_loop_macros.h

Lines 381 to 392 in 0d1bb8e

    
    #define IS_BLOCKABLE_BINARY_BOOL(esize, vsize) \
        (steps[0] == (esize) && steps[0] == steps[1] && steps[2] == (1) && \
         npy_is_aligned(args[1], (esize)) && \
         npy_is_aligned(args[0], (esize)))
    #define IS_BLOCKABLE_BINARY_SCALAR1_BOOL(esize, vsize) \
        (steps[0] == 0 && steps[1] == (esize) && steps[2] == (1) && \
         npy_is_aligned(args[1], (esize)))
    #define IS_BLOCKABLE_BINARY_SCALAR2_BOOL(esize, vsize) \
        (steps[0] == (esize) && steps[1] == 0 && steps[2] == (1) && \
         npy_is_aligned(args[0], (esize)))

numpy/numpy/core/src/umath/loops_comparison.dispatch.c.src

Lines 315 to 328 in 0d1bb8e

    
    /* argument one scalar */
    if (IS_BLOCKABLE_BINARY_SCALAR1_BOOL(sizeof(@type@), NPY_SIMD_WIDTH)) {
        simd_binary_scalar1_@kind@_@sfx@(args, dimensions[0]);
        return;
    }
    /* argument two scalar */
    else if (IS_BLOCKABLE_BINARY_SCALAR2_BOOL(sizeof(@type@), NPY_SIMD_WIDTH)) {
        simd_binary_scalar2_@kind@_@sfx@(args, dimensions[0]);
        return;
    }
    else if (IS_BLOCKABLE_BINARY_BOOL(sizeof(@type@), NPY_SIMD_WIDTH)) {
        simd_binary_@kind@_@sfx@(args, dimensions[0]);
        return;
    }

+    // multiply by sizeof(@type@) due to SIMD unroll
     /* argument one scalar */
-    if (IS_BLOCKABLE_BINARY_SCALAR1_BOOL(sizeof(@type@), NPY_SIMD_WIDTH)) {
+    if (IS_BLOCKABLE_BINARY_SCALAR1(sizeof(@type@), NPY_SIMD_WIDTH*sizeof(@type@))) {
         simd_binary_scalar1_@kind@_@sfx@(args, dimensions[0]);
         return;
     }
     /* argument two scalar */
-    else if (IS_BLOCKABLE_BINARY_SCALAR2_BOOL(sizeof(@type@), NPY_SIMD_WIDTH)) {
+    else if (IS_BLOCKABLE_BINARY_SCALAR2(sizeof(@type@), NPY_SIMD_WIDTH*sizeof(@type@))) {
         simd_binary_scalar2_@kind@_@sfx@(args, dimensions[0]);
         return;
     }
-    else if (IS_BLOCKABLE_BINARY_BOOL(sizeof(@type@), NPY_SIMD_WIDTH)) {
+    else if (IS_BLOCKABLE_BINARY(sizeof(@type@), NPY_SIMD_WIDTH*sizeof(@type@))) {
         simd_binary_@kind@_@sfx@(args, dimensions[0]);
         return;
     }
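
The output reported at the top of this issue is consistent with that diagnosis. In an accumulate call, the first input and the output refer to the same buffer shifted by one element, so a loop that loads a whole block before storing anything sees stale values. The sketch below mimics this at the NumPy level. It assumes the stale values are the original inputs and that the whole array is processed as one block; it illustrates the effect and is not the actual SIMD code path.

```python
import numpy as np

a = np.array([False, False, True, False, False, False, False, True, False,
              False, True, False, False, True, False, False, False, False,
              False, False])

# Output reported in this issue for NumPy 1.24 on macOS.
reported = np.array([False, False, True, True, False, False, False, True,
                     True, False, True, True, False, True, True, False,
                     False, False, False, False])

# In an accumulate, the first operand is out[:-1] and the result goes to
# out[1:]: the same buffer, shifted by one element.
out = a.copy()
print(np.shares_memory(out[:-1], out[1:]))  # True

# Block-wise evaluation: the right-hand side is computed in full from the
# stale values before anything is written back.
blockwise = a.copy()
blockwise[1:] = a[:-1] != a[1:]

# Sequential evaluation: each step sees the previous result.
sequential = a.copy()
for i in range(1, len(a)):
    sequential[i] = sequential[i - 1] != a[i]

print(np.array_equal(blockwise, reported))   # True
print(np.array_equal(sequential, reported))  # False
```

The block-wise variant reproduces the reported array exactly, while the sequential variant gives the running XOR the reporter expected.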

knutdrand added the 00 - Bug label Dec 20, 2022

seberg added the component: SIMD and 06 - Regression labels Dec 20, 2022

seberg added this to the 1.24.1 release milestone Dec 20, 2022

seiko2plus self-assigned this Dec 20, 2022

seiko2plus mentioned this issue Dec 21, 2022

BUG, SIMD: Fix memory overlap in ufunc comparison loops #22851

Merged

seberg closed this as completed in #22851 Dec 22, 2022

charris mentioned this issue Dec 22, 2022

BUG, SIMD: Fix memory overlap in ufunc comparison loops #22867

Merged
