MAINT: cleanup of fast_loop_macros.h #13208

qwhelan · 2019-03-29T03:45:27Z

This is a followup to #12988 and incorporates comments suggested there:

- OUTPUT_LOOP_FAST now takes the val to output rather than *out = val
  - The two call sites in loops.c.src have been updated
- Whitespace between macros inserted
- tout * out has been changed to tout *out in all cases
- All macros have had their indentation fixed
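
To make the first item concrete, here is a minimal sketch of the two calling styles. The macros and functions below (SKETCH_OUTPUT_LOOP_STMT, SKETCH_OUTPUT_LOOP_VAL, fill_stmt, fill_val) are hypothetical, simplified stand-ins written for this illustration; they are not the definitions in fast_loop_macros.h and they ignore strides entirely.

```c
#include <stddef.h>

/* Hypothetical, simplified stand-ins; NOT the macros from fast_loop_macros.h. */

/* Statement style: the caller passes a full statement and has to know
 * that the macro body declares a pointer named `out`. */
#define SKETCH_OUTPUT_LOOP_STMT(tout, dst, n, op) \
    do { \
        size_t i_; \
        for (i_ = 0; i_ < (n); i_++) { \
            tout *out = &(dst)[i_]; \
            op; \
        } \
    } while (0)

/* Value style: the caller passes only the value; the macro owns the
 * assignment, so `out` stays an implementation detail. */
#define SKETCH_OUTPUT_LOOP_VAL(tout, dst, n, val) \
    do { \
        size_t i_; \
        for (i_ = 0; i_ < (n); i_++) { \
            tout *out = &(dst)[i_]; \
            *out = (val); \
        } \
    } while (0)

/* Fill dst with 1 using the statement style. */
void fill_stmt(unsigned char *dst, size_t n)
{
    SKETCH_OUTPUT_LOOP_STMT(unsigned char, dst, n, *out = 1);
}

/* Fill dst with 0 using the value style. */
void fill_val(unsigned char *dst, size_t n)
{
    SKETCH_OUTPUT_LOOP_VAL(unsigned char, dst, n, 0);
}
```

Both styles produce the same loop; the difference is only in how much of the macro's internals leaks into each call site.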

eric-wieser · 2019-03-29T05:41:09Z

numpy/core/src/umath/fast_loop_macros.h

 * combine with NPY_GCC_OPT_3 to allow autovectorization
 * should only be used where its worthwhile to avoid code bloat
 */
-#define BASE_OUTPUT_LOOP(tout, op) \
+#define BASE_OUTPUT_LOOP(tout, val) \


While we're touching this - what are your thoughts on renaming it to UNARY_CONSTANT_LOOP? This is describing (arg) -> const, I'd expect OUTPUT_LOOP to describe () -> result, possibly used in something like random if that ever becomes a ufunc.

Works for me - are you thinking s/OUTPUT_/UNARY_CONSTANT_/ for everything in these two files?

eric-wieser · 2019-03-29T05:42:38Z

numpy/core/src/umath/loops.c.src

@@ -652,7 +652,7 @@ BOOL__ones_like(char **args, npy_intp *dimensions, npy_intp *steps, void *NPY_UN
 NPY_NO_EXPORT void
 BOOL_@kind@(char **args, npy_intp *dimensions, npy_intp *steps, void *NPY_UNUSED(func))
 {
-    OUTPUT_LOOP_FAST(npy_bool, *out = @val@);


I'm curious - is it any slower if we just use UNARY_LOOP_FAST(npy_bool, npy_bool, *out=@val@) here? Is the compiler able to remove the unused iteration over the input?

If it is, I'd be inclined to maintain fewer macros.

I dug up the benchmark I wrote and quickly tried out your suggestion. I'll need to check my code in the morning but it looks like it causes a substantial regression:

$ asv compare upstream/master HEAD --only-changed
       before           after         ratio
     [db5fcc8e]       [0c76cb6e]
     <maximum_speedup~1>       <bool_perf>
+         177±3μs          234±2μs     1.32  bench_ufunc.IsNan.time_isnan('float16')
+     2.99±0.06μs       32.8±0.2μs    10.96  bench_ufunc.IsNan.time_isnan('int16')
+      3.09±0.5μs         33.1±1μs    10.72  bench_ufunc.IsNan.time_isnan('int32')
+     3.07±0.07μs       33.2±0.3μs    10.80  bench_ufunc.IsNan.time_isnan('int64')

Good news - there was a typo in what I tested last night and there's no performance impact if we just use UNARY_LOOP_FAST instead. I've pushed a commit that uses that and removes the OUTPUT_LOOP_FAST macros.
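
For readers following the question about the unused input iteration, the two loop shapes being compared can be sketched roughly as below. These are plain functions rather than the real macros, and the names fill_output_only and fill_unary_shaped are made up for this illustration. The sketch only shows that the results are identical; whether a compiler actually drops the dead input-pointer arithmetic depends on the compiler and flags, and has to be measured as was done here.

```c
#include <stddef.h>

/* Hypothetical stand-in for an output-only loop:
 * only the output pointer is advanced. */
void fill_output_only(char *op1, ptrdiff_t os1, ptrdiff_t n,
                      unsigned char val)
{
    ptrdiff_t i;
    for (i = 0; i < n; i++, op1 += os1) {
        *(unsigned char *)op1 = val;
    }
}

/* Hypothetical stand-in for a unary-shaped loop used for a constant
 * result: the input pointer is stepped alongside the output pointer
 * but never dereferenced, so the stepping is dead code that an
 * optimizer may be able to remove. */
void fill_unary_shaped(const char *ip1, ptrdiff_t is1,
                       char *op1, ptrdiff_t os1, ptrdiff_t n,
                       unsigned char val)
{
    ptrdiff_t i;
    for (i = 0; i < n; i++, ip1 += is1, op1 += os1) {
        *(unsigned char *)op1 = val;
    }
}
```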

charris · 2019-03-30T00:21:53Z

numpy/core/src/umath/fast_loop_macros.h

 /* PR80198 again, scalar works without the pragma */
 #define BASE_BINARY_LOOP_S_INP(tin, tout, cin, cinp, vin, vinp, op) \
    const tin cin = *(tin *)cinp; \
    BINARY_LOOP { \
        const tin vin = *(tin *)vinp; \
-        tout * out = (tout *)vinp; \
+        tout *out = (tout *)vinp; \
        op; \


I was suggesting replacing op; by *out = val so that the call passes @val@ instead of the assignment statement.

Right, but that was in the OUTPUT_LOOP macros, which are now gone completely.

eric-wieser

Looks great, although I might need to use a local difftool to check that only indents changed

qwhelan · 2019-03-30T02:05:16Z

@eric-wieser One thing to note is that UNARY_LOOP_FAST declares in, which is unused in the output mode and thus generates unused-variable warnings from gcc. This is the cause of the Travis failure, and I'm not sure of the preferred way to suppress or resolve that warning.

eric-wieser · 2019-03-30T02:15:36Z

numpy/core/src/umath/loops.c.src

@@ -896,7 +896,7 @@ NPY_NO_EXPORT void
 NPY_NO_EXPORT void
 @TYPE@_@kind@(char **args, npy_intp *dimensions, npy_intp *steps, void *NPY_UNUSED(func))
 {
-    OUTPUT_LOOP_FAST(npy_bool, *out = @val@);
+    UNARY_LOOP_FAST(@type@, npy_bool, *out = @val@);


Does changing this to (void)in; *out = @val@ do the trick without costing performance?

Looks promising - no warnings locally and no performance regression. Just rebased and pushed, so will see what Travis has to say

Might be worth adding a comment explaining the (void) - that the macro provides an in variable, and we need to tell the compiler we are deliberately ignoring it
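
A minimal sketch of the suggestion, including the kind of comment being asked for. SKETCH_UNARY_LOOP, int_isnan_sketch and negate_sketch are hypothetical names for this illustration; the macro is a simplified, contiguous-only stand-in and not the UNARY_LOOP_FAST definition.

```c
#include <stddef.h>

/* Hypothetical simplified unary loop macro; NOT NumPy's definition.
 * It declares `in` (current input value) and `out` (pointer to the
 * current output element) for use inside `op`. */
#define SKETCH_UNARY_LOOP(tin, tout, src, dst, n, op) \
    do { \
        size_t i_; \
        for (i_ = 0; i_ < (n); i_++) { \
            const tin in = (src)[i_]; \
            tout *out = &(dst)[i_]; \
            op; \
        } \
    } while (0)

/* Constant result: an integer can never be NaN, so every output is 0. */
void int_isnan_sketch(const int *src, unsigned char *dst, size_t n)
{
    /*
     * The macro provides an `in` variable that this loop does not need.
     * The (void) cast tells the compiler we are deliberately ignoring
     * it, which avoids an unused-variable warning.
     */
    SKETCH_UNARY_LOOP(int, unsigned char, src, dst, n, (void)in; *out = 0);
}

/* Ordinary use of the same macro, where `in` is actually read. */
void negate_sketch(const int *src, int *dst, size_t n)
{
    SKETCH_UNARY_LOOP(int, int, src, dst, n, *out = -in);
}
```

The cast is an expression statement with no effect of its own, so it is not expected to change the generated code; the measurements reported above are still the authority on that.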

mattip · 2019-03-31T07:40:52Z

Thanks @qwhelan

qwhelan mentioned this pull request Mar 29, 2019

ENH: Create boolean and integer ufuncs for isnan, isinf, and isfinite. #12988

Merged

qwhelan force-pushed the charris_followup branch from 3b5b208 to ff646bf Compare March 29, 2019 03:49

eric-wieser reviewed Mar 29, 2019


charris reviewed Mar 30, 2019


eric-wieser requested changes Mar 30, 2019


eric-wieser reviewed Mar 30, 2019


MAINT: cleanup of fast_loop_macros.h

bea6946

qwhelan force-pushed the charris_followup branch from dd3050a to 3107b1f Compare March 30, 2019 02:25

MAINT: remove OUTPUT_LOOP_FAST macro and use UNARY_LOOP_FAST instead

838abd7

qwhelan force-pushed the charris_followup branch from 3107b1f to 838abd7 Compare March 30, 2019 02:44

eric-wieser approved these changes Mar 30, 2019


mattip merged commit a8eca5c into numpy:master Mar 31, 2019
