
ENH/WIP: add a max_abs ufunc #5509


Closed
wants to merge 1 commit into from

Conversation

ewmoore
Contributor

@ewmoore ewmoore commented Jan 27, 2015

This is a follow-on to our discussion in #5420 about adding sum_abs and max_abs ufuncs. Testing here shows this version to be almost as fast as a raw maximum.reduce for float64 and float32, in both the SIMD loop and BINARY_REDUCE_LOOP paths.

Still needs tests.
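For reference, the semantics being proposed can be sketched in pure Python; the function names below are illustrative, not the PR's actual API:

```python
def max_abs(x, y):
    """Elementwise core of the proposed ufunc: maximum(abs(x), abs(y))."""
    return max(abs(x), abs(y))

def max_abs_reduce(values, initial=0.0):
    """Reducing with max_abs yields the inf-norm of the sequence."""
    acc = initial
    for v in values:
        acc = max_abs(acc, v)
    return acc

data = [3.0, -7.5, 2.25, -1.0]
# The reduction matches max(abs(v)) over the data:
assert max_abs_reduce(data) == max(abs(v) for v in data) == 7.5
```

The point of the single ufunc is that this fused pass avoids the full-sized temporary that abs(arr).max() materializes.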

@argriffing
Contributor

Thanks for writing this! I guess the new PyUFunc_MaxAbsTypeResolver could also be used for sum_abs type resolution later?

I see that the USE_CHROOT=1 ARCH=i386 DIST=trusty PYTHON=3.4 Travis config and the USE_BENTO=1 config fail the double and single precision vector norm tests, but I couldn't track down the source of the problem.

int cmp;

absi1 = PyNumber_Absolute(i1);
absi2 = PyNumber_Absolute(i2);

These can probably raise exceptions and return NULL, which needs handling (and a test).

@juliantaylor
Contributor

Shouldn't it be named abs_max? The absolute value is applied first, similar to other functions of this type, like square sum.

@argriffing
Contributor

shouldn't it be named abs_max

I'd thought that maxabs is a thing, kind of like how logsumexp is a thing. One example is https://software.intel.com/en-us/node/502168. But I may be wrong.

@juliantaylor
Contributor

Hm, maximum of absolute, sum of squares; I guess it makes sense. If other libraries use this ordering, it's probably best not to deviate.

@juliantaylor
Contributor

Instead of adding lots of new functions, it would be kind of nice to have a sort of preprocess argument for ufuncs that is another ufunc. Then you could e.g. do:
np.max(a, preprocess=np.abs), or possibly even recursive chains like np.max(a, preprocess=partial(np.abs, preprocess=partial(np.power, b=3)))

Kind of explicit, unreadable lazy evaluation; on second thought, probably not such a great idea.
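Stripped of the ufunc machinery, the idea can be sketched in plain Python; reduce_with_preprocess and its arguments are hypothetical names for illustration, not a proposed NumPy API:

```python
from functools import reduce

def reduce_with_preprocess(reducer, values, preprocess=None):
    """Sketch of the 'preprocess=' idea: apply an elementwise function
    lazily before reducing, so no full-sized temporary is built."""
    it = iter(values)
    if preprocess is not None:
        it = (preprocess(v) for v in it)  # lazy elementwise step
    return reduce(reducer, it)

data = [3, -7, 2]
# max with an abs preprocess step, analogous to np.max(a, preprocess=np.abs):
assert reduce_with_preprocess(max, data, preprocess=abs) == 7

# a chained preprocess, analogous to the nested partial() example:
cube_then_abs = lambda v: abs(v ** 3)
assert reduce_with_preprocess(max, data, preprocess=cube_then_abs) == 343
```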

@argriffing
Contributor

instead of adding lots of new functions it would be kind of nice to have a sort of preprocess argument for ufuncs that is another ufunc.

This reminds me of the discussion near #5218 (comment).

@njsmith
Member

njsmith commented Jan 27, 2015

I think you're about 20% of the way to reinventing numexpr?

Which is not a bad thing: a version of numexpr that worked with the regular ufunc machinery instead of reimplementing it all would be awesome.


@njsmith
Member

njsmith commented Jan 27, 2015

Basically we'd need two things:

  • a way to run an arbitrary ufunc expression graph over cache-sized chunks
    (not terribly hard in principle, I think -- define a ufunc loop that
    recurses over some data structure calling other ufunc loops?)

  • some user-friendly way to write down these expression graphs. This is
    just a UI problem though and should be decoupled from the above -- instead
of awkward kwargs like prefilter= there are lots of nice options
    available e.g.

    a = delay(a)  # returns a "delay" object with trivial graph
    ma = np.maximum.reduce(np.abs(a))  # returns a "delay" object with nontrivial graph
    ma.force()  # evaluates the graph and returns an ndarray object

numpy_ufunc gets us part of the way there...

Of course one could get a minimum viable product by starting with support for relatively restricted expression graphs, like pure unary nesting, and then improve from there.
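A minimal pure-Python sketch of the delay idea above; Delay, map, and force_reduce are illustrative names, and the chunked, cache-sized evaluation the first bullet describes is omitted:

```python
class Delay:
    """Lazy expression graph: record elementwise ops, evaluate on demand."""

    def __init__(self, data, ops=()):
        self.data = data
        self.ops = ops  # elementwise functions, applied in order

    def map(self, fn):
        # Building the graph allocates nothing; it only records the op.
        return Delay(self.data, self.ops + (fn,))

    def force_reduce(self, reducer):
        """Evaluate the whole graph in one fused pass, no temporaries."""
        def apply_ops(v):
            for fn in self.ops:
                v = fn(v)
            return v
        it = (apply_ops(v) for v in self.data)
        result = next(it)
        for v in it:
            result = reducer(result, v)
        return result

a = Delay([3.0, -7.5, 2.25])
# analogous to the np.maximum.reduce(np.abs(a)) / ma.force() example above:
assert a.map(abs).force_reduce(max) == 7.5
```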


This function computes maximum(absolute(x), absolute(y)), which
when used as a reduction is the inf-norm of a vector.  Providing
a single call ufunc for this operation is 2x-3x faster than the
composite operation.  For large array sizes, this also eliminates
a full sized temporary.  On x86_64 this operation is similar in
speed to maximum for contiguous reductions.
@ewmoore
Contributor Author

ewmoore commented Feb 25, 2015

I've removed the code that depends on the float representation from loops.c.src. However, this is no longer faster than just calling abs(arr).max() when the BINARY_LOOP path is taken. It looks to me as though gcc must generate almost exactly the code I had, where I was masking out the sign bit instead of performing a comparison to find the absolute value, if I use fabs directly. Calling npy_fabs obviously doesn't cut it, since gcc doesn't know anything about it.

This version is about a factor of 2 slower for the non-contiguous case. I'm not sure what code changes, if any, can regain that speed in a fully portable way. The script I've been using to benchmark things is here: https://gist.github.com/ewmoore/5f34c148b00ed461057d

Although I think allowing arbitrary combinations of ufuncs would be nice and might make a good GSoC project, I'm not going to have time to do that. So if we'd rather build that instead, let's close this. It's also possible to only expose this function (and a hypothetical sum_abs) via linalg.norm; perhaps that represents a sort of middle road. That being said, it seems to me that we don't want to make this operation slower, which this actually does right now.

@charris
Member

charris commented Mar 22, 2015

Rejigger tests by closing and opening.

@charris charris closed this Mar 22, 2015
@charris charris reopened this Mar 22, 2015
@argriffing
Contributor

Although I think allowing arbitrary combinations of ufuncs would be nice and might make a good GSoC project, I'm not going to have time to do that.

I think it's reasonable not to do everything in the maximum possible generality. For example, there's a short list of functions in https://software.intel.com/en-us/node/502164. Personally I am most interested in the inf-norm and 1-norm, which could be implemented as max_abs and sum_abs ufuncs.
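In plain terms, each of those two norms is a single fused pass over abs(v); a pure-Python sketch of what max_abs.reduce and sum_abs.reduce would compute (these are the same quantities as numpy's linalg.norm(v, inf) and linalg.norm(v, 1) for a vector):

```python
def inf_norm(v):
    """inf-norm: what a max_abs.reduce would compute."""
    return max(abs(x) for x in v)

def one_norm(v):
    """1-norm: what a sum_abs.reduce would compute."""
    return sum(abs(x) for x in v)

v = [3.0, -4.0, 1.5]
assert inf_norm(v) == 4.0
assert one_norm(v) == 8.5
```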

So if we'd rather build that instead, let's close this.

In my opinion there could be value in not waiting for arbitrary combinations of ufuncs.

It's also possible to only expose this function (and a hypothetical sum_abs) via linalg.norm; perhaps that represents a sort of middle road.

This seems like a good idea to me.

@ewmoore ewmoore closed this Feb 1, 2016
@charris
Member

charris commented Feb 1, 2016

@ewmoore How come?

@ewmoore
Contributor Author

ewmoore commented Feb 1, 2016

Mostly because I don't intend to work on this further right now. I think it's a fine idea though.


@ewmoore
Contributor Author

ewmoore commented Feb 1, 2016

I'm very sympathetic to the idea that having all of these
dead/stalled/abandoned PRs around isn't great for the project. Scipy has
the same problem.

