Reimplement NonUniformImage, PcolorImage in Python, not C. #14913
Conversation
What about performance?
Performance-wise (I haven't actually done any benchmarks):
OTOH I don't think functionality that's so obscure that it doesn't even have tests yet :) warrants 400 lines of C to squeeze every last drop of performance out of it.
... does it justify rewriting if it isn't broken? Or is it broken?
So, IIRC, NonUniformImage came about a few years ago when we realized that a limitation of UniformImage was apparently arbitrary (or maybe it was that NonUniformImage predates that work, and the same can now be achieved using transforms and UniformImage?). Maybe the tests aren't obviously linked to NonUniformImage?
Fair point.
You can't directly rewrite NonUniformImage in terms of AxesImage (well, you'd need to generate arbitrary transforms on-the-fly mapping the arbitrary x-values to a uniform grid -- not impossible, but probably more work than it's worth).
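For illustration, a minimal sketch of what such an on-the-fly mapping could look like (the names here are made up; this is not how NonUniformImage is actually implemented):

```python
import numpy as np

# Hypothetical warp from non-uniform data coordinates to uniform "index"
# coordinates; a transform built from this could feed a plain AxesImage.
x = np.array([0.0, 0.1, 0.3, 0.7, 1.0])      # arbitrary (sorted) x positions

def warp(x_data):
    # Piecewise-linear map of data coordinates onto 0 .. len(x) - 1.
    return np.interp(x_data, x, np.arange(len(x)))

print(warp(np.array([0.05, 0.5, 1.0])))      # [0.5 2.5 4. ]
```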
What do you mean? You can grep for NonUniformImage throughout the codebase and the only things that show up are the implementation, the example linked above, and some smoke tests checking that one can set the cmap, set the norm, or update the value of a NonUniformImage.
Actually, it looks like this fixes #15039.
This is worth a close look as a way to streamline the codebase. I think the use case for NonUniformImage is actually handled now by pcolormesh, though probably more slowly and possibly with differences in output between backends.
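As a rough illustration of the two approaches being compared (the variable names are invented for this sketch; `shading="nearest"` assumes Matplotlib >= 3.3):

```python
import numpy as np
import matplotlib.pyplot as plt
from matplotlib.image import NonUniformImage

x = np.linspace(0, 1, 50) ** 2          # non-uniform cell centers
y = np.linspace(0, 1, 40) ** 2
z = np.random.rand(40, 50)

fig, (ax1, ax2) = plt.subplots(2)
# NonUniformImage path (what this PR reimplements in Python):
im = NonUniformImage(ax1, interpolation="nearest")
im.set_data(x, y, z)
ax1.add_image(im)
ax1.set(xlim=(0, 1), ylim=(0, 1))
# pcolormesh path; shading="nearest" also takes cell centers:
ax2.pcolormesh(x, y, z, shading="nearest")
plt.show()
```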
I guess this seems fine, if it's fast enough, but it definitely needs tests.
I finally spent some time on profiling this. The benchmark script I used is:

```python
from timeit import Timer
from matplotlib import pyplot as plt
from matplotlib.image import NonUniformImage, PcolorImage
import numpy as np
N = 100
fig, (ax_nn, ax_nb, ax_pc) = plt.subplots(3)
ax_nn.set(xlim=(-.5, .75), ylim=(-.5, .75))
nn = NonUniformImage(ax_nn)
nn.set_data(np.linspace(0, 1, 2 * N) ** 2, np.linspace(0, 1, N) ** 2,
            np.arange(2 * N**2).reshape((N, 2 * N)))
ax_nn.images.append(nn)
ax_nb.set(xlim=(-.5, .75), ylim=(-.5, .75))
nb = NonUniformImage(ax_nb, interpolation="bilinear")
nb.set_data(np.linspace(0, 1, 2 * N) ** 2, np.linspace(0, 1, N) ** 2,
            np.arange(2 * N**2).reshape((N, 2 * N)))
ax_nb.images.append(nb)
ax_pc.set(xlim=(-.5, .75), ylim=(-.5, .75))
pc = PcolorImage(ax_pc)
pc.set_data(np.linspace(0, 1, 2 * N + 1) ** 2, np.linspace(0, 1, N + 1) ** 2,
            np.arange(2 * N**2).reshape((N, 2 * N)))
ax_pc.images.append(pc)
fig.canvas.draw()
n, t = Timer("nn.make_image(fig._cachedRenderer)", globals=globals()).autorange()
print(f"NN: {1000*t/n:.4f}ms")
n, t = Timer("nb.make_image(fig._cachedRenderer)", globals=globals()).autorange()
print(f"NB: {1000*t/n:.4f}ms")
n, t = Timer("pc.make_image(fig._cachedRenderer)", globals=globals()).autorange()
print(f"PC: {1000*t/n:.4f}ms")
plt.show()
```

The original version was indeed much (many times) slower than the C version; contrary to what I expected, the bottleneck was not actually searchsorted or even temporary buffers, but general numpy overhead (indexing, iteration over non-contiguous buffers). I put in (and pushed) quite a few micro-optimizations; on the benchmark above, NonUniformImage+nearest and PcolorImage are now ~50% (1.5x) slower than before, and NonUniformImage+bilinear is ~2.5x slower, which I guess is in a more acceptable range given that this also fixes some other issues (#15039).
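To make the "general numpy overhead" point concrete, here is a sketch of the kind of micro-optimization being referred to (illustrative only, not the exact code in the PR): gathering whole RGBA pixels in one fancy-indexing pass by viewing the uint8 buffer as one uint32 per pixel.

```python
import numpy as np

h_in, w_in, h_out, w_out = 100, 200, 60, 80
A = np.random.randint(0, 256, (h_in, w_in, 4), dtype=np.uint8)  # RGBA source
rows = np.random.randint(0, h_in, h_out)   # source row for each output row
cols = np.random.randint(0, w_in, w_out)   # source column for each output column

# View each 4-byte RGBA pixel as a single uint32, gather with one flat index,
# then view the result back as uint8.
flat = np.ascontiguousarray(A).view(np.uint32).ravel()
out = (flat[np.add.outer(rows * w_in, cols)]
       .view(np.uint8).reshape((h_out, w_out, 4)))
print(out.shape)   # (60, 80, 4)
```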
Any chance we could get even a basic test here? Part of me thinks it's absurd that we have an entire feature that has 0 lines exercised by a test.
It's not too hard to add a test (below), but unsurprisingly it does reveal that the new implementation is not pixel-identical to the previous one. I'll investigate a bit before committing to this new version.

```python
@image_comparison(["nonuniform_and_pcolor.png"], style="mpl20")
def test_nonuniform_and_pcolor():
    axs = plt.figure().subplots(3, sharex=True, sharey=True)
    for ax, interpolation in zip(axs, ["nearest", "bilinear"]):
        im = NonUniformImage(ax, interpolation=interpolation)
        im.set_data(np.arange(3) ** 2, np.arange(3) ** 2, np.arange(9).reshape((3, 3)))
        ax.images.append(im)
    axs[2].pcolorfast(  # PcolorImage
        np.arange(4) ** 2, np.arange(4) ** 2, np.arange(9).reshape((3, 3)))
    for ax in axs:
        ax.set_axis_off()
        # NonUniformImage "leaks" out of extents, not PcolorImage.
        ax.set(xlim=(0, 20))
```
I convinced myself that most of the off-by-1px just comes from
```python
    np.ascontiguousarray(A).view(np.uint32).ravel()[
        np.add.outer(y_int * A.shape[1], x_int)]
    .view(np.uint8).reshape((height, width, 4)))
else:  # self._interpolation == "bilinear"
```
```diff
-else:  # self._interpolation == "bilinear"
+elif self._interpolation == "bilinear":
```
And add `else: raise NotImplementedError(...)`. Even though this is checked in another place in the code, that check is quite far away and could get out of sync with the implementation by accident. I feel a little safer with the explicit check.
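A minimal standalone sketch of the structure being suggested here (the function and argument names are made up for illustration):

```python
def _resample(interpolation):
    if interpolation == "nearest":
        pass  # nearest-neighbour path
    elif interpolation == "bilinear":
        pass  # bilinear path
    else:
        # Fails loudly if the check elsewhere ever drifts out of sync.
        raise NotImplementedError(
            f"{interpolation!r} interpolation is not supported")
```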
That's not really how `switch ... case` is written elsewhere in the codebase (e.g. in _axes.py you have quite a few `else:  # orientation == "horizontal"` or variants thereof). I don't really mind either way, but let's be consistent.
```python
else:  # self._interpolation == "bilinear"
    # Use np.interp to compute x_int/x_float has similar speed.
```
You say that `np.interp` is approximately equally fast, but still decide to implement the interpolation yourself? Why? If we want our own interpolation, I'd still favor a dedicated private function. That would make it clearer and simpler to test and profile.
Because in this specific case you also need to tweak `interp` a bit, i.e. the actual implementation is something like (from memory):
```python
range_ax = np.arange(len(self._Ax), dtype=float)
# Don't index beyond the end.
range_ax[-1] = np.nextafter(len(self._Ax) - 1, 0)
x = np.interp(x_pix, self._Ax, range_ax)
x_int = x.astype(int)
x_frac = np.mod(x, 1, dtype=np.float32)
```
which I don't think is more readable (it's not really worse either). I'm not convinced that factoring this out into e.g. `_interpolate(A, self._Ax, self._Ay, x_pix, y_pix)` would help legibility either.
Also, re profiling, the real bottleneck is not actually here, it's in the actual interpolation code below.
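For comparison, a sketch (under the assumption of sorted data coordinates; not the PR's exact code) of a searchsorted-based variant for computing the cell index and fractional offset:

```python
import numpy as np

def bilinear_indices(Ax, x_pix):
    # Index of the cell each output pixel falls into, clipped so that
    # Ax[x_int + 1] is always valid, plus the fractional position inside it.
    x_int = np.clip(Ax.searchsorted(x_pix) - 1, 0, len(Ax) - 2)
    x_frac = np.clip((x_pix - Ax[x_int]) / np.diff(Ax)[x_int], 0, 1)
    return x_int, x_frac

Ax = np.array([0.0, 0.1, 0.4, 1.0])
x_pix = np.linspace(0, 1, 5)
print(bilinear_indices(Ax, x_pix))
```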
@anntzer is this ready for review or are you still mulling it over?
This should be good to go (from what I remember). There's a significant slowdown (~1.5x with nearest, ~2.5x with bilinear), but that has to be weighed against fixing bugs in a slightly obscure part of the library and a very large shortening of the implementation.
Seems fine? Is it exactly the same as the old implementation? If not, you need an API note?
There are some single-pixel shifts (when a boundary falls exactly between two pixels; see the discussion about searchsorted above), but nothing was tested before...
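A toy illustration of that boundary behaviour (values made up): when an output pixel lands exactly on a cell midpoint, the cell it maps to depends on the tie-breaking side of the search, hence the occasional one-pixel shift.

```python
import numpy as np

midpoints = np.array([0.5, 1.5, 2.5])
# A query exactly on a midpoint goes to different cells depending on the side.
print(np.searchsorted(midpoints, 1.5, side="left"))   # 1
print(np.searchsorted(midpoints, 1.5, side="right"))  # 2
```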
Right, but please add an API note so that, if someone else was testing this, they know it was a purposeful change.
Changelog added.
@timhoffm I pinged you for another review, if you have time; you had some fundamental objections a while ago. FWIW I think moving this to Python is good for our long-term health. If we can do things in Python without noticeably hurting performance, that seems better than doing them down in C. However, if that is not the consensus and we feel this function is fine as-is, we should just close this.
I did some testing of the PcolorImage case, including a comparison to pcolormesh, on my year-old MacBook Pro with an i7. I find very little difference in speed with this change; the best times are unchanged, but there might be a little more variability from run to run with this PR, so some runs are marginally slower. Examples, first with 3.4.1:
Now with this PR, with only one change: I added the missing
For the 2000x1000 case I got times ranging from 0.127 to 0.157. It's possible that running more times would turn up a similar range without this PR. In any case, for this test, I find negligible slow-down with this PR. The test code is:

```python
import numpy as np
import matplotlib
matplotlib.use("agg")
import matplotlib.pyplot as plt
import time
# warmup
fig, ax = plt.subplots()
ax.pcolorfast(np.arange(5) ** 2, np.arange(5) ** 2, np.random.randn(4, 4))
fig.savefig("junk.png")
plt.close()
for mult in (1, 10, 100):
    nx, ny = 20 * mult, 10 * mult
    nxny = f"{nx}x{ny}"
    x = (5 + np.arange(nx)) ** 1.5
    y = (3 + np.arange(ny)) ** 1.5
    X, Y = np.meshgrid(x, y)
    z = (X + Y)[1:, 1:]
    fig, ax = plt.subplots()
    tic = time.time()
    pc = ax.pcolorfast(x, y, z)
    fig.savefig("pcolorfast_timer0.png")
    print(f"{nxny:12s} {time.time() - tic:5.3f}s {pc}")
    plt.close()
    fig, ax = plt.subplots()
    tic = time.time()
    pc = ax.pcolormesh(x, y, z)
    fig.savefig("pcolorfast_timer1.png")
    print(f"{nxny:12s} {time.time() - tic:5.3f}s {pc}")

# plt.show()
```

For this timing test, that warmup at the start is critical; otherwise the first plot in the series takes much longer.
Looks good to me. While you are there, you might want to go ahead and add the missing `__str__` methods to both NonUniformImage and PcolorImage, e.g., for the latter:

```python
def __str__(self):
    return "PcolorImage(%g,%g;%gx%g)" % tuple(self.axes.bbox.bounds)
```
Thanks for the perf checks. AFAICT the […] prints that both images have a […]
I'll merge given that #20294 will track this...
It's much shorter...

None of this has test coverage though :( -- probably needed for the PR; but one can first check that examples/images_contours_and_fields/image_nonuniform.py still works.

Edit: closes #15039.