Minor cleanup and optimization of Sketch #24964

oscargus · 2023-01-12T23:37:04Z

PR Summary

Getting rid of a few redundant/not required computations. Have not checked the assembly code to confirm that the compiler doesn't already optimize it away though.

PR Checklist

Documentation and Tests

Has pytest style unit tests (and pytest passes)
Documentation is sphinx and numpydoc compliant (the docs should build without error).
New plotting related features are documented with examples.

Release Notes

New features are marked with a .. versionadded:: directive in the docstring and documented in doc/users/next_whats_new/
API changes are marked with a .. versionchanged:: directive in the docstring and documented in doc/api/next_api_changes/
Release notes conform with instructions in next_whats_new/README.rst or next_api_changes/README.rst

oscargus · 2023-01-13T08:53:17Z

src/path_converters.h

                len = sqrt(len);
-                *x += r * num / len;
-                *y += r * -den / len;
+                double r = sin(m_p * (2.0 * d_M_PI) / m_length) * m_scale;


Two minor changes since the review:

Brackets to enable constant folding here.

Change += x * -y to -= x * y below.

At least both these give simplifications in the LLVM IR (maybe the backend will still provide the same code in the end though, but probably not for the first one due to FP aspects).

oscargus · 2023-01-19T07:25:10Z

I did a major rewrite after reading up on pow works. This is now replaced by exp, which makes it much faster on x86 and shouldn't make much change on other platforms (exp shouldn't be worse than pow).

Added some comments about how it is refactored.

For my test example, similar to #24908 (comment), the time doing pow was 33% of the total time, now the time doing exp is 16% of the total time. (Hard to compare actual times as the workloads were slightly different on the machine...)

oscargus · 2023-01-19T07:27:23Z

One may want to move the random number generation and exp into the if-clause. However, that may change the outcome, although I guess that the if-clause is primarily for avoiding division by zero in rare cases rather than something that is heavily used, so there may be limited performance benefits and limited changes in the outcome.

Edit: I guess what I am trying to say is that if there is a performance benefit of moving it there, it will also modify the output to a larger extent as what we are doing is skipping updating the phase and random sequence when len is 0. So probably not a good idea.

jklymak · 2023-02-02T17:01:25Z

Is this code path tested? Do you have evidence that this is actually a speed up?

tacaswell · 2023-02-02T17:59:53Z

We will hit this in : https://github.com/matplotlib/matplotlib/blob/main/lib/matplotlib/tests/baseline_images/test_path/xkcd.png

I'm happy to take Oscar at his word that this is faster.

oscargus · 2023-02-03T16:30:05Z

Yes. it is faster (and tested). At least on x86, but shouldn't be slower on other architectures (as in about as fast, worst case). The background is that pow(a, b) is computed as exp(b*log(a)) internally on x86 (as there are exp and log instructions). Since a is constant in this case, one can compute log(a) once.

(Been focusing quite heavily on a work project recently...)

QuLogic approved these changes Jan 13, 2023

View reviewed changes

oscargus force-pushed the pathspeedup branch from b1fdf6a to d05d859 Compare January 13, 2023 08:50

oscargus commented Jan 13, 2023

View reviewed changes

oscargus force-pushed the pathspeedup branch from d05d859 to 65b74d1 Compare January 18, 2023 18:07

oscargus marked this pull request as draft January 19, 2023 05:00

Minor cleanup and optimization of Sketch

96c9a30

oscargus force-pushed the pathspeedup branch from 65b74d1 to 96c9a30 Compare January 19, 2023 07:18

oscargus mentioned this pull request Jan 19, 2023

[Bug]: coredump when combining xkcd, FigureCanvasTkAgg and FuncAnimation #24908

Open

oscargus added the Performance label Jan 19, 2023

oscargus marked this pull request as ready for review January 19, 2023 07:38

QuLogic self-requested a review January 19, 2023 09:36

tacaswell added this to the v3.8.0 milestone Feb 2, 2023

tacaswell approved these changes Feb 2, 2023

View reviewed changes

tacaswell merged commit e86ad1e into matplotlib:main Feb 2, 2023

oscargus deleted the pathspeedup branch February 3, 2023 16:30

This was referenced Aug 5, 2024

[Bug]: division-by-zero error in Sketch::Sketch with Agg backend #28669

Closed

Avoid division-by-zero in Sketch::Sketch #28707

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Minor cleanup and optimization of Sketch #24964

Minor cleanup and optimization of Sketch #24964

Uh oh!

oscargus commented Jan 12, 2023

Uh oh!

oscargus Jan 13, 2023

Uh oh!

oscargus commented Jan 19, 2023

Uh oh!

oscargus commented Jan 19, 2023 •

edited

Loading

Uh oh!

jklymak commented Feb 2, 2023

Uh oh!

tacaswell commented Feb 2, 2023

Uh oh!

oscargus commented Feb 3, 2023

Uh oh!

Uh oh!

Uh oh!

Minor cleanup and optimization of Sketch #24964

Minor cleanup and optimization of Sketch #24964

Uh oh!

Conversation

oscargus commented Jan 12, 2023

PR Summary

PR Checklist

Uh oh!

oscargus Jan 13, 2023

Choose a reason for hiding this comment

Uh oh!

oscargus commented Jan 19, 2023

Uh oh!

oscargus commented Jan 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jklymak commented Feb 2, 2023

Uh oh!

tacaswell commented Feb 2, 2023

Uh oh!

oscargus commented Feb 3, 2023

Uh oh!

Uh oh!

oscargus commented Jan 19, 2023 •

edited

Loading