Fix flaky text tests #8708

QuLogic · 2017-06-03T06:25:27Z

As noted in #7911, the mathtext tests sometimes fail if the id of the renderer is the same between tests. Since it's difficult to add the mathtext fontset to the key, I fixed/hacked it into working by adding a cache-busting "unique" value to each renderer, which is just a globally-incremented number. I thought of using a UUID or random number, but that seemed too expensive since we don't need to worry about threading locks on globals (AFAIK).

I ran the test from #7911, but using the repeat plugin to run everything 100 times over (so 50000 tests total.) So I'm not 100% certain, but I believe this fixes #7911.

Additionally, we've lately been seeing several failures of test_savefig_to_stringio on AppVeyor. This doesn't usually fail locally, but seems to be a timeout on the TeX cache. I've added some retries on these tests to hopefully reduce the problem.

PR Checklist

Has Pytest style unit tests
Code is PEP 8 compliant
New features are documented, with examples if plot related
Documentation is sphinx and numpydoc compliant
Added an entry to doc/users/whats_new.rst if major new feature
Documented in doc/api/api_changes.rst if API changed in a backward-incompatible way

2.5.5 is what's in my conda env, and 2.6.1 is what's built with the local_freetype option and is used on Travis and AppVeyor without issue.

The result of id() is only guaranteed to be unique for objects that exist at the same time. This global integer is a quick and light way to ensure that new renderers that match an old one don't produce the same caching key.

The Mathtext tests should no longer be flaky based on the previous change. Mark some PS tests as flaky because they require a lock that sometimes gets stuck on AppVeyor due to the multiple processes.

QuLogic · 2017-06-03T06:27:22Z

Huh, just realized that @tacaswell outlined basically this procedure in #7911 but I didn't recall it; must have been stuck in my subconscious somewhere.

tacaswell · 2017-06-03T21:21:51Z

I don't think this is thread safe, and that is ok. I do not think we make the claim that Matplotlib is threadsafe in any way. Further most of our time is spent in python and I do not think we release the GIL in the c extensions so it is not clear to me you would get any major gains (unless you really are I/O bound).

Although the way that this will fail to be thread safe also means both objects existing at the same time in which case the id() value should be different. If the race condition is between two threads calling _uid on the same renderer at the same time then the user is trying to render the same object on more than one thread and I expect other things to break.

There are two places to get race conditions here (see https://www.youtube.com/watch?v=7SSYhuk5hmc for lots more details) If you look

x = 5


class Test:
    
    def foo():
        global x
        x += 1
        self.cache = x

and

In [91]: t = Test()

In [93]: dis.dis(t.foo)
  8           0 LOAD_GLOBAL              0 (x)
              2 LOAD_CONST               1 (1)
              4 INPLACE_ADD
              6 STORE_GLOBAL             0 (x)

  9           8 LOAD_GLOBAL              0 (x)
             10 LOAD_GLOBAL              1 (self)
             12 STORE_ATTR               2 (cache)
             14 LOAD_CONST               0 (None)
             16 RETURN_VALUE

I think you can get races conditions between all of the op-codes in line 8 which missing an increment and between lines 8 and 9 (both increment, but then both use the higher value).

anntzer · 2017-06-03T22:52:02Z

Should this just go into RendererBase.__hash__? (then get_prop_tup would just stick renderer into the tuple instead of id(renderer), renderer._uid.)

tacaswell · 2017-06-03T23:01:25Z

We probably do not want to go down the __hash__ route without implementing all of other __eq__ and friends methods.

QuLogic · 2017-06-03T23:25:15Z

For thread-safety, I was thinking more of subsequent renderers. Across the threads, the renderers would have different id anyway, so it'd be fine if they ended up with the same value.

Theoretically, thread 1 could create a couple renderers in the time it takes thread 2 to store the wrong value. Then the next time thread 1 created a new id-conflicting renderer it would have the wrong uid. But something would have to be seriously wrong with thread 2 if thread 1 was able to create multiple renderers in the time it took to load+add+store.

anntzer · 2017-06-04T00:02:59Z

Actually you can just use itertools.count as a threadsafe counter (create _counter at the module level and call next(_counter) to get a uid). Yes, this is only due to a CPython implementation detail (i.e. the GIL) but it's not worse than the current PR and should be at least as fast (or better) (in any case I doubt that's the slowest part of rendering).

QuLogic · 2017-06-04T00:06:08Z

That's a good idea; even if not thread-safe on all implementation, it's at least clearer.

anntzer · 2017-06-04T00:19:50Z

lib/matplotlib/backend_bases.py

+        part of a caching key.
+        """
+        if self._id is None:
+            self._id = next(_unique_renderer_id)


I would just make this a regular attribute created at instantiation (and leave a comment in the ctor). I'm not sure what making this a property buys you...

I meant to remove this now when switching to itertools but forgot about it.

@anntzer

As suggested by @anntzer.

QuLogic · 2017-06-06T03:21:08Z

@tomspur This patch should help with https://bugzilla.redhat.com/show_bug.cgi?id=1401267 (though leave out the pytest parts if on 1.5.3.)

tacaswell

Approved before and it is much better now.

anntzer · 2017-06-10T04:49:25Z

thanks!

matthew-brett · 2017-06-10T12:43:44Z

@QuLogic - great - thanks for doing that.

QuLogic added 3 commits June 1, 2017 21:10

Update FreeType version in test_tightlayout4.

8a0e56e

2.5.5 is what's in my conda env, and 2.6.1 is what's built with the local_freetype option and is used on Travis and AppVeyor without issue.

Add a unique number to any renderer hash keys.

7a49d20

The result of id() is only guaranteed to be unique for objects that exist at the same time. This global integer is a quick and light way to ensure that new renderers that match an old one don't produce the same caching key.

Update tests that are marked flaky.

e050729

The Mathtext tests should no longer be flaky based on the previous change. Mark some PS tests as flaky because they require a lock that sometimes gets stuck on AppVeyor due to the multiple processes.

QuLogic added this to the 2.1 (next point release) milestone Jun 3, 2017

QuLogic requested a review from tacaswell June 3, 2017 06:25

tacaswell approved these changes Jun 3, 2017

View reviewed changes

anntzer reviewed Jun 4, 2017

View reviewed changes

Use itertools.count for renderer unique ID.

2005738

As suggested by @anntzer.

QuLogic force-pushed the flaky-tests branch from 6fd5e24 to 2005738 Compare June 4, 2017 00:28

tacaswell approved these changes Jun 6, 2017

View reviewed changes

anntzer merged commit b07fbb8 into matplotlib:master Jun 10, 2017

QuLogic deleted the flaky-tests branch June 10, 2017 04:50

anntzer mentioned this pull request Aug 22, 2017

Replace use of renderer._uid by weakref. #9070

Merged

6 tasks

Uh oh!

Fix flaky text tests #8708

Fix flaky text tests #8708

Uh oh!

Conversation

QuLogic commented Jun 3, 2017

PR Checklist

Uh oh!

QuLogic commented Jun 3, 2017

Uh oh!

tacaswell commented Jun 3, 2017 • edited by QuLogic Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anntzer commented Jun 3, 2017

Uh oh!

tacaswell commented Jun 3, 2017

Uh oh!

QuLogic commented Jun 3, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anntzer commented Jun 4, 2017

Uh oh!

QuLogic commented Jun 4, 2017

Uh oh!

anntzer Jun 4, 2017

Choose a reason for hiding this comment

Uh oh!

QuLogic Jun 4, 2017

Choose a reason for hiding this comment

Uh oh!

QuLogic commented Jun 6, 2017

Uh oh!

tacaswell left a comment

Choose a reason for hiding this comment

Uh oh!

anntzer commented Jun 10, 2017

Uh oh!

matthew-brett commented Jun 10, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tacaswell commented Jun 3, 2017 •

edited by QuLogic

Loading

QuLogic commented Jun 3, 2017 •

edited

Loading