Simplify and unify character tracking in pdf and ps backends. #15320

anntzer · 2019-09-21T10:32:45Z

Instead of trying to resolve font paths to absolute files and keying
them by inode(!), just track fonts using whatever names they use, and at
the same time simplify used_characters to be a straight mapping of
filenames to character ids (making the attribute private, with a
backcompat shim).
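
To make the "straight mapping of filenames to character ids" concrete, here is a minimal sketch. It is not the class from this PR: the name `SimpleCharacterTracker` and its `track`/`merge` methods are assumptions chosen for illustration, and the font name is just an example string.

```python
class SimpleCharacterTracker:
    """Hypothetical stand-in for the tracker described above."""

    def __init__(self):
        # Maps a font filename, exactly as the caller gave it, to the set
        # of character ids (code points) drawn with that font.
        self.used = {}

    def track(self, fname, text):
        """Record that every character of *text* was drawn with *fname*."""
        self.used.setdefault(fname, set()).update(map(ord, text))

    def merge(self, other):
        """Fold the records of another tracker into this one."""
        for fname, charset in other.used.items():
            self.used.setdefault(fname, set()).update(charset)


tracker = SimpleCharacterTracker()
tracker.track("DejaVuSans.ttf", "abc")
tracker.track("DejaVuSans.ttf", "cd")
```

Keying by the name alone keeps the bookkeeping trivial; the trade-off is that two names for the same file are treated as two fonts.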

The previous approach would avoid embedding the same file twice if it is
given under two different filenames (hardlinks to the same file...), but
it would fail if the user passes a relative path, chdir()s to another
directory, and passes a different font with the same filename,
because of the lru_cache(). None of these seem likely to happen in
practice, and in any case we can cover most of it by making the font
paths absolute before passing them to FreeType (which is going to open
the file anyways, so the cost of making them absolute doesn't matter).
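
The chdir() scenario can be reproduced with any lru_cache-d loader keyed by the path string. The sketch below is an assumption-laden illustration, not matplotlib code: `load_font` and `load_font_abs` are hypothetical helpers, and the "fonts" are just small files with different contents.

```python
import functools
import os
import tempfile


@functools.lru_cache()
def load_font(fname):
    # Stand-in for an expensive font loader; it only reads the file bytes.
    with open(fname, "rb") as fh:
        return fh.read()


def load_font_abs(fname):
    # Resolve against the current directory *before* consulting the cache,
    # so the cache key identifies the file rather than the relative name.
    return load_font(os.path.abspath(fname))


def demo():
    start = os.getcwd()
    results = {}
    with tempfile.TemporaryDirectory() as root:
        try:
            # Two directories, each holding a different "font.ttf".
            for sub, payload in [("a", b"font A"), ("b", b"font B")]:
                directory = os.path.join(root, sub)
                os.mkdir(directory)
                with open(os.path.join(directory, "font.ttf"), "wb") as fh:
                    fh.write(payload)
            for sub in ["a", "b"]:
                os.chdir(os.path.join(root, sub))
                results[sub] = (load_font("font.ttf"),
                                load_font_abs("font.ttf"))
        finally:
            # Leave the temporary directory before it is cleaned up.
            os.chdir(start)
    return results


results = demo()
```

In directory `b`, the relative-path call returns the cached contents from directory `a`, while the absolute-path variant loads the right file.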

missing_references.json needs to be regenerated because of the missing reference to the private CharacterTracker class. To avoid needless diffs caused by reordering in missing_references.json, this goes on top of #15321.

PR Summary

PR Checklist

Has Pytest style unit tests
Code is Flake 8 compliant
New features are documented, with examples if plot related
Documentation is sphinx and numpydoc compliant
Added an entry to doc/users/next_whats_new/ if major new feature (follow instructions in README.rst there)
Documented in doc/api/api_changes.rst if API changed in a backward-incompatible way

timhoffm · 2019-09-22T16:08:10Z

missing-references.json is still large. Probably this needs a rebase?

anntzer · 2019-09-22T16:37:45Z

rebased

timhoffm · 2019-09-22T16:24:58Z

lib/matplotlib/backends/_backend_pdf_ps.py

@@ -16,11 +16,43 @@ def _cached_get_afm_from_fname(fname):
        return AFM(fh)


+class CharacterTracker:
+    def __init__(self):


I would love to see a bit more documentation on the class itself and its public methods.

lib/matplotlib/backends/backend_pdf.py

timhoffm

Modulo a minor formatting issue in the docstring.


sauerburger · 2019-11-12T22:40:37Z

I had a closer look at this implementation. I've tested the code in the scenario from #15629 (linked fonts with different file names, e.g. arial.ttf -> Arial.ttf). There is still one place where the real fonts are used: font_manager.get_font()

In this case, all the characters are missing:
example.pdf, CI run
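
One way such a mismatch can lead to missing characters is sketched below. This is only an illustration under assumptions, not the matplotlib code path: the file names are hypothetical, the "tracker" is a plain dict, and the example assumes a POSIX-like system where creating symlinks is permitted.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as root:
    real = os.path.join(root, "real_font.ttf")    # hypothetical real file
    link = os.path.join(root, "alias_font.ttf")   # hypothetical linked name
    with open(real, "wb") as fh:
        fh.write(b"dummy font data")
    os.symlink(real, link)

    # Characters are recorded under the name the caller used...
    used = {link: {ord("A")}}

    # ...but a later lookup uses the resolved path of the same file.
    resolved = os.path.realpath(link)
    found_by_given_name = used.get(link, set())
    found_by_real_name = used.get(resolved, set())
```

The lookup by resolved path comes back empty even though characters were tracked, which is the kind of symptom reported here: every place that names a font has to agree on the same key.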

anntzer · 2019-11-20T10:13:32Z

Superseded by #15686.

anntzer added backend: ps backend: pdf labels Sep 21, 2019

anntzer force-pushed the unrealpath branch from f9a2ba8 to 2383d89 Compare September 21, 2019 15:14

anntzer mentioned this pull request Sep 21, 2019

Sort missing_references.json. #15321

Merged

6 tasks

anntzer added status: waiting for other PR and removed status: waiting for other PR labels Sep 21, 2019

anntzer force-pushed the unrealpath branch from 2383d89 to 443b7d2 Compare September 22, 2019 16:36

timhoffm reviewed Sep 22, 2019

anntzer added the status: work in progress label Sep 23, 2019

anntzer force-pushed the unrealpath branch from 443b7d2 to f99ee63 Compare November 8, 2019 09:58

anntzer mentioned this pull request Nov 8, 2019

Consistently use realpaths to build XObject names #15629

Closed

6 tasks

anntzer removed the status: work in progress label Nov 8, 2019

anntzer force-pushed the unrealpath branch from f99ee63 to 71b762f Compare November 8, 2019 10:35

timhoffm added the status: needs rebase label Nov 9, 2019

anntzer force-pushed the unrealpath branch from 71b762f to b6dca12 Compare November 9, 2019 14:31

anntzer removed the status: needs rebase label Nov 9, 2019

timhoffm approved these changes Nov 9, 2019

anntzer force-pushed the unrealpath branch from b6dca12 to ef58a59 Compare November 9, 2019 16:09

sauerburger mentioned this pull request Nov 12, 2019

Simplify and unify character tracking in pdf and ps backends (with linked fonts) #15686

Merged

6 tasks

anntzer closed this Nov 20, 2019

anntzer deleted the unrealpath branch November 20, 2019 10:13
