Use glyph indices for font tracking in vector formats #30335

QuLogic · 2025-07-19T07:37:40Z

PR summary

With libraqm, string layout produces glyph indices, not character codes, and font features may even produce different glyphs for the same character code (e.g., by picking a different Stylistic Set). Thus we cannot rely on character codes as unique items within a font, and must move toward glyph indices everywhere.

The only thing I don't quite like is that PDF uses character codes for its lookup, and I have to map glyph indices back through an inverse charmap. I think I may have to send everything through CharacterTracker and produce my own limited charmap, but still need to test out what's required. Better stuff for this is done in #30512.

~~This is based on #30143.~~

PR checklist

[n/a] "closes #0000" is in the body of the PR description to link the related issue
new and changed code is tested
[n/a] Plotting related features are demonstrated in an example
[n/a] New Features and API Changes are noted with a directive and release note
[n/a] Documentation complies with general and docstring guidelines

QuLogic · 2025-09-04T04:52:40Z

I've decided to restore the character code in the return values from mathtext, because I've found some use for it in PDF output.

lib/matplotlib/_mathtext.py

lib/matplotlib/_text_helpers.py

lib/matplotlib/backends/_backend_pdf_ps.py

anntzer · 2025-09-05T08:43:00Z

lib/matplotlib/backends/backend_pdf.py

@@ -2274,7 +2268,7 @@ def draw_tex(self, gc, x, y, s, prop, angle, *, mtext=None):
                seq += [['font', pdfname, dvifont.size]]
                oldfont = dvifont
            seq += [['text', x1, y1, [bytes([glyph])], x1+width]]
-            self.file._character_tracker.track(dvifont, chr(glyph))
+            self.file._character_tracker.track_glyph(dvifont, glyph)


I think you need to use text.index here? (with for text in page.text: x1, y1 dvifont, glyph, width = text; ...) (#29868)
I would even stop unpacking and just use text.x, text.y, etc.

I think you might mean #29829 here?

Hmm, it looks like switching to text.index would require a bit more work, as the T1 font subsetter is working with characters too. I guess dbd689f would be the best place for that.

Indeed, you are correct. However this makes things a bit tricky to follow because this means that track_glyph effectively takes a glyph index as second argument if the font is a non-DVI font, but a charcode if the font is a DVI font, or more specifically, a type 1 font (because the type1 subsetter works with characters, as you mention). Is that correct? I guess that's OK as a temporary state because as you mention dbd689f will resolve that discrepancy, but this probably warrants a comment (that can later be dropped in dbd689f) to avoid puzzling the reader?
(Also, keeping this discrepancy would be problematic in the long term as lua/xelatex support will mean that this loop will also sometimes emit glyphs from TTF fonts, but I believe this will again be made clearer by dbd689f.)

lib/matplotlib/backends/backend_ps.py

lib/matplotlib/textpath.py

With libraqm, string layout produces glyph indices, not character codes, and font features may even produce different glyphs for the same character code (e.g., by picking a different Stylistic Set). Thus we cannot rely on character codes as unique items within a font, and must move toward glyph indices everywhere.

QuLogic added this to the v3.11.0 milestone Jul 19, 2025

QuLogic added this to Font and text overhaul Jul 19, 2025

QuLogic added the status: waiting for other PR label Jul 19, 2025

github-project-automation bot moved this to Waiting for other PR in Font and text overhaul Jul 19, 2025

github-actions bot added topic: text backend: ps backend: pdf backend: svg backend: cairo topic: text/fonts topic: text/mathtext labels Jul 19, 2025

github-actions bot added the status: needs rebase label Jul 31, 2025

QuLogic force-pushed the vector-glyphs branch from 33418b6 to e2befff Compare August 23, 2025 09:40

github-actions bot removed topic: text/fonts status: needs rebase labels Aug 23, 2025

QuLogic force-pushed the vector-glyphs branch 2 times, most recently from 4ca7af0 to e684f7b Compare August 27, 2025 02:21

QuLogic removed the status: waiting for other PR label Aug 27, 2025

QuLogic marked this pull request as ready for review August 27, 2025 02:33

QuLogic moved this from Waiting for other PR to Ready for Review in Font and text overhaul Aug 27, 2025

QuLogic force-pushed the vector-glyphs branch 3 times, most recently from 3d5e48c to a2db55c Compare August 30, 2025 05:37

QuLogic force-pushed the vector-glyphs branch from a2db55c to 2118966 Compare September 4, 2025 04:31

QuLogic mentioned this pull request Sep 4, 2025

pdf: Improve text with characters outside embedded font limits #30512

Draft

4 tasks

anntzer reviewed Sep 4, 2025

View reviewed changes

lib/matplotlib/_mathtext.py Show resolved Hide resolved

anntzer reviewed Sep 5, 2025

View reviewed changes

lib/matplotlib/_text_helpers.py Show resolved Hide resolved

anntzer reviewed Sep 5, 2025

View reviewed changes

lib/matplotlib/backends/_backend_pdf_ps.py Outdated Show resolved Hide resolved

anntzer reviewed Sep 5, 2025

View reviewed changes

lib/matplotlib/backends/backend_ps.py Show resolved Hide resolved

anntzer reviewed Sep 5, 2025

View reviewed changes

lib/matplotlib/textpath.py Outdated Show resolved Hide resolved

QuLogic force-pushed the vector-glyphs branch 2 times, most recently from 41a5b7d to df7fa98 Compare September 13, 2025 10:53

QuLogic force-pushed the vector-glyphs branch from df7fa98 to 8de7f4e Compare September 13, 2025 20:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Use glyph indices for font tracking in vector formats #30335

Use glyph indices for font tracking in vector formats #30335

Uh oh!

QuLogic commented Jul 19, 2025 •

edited

Loading

Uh oh!

QuLogic commented Sep 4, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

anntzer Sep 5, 2025 •

edited

Loading

Uh oh!

QuLogic Sep 13, 2025

Uh oh!

QuLogic Sep 13, 2025 •

edited

Loading

Uh oh!

anntzer Sep 14, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Use glyph indices for font tracking in vector formats #30335

Are you sure you want to change the base?

Use glyph indices for font tracking in vector formats #30335

Uh oh!

Conversation

QuLogic commented Jul 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR summary

PR checklist

Uh oh!

QuLogic commented Sep 4, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

anntzer Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

QuLogic Sep 13, 2025

Choose a reason for hiding this comment

Uh oh!

QuLogic Sep 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

anntzer Sep 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

QuLogic commented Jul 19, 2025 •

edited

Loading

anntzer Sep 5, 2025 •

edited

Loading

QuLogic Sep 13, 2025 •

edited

Loading