Fix error with pyparsing 3 for 3.5.x #21454

timhoffm · 2021-10-24T20:49:34Z

This is the minimal fix suggestion taken from Use named groups in mathtext parser. #21448
This addresses the branch v3.5.x directly. The main branch will be fixed through Use named groups in mathtext parser. #21448 (and then only needs the unpinning as well).
Removed all pyparsing pinning. If this runs through CI, the fix is proven to be working.

timhoffm · 2021-10-24T21:36:36Z

Ping @anntzer. Your proposed fix doesn't seem to work.

anntzer · 2021-10-24T22:11:46Z

Sorry, I initially missed that the error message (that we're matching against) slightly changed as well:

diff --git a/lib/matplotlib/tests/test_mathtext.py b/lib/matplotlib/tests/test_mathtext.py
index 75bc6af9b6..39f2242c85 100644
--- a/lib/matplotlib/tests/test_mathtext.py
+++ b/lib/matplotlib/tests/test_mathtext.py
@@ -250,7 +250,9 @@ def test_fontinfo():
         (r'$\leftF$', r'Expected a delimiter'),
         (r'$\rightF$', r'Unknown symbol: \rightF'),
         (r'$\left(\right$', r'Expected a delimiter'),
-        (r'$\left($', r'Expected "\right"'),
+        # PyParsing 2 uses double quotes, PyParsing 3 uses single quotes and an
+        # extra backslash.
+        (r'$\left($', re.compile(r'Expected ("|\'\\)\\right["\']')),
         (r'$\dfrac$', r'Expected \dfrac{num}{den}'),
         (r'$\dfrac{}{}$', r'Expected \dfrac{num}{den}'),
         (r'$\overset$', r'Expected \overset{annotation}{body}'),
@@ -281,8 +283,8 @@ def test_fontinfo():
 )
 def test_mathtext_exceptions(math, msg):
     parser = mathtext.MathTextParser('agg')
-
-    with pytest.raises(ValueError, match=re.escape(msg)):
+    match = re.escape(msg) if isinstance(msg, str) else msg
+    with pytest.raises(ValueError, match=match):
         parser.parse(math)

(it's basically the second commit of #21448)

- Code suggestion taken from matplotlib#21448 - Removed all pyparsing pinning. If this runs through CI, the fix is proven to be working.

ptmcg · 2021-10-26T11:50:05Z

lib/matplotlib/_mathtext.py

@@ -2044,7 +2044,7 @@ def __init__(self):
        p.accentprefixed <<= Suppress(p.bslash) + oneOf(self._accentprefixed)
        p.symbol_name   <<= (
            Combine(p.bslash + oneOf(list(tex2uni)))


Could this also have been fixed by changing oneOf(list(tex2uni)) to oneOf(list(tex2uni), asKeyword=True) and removing the FollowedBy altogether? This has the benefit of reducing the number of pyparsing terms to be matched, which should translate to faster parsing.

I think the problem of asKeyword is that "word boundary" needs a custom definition here (e.g. underscores and digits also separate words). I this this is configurable on pyparsing's side, but I'd rather just write our own regex (something like (?![a-zA-Z])) and be done with it.

This sounds like a better approach than trying to do custom keywords.

If you really want to collapse this down to a single Regex, then you could use oneOf to build a Regex for you of just the word choices, but then extract the generated regex pattern into a new Regex, something like Regex(r"\\(" + oneOf(text2uni).pattern + ")(?![a-zA-Z])"). (untested, ymmv, etc.)

tex symbols should not have any metacharacters, so the middle part is probably even just "|".join(tex2uni).

Beware of this - oneOf also takes care of reordering longer entries before shorter in the event the shorter entry is a leading subset, and also deduping. You could just reverse sort the keys in tex2uni by length, but if there are known frequencies to some symbols vs others, and more frequent entries were placed first, oneOf would only reorder them to avoid masking entries, whereas sorting by longest-to-shortest would lose some of this priority ordering.

You can test this for yourself:

re.match(r"ab|abb", "abb")

will return "ab", not the longer and more desirable match "abb". And there are several cases in tex2uni of these kind of masking pairs.

Maybe keep it simple first, just do "|".join(sorted(set(tex2uni), key=len, reverse=True)) and then get clever with the frequency-based ordering as a later experiment.

Ah, good point, thanks for mentioning that.

timhoffm added this to the v3.5.0 milestone Oct 24, 2021

timhoffm added the Release critical For bugs that make the library unusable (segfaults, incorrect plots, etc) and major regressions. label Oct 24, 2021

timhoffm mentioned this pull request Oct 24, 2021

Use named groups in mathtext parser. #21448

Merged

7 tasks

timhoffm force-pushed the fix-pyparsing branch from 4dfac9b to ba51894 Compare October 24, 2021 20:53

Fix error with pyparsing 3

90c7afd

- Code suggestion taken from matplotlib#21448 - Removed all pyparsing pinning. If this runs through CI, the fix is proven to be working.

timhoffm force-pushed the fix-pyparsing branch from ba51894 to 90c7afd Compare October 24, 2021 22:40

anntzer approved these changes Oct 25, 2021

View reviewed changes

jklymak approved these changes Oct 25, 2021

View reviewed changes

jklymak merged commit 367a267 into matplotlib:v3.5.x Oct 25, 2021

jklymak mentioned this pull request Oct 25, 2021

[Bug]: Plotting labels with Greek latters in math mode produces Parsing error when plt.show() runs #21463

Closed

jklymak linked an issue Oct 25, 2021 that may be closed by this pull request

[Bug]: Plotting labels with Greek latters in math mode produces Parsing error when plt.show() runs #21463

Closed

speth mentioned this pull request Oct 25, 2021

Broken example CI runs Cantera/cantera#1132

Closed

antalszava mentioned this pull request Oct 25, 2021

Pin pyparsing (master branch) PennyLaneAI/qml#357

Merged

timhoffm mentioned this pull request Oct 25, 2021

Backport #21429 from jklymak/doc-use-mpl-sphinx #21461

Merged

7 tasks

ptmcg reviewed Oct 26, 2021

View reviewed changes

rrjbca mentioned this pull request Oct 26, 2021

Readthedocs failing due to incompatibility between pyparsing>=3.0.0 and matplotlib<=3.4.3 skypyproject/skypy#499

Closed

timhoffm deleted the fix-pyparsing branch October 28, 2021 20:03

anntzer mentioned this pull request Oct 29, 2021

[MNT]: mathtext.MathTextParser is slow #20821

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix error with pyparsing 3 for 3.5.x #21454

Fix error with pyparsing 3 for 3.5.x #21454

Uh oh!

timhoffm commented Oct 24, 2021

Uh oh!

timhoffm commented Oct 24, 2021

Uh oh!

anntzer commented Oct 24, 2021 •

edited

Loading

Uh oh!

ptmcg Oct 26, 2021

Uh oh!

anntzer Oct 29, 2021

Uh oh!

ptmcg Oct 29, 2021

Uh oh!

anntzer Oct 29, 2021

Uh oh!

ptmcg Oct 29, 2021 •

edited

Loading

Uh oh!

anntzer Oct 29, 2021

Uh oh!

Uh oh!

Uh oh!

Fix error with pyparsing 3 for 3.5.x #21454

Fix error with pyparsing 3 for 3.5.x #21454

Uh oh!

Conversation

timhoffm commented Oct 24, 2021

Uh oh!

timhoffm commented Oct 24, 2021

Uh oh!

anntzer commented Oct 24, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ptmcg Oct 26, 2021

Choose a reason for hiding this comment

Uh oh!

anntzer Oct 29, 2021

Choose a reason for hiding this comment

Uh oh!

ptmcg Oct 29, 2021

Choose a reason for hiding this comment

Uh oh!

anntzer Oct 29, 2021

Choose a reason for hiding this comment

Uh oh!

ptmcg Oct 29, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

anntzer Oct 29, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

anntzer commented Oct 24, 2021 •

edited

Loading

ptmcg Oct 29, 2021 •

edited

Loading