bpo-33305: Improve SyntaxError for invalid numerical literals. #6517

serhiy-storchaka · 2018-04-18T11:19:47Z

https://bugs.python.org/issue33305

SylvainDe

This could be great.
Out of curiosity, would it make sense to add the character causing troubles in all cases ?

SylvainDe · 2018-04-19T14:16:20Z

Parser/tokenizer.c

+                                    "invalid digit '%c' in octal literal", c);
+                        }
+                        else {
+                            return syntaxerror(tok, "invalid octal literal");


Would it be possible to add the value of c in this message as well ?

This error is raised in the case if an underscore or 0o is not followed by a digit. What error messages could be helpful for 0o+2, 0o + 2, (2+0o), 0or[]?

I don't know, I'd expect "invalid character '%c' in octal literal" to be useful in all cases.

It is easy to report only if an invalid digit (in the range 2-9 or 8-9) is occurred. In general case there are much subtle details, handling them will complicate the code too much:

Not always an invalid character exists. This error can be raised at the end of the input.

It can be non-ASCII. In this case we need to decode a multibyte UTF-8 for getting a character.

It can be non-printable.

Even if it is printable from the Unicode's point of view, it can look indistinguishably from other characters. For example, non-breakable space character looks like an ordinary space for humans, but not for the Python parser.

Even in ASCII there are non-printable characters, or characters that need special handling: tab, newline, single quote, backslash, ...

It may be worth to produce more specialized error message for some cases, but just reporting the next invalid character is no a way.

Ho, indeed, I didn't think about all these issues...

serhiy-storchaka · 2018-07-07T21:24:42Z

I'm going to merge this PR if there are no other suggestions.

mdickinson

The new messages look great to me; I haven't reviewed the C code changes in detail.

bpo-33305: Improved SyntaxError for invalid numerical literals.

1f4d9da

serhiy-storchaka added type-feature A feature request or enhancement DO-NOT-MERGE labels Apr 18, 2018

the-knights-who-say-ni added the CLA signed label Apr 18, 2018

bedevere-bot added the awaiting merge label Apr 18, 2018

Add tests.

a2ebdc1

SylvainDe reviewed Apr 19, 2018

View reviewed changes

Merge branch 'master' into compiler-invalid-numbers-errors

e51953b

serhiy-storchaka requested a review from mdickinson April 29, 2018 17:29

serhiy-storchaka added 2 commits May 10, 2018 14:39

Merge branch 'master' into compiler-invalid-numbers-errors

18c1321

Merge branch 'master' into compiler-invalid-numbers-errors

9194a70

serhiy-storchaka removed the DO-NOT-MERGE label Jul 7, 2018

mdickinson approved these changes Jul 9, 2018

View reviewed changes

serhiy-storchaka merged commit cf7303e into python:master Jul 9, 2018

bedevere-bot removed the awaiting merge label Jul 9, 2018

serhiy-storchaka deleted the compiler-invalid-numbers-errors branch July 9, 2018 12:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

bpo-33305: Improve SyntaxError for invalid numerical literals. #6517

bpo-33305: Improve SyntaxError for invalid numerical literals. #6517

Uh oh!

serhiy-storchaka commented Apr 18, 2018 •

edited by bedevere-bot

Loading

Uh oh!

SylvainDe left a comment

Uh oh!

SylvainDe Apr 19, 2018

Uh oh!

serhiy-storchaka Apr 24, 2018

Uh oh!

SylvainDe Apr 25, 2018

Uh oh!

serhiy-storchaka May 13, 2018

Uh oh!

SylvainDe May 14, 2018

Uh oh!

serhiy-storchaka commented Jul 7, 2018

Uh oh!

mdickinson left a comment

Uh oh!

Uh oh!

Uh oh!

bpo-33305: Improve SyntaxError for invalid numerical literals. #6517

bpo-33305: Improve SyntaxError for invalid numerical literals. #6517

Uh oh!

Conversation

serhiy-storchaka commented Apr 18, 2018 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SylvainDe left a comment

Choose a reason for hiding this comment

Uh oh!

SylvainDe Apr 19, 2018

Choose a reason for hiding this comment

Uh oh!

serhiy-storchaka Apr 24, 2018

Choose a reason for hiding this comment

Uh oh!

SylvainDe Apr 25, 2018

Choose a reason for hiding this comment

Uh oh!

serhiy-storchaka May 13, 2018

Choose a reason for hiding this comment

Uh oh!

SylvainDe May 14, 2018

Choose a reason for hiding this comment

Uh oh!

serhiy-storchaka commented Jul 7, 2018

Uh oh!

mdickinson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

serhiy-storchaka commented Apr 18, 2018 •

edited by bedevere-bot

Loading