Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Commit 0dbebc0

Browse files
committed
Issue #10542: Document that identifiers use XID_Start XID_Continue*.
1 parent 627284c commit 0dbebc0

1 file changed

Lines changed: 5 additions & 1 deletion

File tree

Doc/reference/lexical_analysis.rst

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -292,9 +292,11 @@ Unicode Character Database as included in the :mod:`unicodedata` module.
292292
Identifiers are unlimited in length. Case is significant.
293293

294294
.. productionlist::
295-
identifier: `id_start` `id_continue`*
295+
identifier: `xid_start` `xid_continue`*
296296
id_start: <all characters in general categories Lu, Ll, Lt, Lm, Lo, Nl, the underscore, and characters with the Other_ID_Start property>
297297
id_continue: <all characters in `id_start`, plus characters in the categories Mn, Mc, Nd, Pc and others with the Other_ID_Continue property>
298+
xid_start: <all characters in `id_start` whose NFKC normalization is in "id_start xid_continue*">
299+
xid_continue: <all characters in `id_continue` whose NFKC normalization is in "id_continue*">
298300

299301
The Unicode category codes mentioned above stand for:
300302

@@ -308,6 +310,8 @@ The Unicode category codes mentioned above stand for:
308310
* *Mc* - spacing combining marks
309311
* *Nd* - decimal numbers
310312
* *Pc* - connector punctuations
313+
* *Other_ID_Start* - explicit list of characters in `PropList.txt <http://unicode.org/Public/UNIDATA/PropList.txt>`_ to support backwards compatibility
314+
* *Other_ID_Continue* - likewise
311315

312316
All identifiers are converted into the normal form NFKC while parsing; comparison
313317
of identifiers is based on NFKC.

0 commit comments

Comments
 (0)