Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Commit 9def6a3

Browse files
committed
Update to Unicode 3.2 database.
1 parent ecbb0ea commit 9def6a3

8 files changed

Lines changed: 13977 additions & 9410 deletions

File tree

Lib/test/output/test_ucn

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,6 @@ test_ucn
22
Testing General Unicode Character Name, and case insensitivity... done.
33
Testing name to code mapping.... done.
44
Testing code to name mapping for all characters.... done.
5-
Found 10538 characters in the unicode name database
5+
Found 11556 characters in the unicode name database
66
Testing misc. symbols for unicode character name expansion.... done.
77
Testing unicode character name expansion strict error handling.... done.

Lib/test/output/test_unicodedata

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
test_unicodedata
22
Testing Unicode Database...
3-
Methods: 84b72943b1d4320bc1e64a4888f7cdf62eea219a
4-
Functions: 41e1d4792185d6474a43c83ce4f593b1bdb01f8a
3+
Methods: a37276dc2c158bef6dfd908ad34525c97180fad9
4+
Functions: 79b4425f140f5f31179fde6db05772d21e75c228
55
API: ok

Modules/unicodedata.c

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@ _getrecord(PyUnicodeObject* v)
3636

3737
code = (int) *PyUnicode_AS_UNICODE(v);
3838

39-
if (code < 0 || code >= 65536)
39+
if (code < 0 || code >= 0x110000)
4040
index = 0;
4141
else {
4242
index = index1[(code>>SHIFT)];
@@ -219,7 +219,7 @@ unicodedata_decomposition(PyObject *self, PyObject *args)
219219

220220
code = (int) *PyUnicode_AS_UNICODE(v);
221221

222-
if (code < 0 || code >= 65536)
222+
if (code < 0 || code >= 0x110000)
223223
index = 0;
224224
else {
225225
index = decomp_index1[(code>>DECOMP_SHIFT)];
@@ -284,7 +284,7 @@ _getucname(Py_UCS4 code, char* buffer, int buflen)
284284
int word;
285285
unsigned char* w;
286286

287-
if (code >= 65536)
287+
if (code >= 0x110000)
288288
return 0;
289289

290290
/* get offset into phrasebook */

Modules/unicodedata_db.h

Lines changed: 2769 additions & 1497 deletions
Large diffs are not rendered by default.

Modules/unicodename_db.h

Lines changed: 10240 additions & 7489 deletions
Large diffs are not rendered by default.

Objects/unicodectype.c

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@ gettyperecord(Py_UNICODE code)
3636
{
3737
int index;
3838

39-
if (code >= 65536)
39+
if (code >= 0x110000)
4040
index = 0;
4141
else {
4242
index = index1[(code>>SHIFT)];

Objects/unicodetype_db.h

Lines changed: 959 additions & 415 deletions
Large diffs are not rendered by default.

Tools/unicode/makeunicodedata.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -448,7 +448,7 @@ class UnicodeData:
448448

449449
def __init__(self, filename, expand=1):
450450
file = open(filename)
451-
table = [None] * 65536
451+
table = [None] * 0x110000
452452
while 1:
453453
s = file.readline()
454454
if not s:
@@ -476,7 +476,7 @@ def __init__(self, filename, expand=1):
476476
# public attributes
477477
self.filename = filename
478478
self.table = table
479-
self.chars = range(65536) # unicode
479+
self.chars = range(0x110000) # unicode 3.2
480480

481481
def uselatin1(self):
482482
# restrict character range to ISO Latin 1

0 commit comments

Comments
 (0)