Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Unicode 15.1→16.0 #539

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Sep 19, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions .github/workflows/build-jsp.yml
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
name: Build JSP

env:
CURRENT_UVERSION: 15.1.0
PREVIOUS_UVERSION: 15.0.0 # not used at present
CURRENT_UVERSION: 16.0.0
PREVIOUS_UVERSION: 15.1.0 # not used at present

on:
push:
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/cli-build-instructions.yml
Original file line number Diff line number Diff line change
Expand Up @@ -9,8 +9,8 @@ on:
- '*'

env:
CURRENT_UVERSION: 15.1.0
PREVIOUS_UVERSION: 15.0.0
CURRENT_UVERSION: 16.0.0
PREVIOUS_UVERSION: 15.1.0

jobs:

Expand Down
25 changes: 18 additions & 7 deletions docs/data-workflow.md
Original file line number Diff line number Diff line change
Expand Up @@ -95,6 +95,8 @@ and skip any others that are only for internal use.

For the alpha review, publish (at least) the UCD and emoji files, and the charts.

Review/edit the pub/*.sh scripts and advance the version numbers and copyright years.

Run the [pub/copy-alpha-to-draft.sh](https://github.com/unicode-org/unicodetools/blob/main/pub/copy-alpha-to-draft.sh)
script from an up-to-date repo workspace.
The script copies the set of the .../dev/ data files for an alpha snapshot
Expand Down Expand Up @@ -123,7 +125,7 @@ Ask Rick to add other files that are not tracked in the unicodetools repo:
* UCDXML files to .../draft/UCD/ucdxml
* beta charts to .../draft/UCD/charts

### Publish a release
### Publish a release snapshot

After the last UTC meeting for the release, collect all of the data file updates
(mostly from recently opened action items).
Expand All @@ -145,20 +147,29 @@ This script works much like the beta script, except it:
rather than for Public/draft/
* creates a zipped/{version} folder with UCD.zip

### After a release
### Before a release

When the data files are supposed to be final, about a week or two before the release:

Verify once more that the unicodetools repo .../dev/ files match the released/published files.
(They better...)

Create a release tag in the repo.
Example, from four days before Unicode 15.1 was released:
https://github.com/unicode-org/unicodetools/releases/tag/final-15.1-20230908

### After a release

Copy a snapshot of the unicodetools repo .../dev/ files to a versioned unicodetools folder;
for example: .../unicodetools/data/ucd/15.1.0/ .
for example: .../unicodetools/data/ucd/16.0.0/ .
(We no longer append a “-Update” suffix to the folder name.)
List: emoji, idna, security, uca, ucd, ucdxml
Watch for different naming conventions: emoji versions use only two fields, not three.

Create a release tag in the repo.

Edit the pub/*.sh scripts and advance the version numbers and copyright years.
Edit the pub/*.sh scripts and advance the version numbers.

Change the Unicode Tools code as necessary for the start of work on the next version.
Settings.java lastVersion & latestVersion and more.

Example, Unicode 15.1→16.0: https://github.com/unicode-org/unicodetools/pull/539

Declare “main” to be open for the next version.
4 changes: 2 additions & 2 deletions pub/copy-alpha-to-draft.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,8 +12,8 @@ UNITOOLS_DATA=$UNICODETOOLS/unicodetools/data

# Adjust the following for each year and version as needed.
COPY_YEAR=2023
UNI_VER=15.1.0
EMOJI_VER=15.1
UNI_VER=16.0.0
EMOJI_VER=16.0

TODAY=`date --iso-8601`

Expand Down
6 changes: 3 additions & 3 deletions pub/copy-beta-to-draft.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,11 +12,11 @@ UNITOOLS_DATA=$UNICODETOOLS/unicodetools/data

# Adjust the following for each year and version as needed.
COPY_YEAR=2023
UNI_VER=15.1.0
EMOJI_VER=15.1
UNI_VER=16.0.0
EMOJI_VER=16.0
# UTS #10 release revision number to be used in CollationTest.html:
# One more than the last release revision number.
TR10_REV=tr10-48
TR10_REV=tr10-50

TODAY=`date --iso-8601`

Expand Down
6 changes: 3 additions & 3 deletions pub/copy-final.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,11 +12,11 @@ UNITOOLS_DATA=$UNICODETOOLS/unicodetools/data

# Adjust the following for each year and version as needed.
COPY_YEAR=2023
UNI_VER=15.1.0
EMOJI_VER=15.1
UNI_VER=16.0.0
EMOJI_VER=16.0
# UTS #10 release revision number to be used in CollationTest.html:
# *Two* more than the last release revision number.
TR10_REV=tr10-49
TR10_REV=tr10-51

TODAY=`date --iso-8601`

Expand Down
21 changes: 21 additions & 0 deletions unicodetools/data/emoji/15.1/ReadMe.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
# Unicode Emoji
# © COPY_YEAR Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use, see https://www.unicode.org/terms_of_use.html

This directory contains PUB_STATUS data files for Unicode Emoji, Version EMOJI_VER

PUBLIC_EMOJI

emoji-sequences.txt
emoji-zwj-sequences.txt
emoji-test.txt

The following related files are found in the UCD for Version EMOJI_VER

PUBLIC_UCD_EMOJI

emoji-data.txt
emoji-variation-sequences.txt

For documentation, see UTS #51 Unicode Emoji, Version EMOJI_VER
1,489 changes: 1,489 additions & 0 deletions unicodetools/data/emoji/15.1/emoji-sequences.txt

Large diffs are not rendered by default.

5,325 changes: 5,325 additions & 0 deletions unicodetools/data/emoji/15.1/emoji-test.txt

Large diffs are not rendered by default.

1,529 changes: 1,529 additions & 0 deletions unicodetools/data/emoji/15.1/emoji-zwj-sequences.txt

Large diffs are not rendered by default.

749 changes: 749 additions & 0 deletions unicodetools/data/emoji/15.1/internal/emoji-diff.txt

Large diffs are not rendered by default.

87 changes: 87 additions & 0 deletions unicodetools/data/emoji/15.1/internal/emoji-internal.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,87 @@
# emoji-internal.txt
# Date: 2023-02-01, 01:15:48 GMT
# © 2023 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use, see https://www.unicode.org/terms_of_use.html
#
# Emoji Data Internal for UTS #51
# Version: 15.1
#
# For documentation and usage, see https://www.unicode.org/reports/tr51
#
# Format:
# <codepoint(s)> ; <property> # <comments>
# Note: there is no guarantee as to the structure of whitespace or comments
#
# Characters and sequences are listed in code point order. Users should be shown a more natural order.
# See the CLDR collation order for Emoji.


# ================================================

# All omitted code points have Emoji_Gender_Base=No

26F9 ; Emoji_Gender_Base # E0.7 [1] (⛹️) person bouncing ball
1F3C3..1F3C4 ; Emoji_Gender_Base # E0.6 [2] (🏃..🏄) person running..person surfing
1F3CA ; Emoji_Gender_Base # E0.6 [1] (🏊) person swimming
1F3CB..1F3CC ; Emoji_Gender_Base # E0.7 [2] (🏋️..🏌️) person lifting weights..person golfing
1F46E..1F471 ; Emoji_Gender_Base # E0.6 [4] (👮..👱) police officer..person: blond hair
1F473 ; Emoji_Gender_Base # E0.6 [1] (👳) person wearing turban
1F477 ; Emoji_Gender_Base # E0.6 [1] (👷) construction worker
1F481..1F482 ; Emoji_Gender_Base # E0.6 [2] (💁..💂) person tipping hand..guard
1F486..1F487 ; Emoji_Gender_Base # E0.6 [2] (💆..💇) person getting massage..person getting haircut
1F575 ; Emoji_Gender_Base # E0.7 [1] (🕵️) detective
1F645..1F647 ; Emoji_Gender_Base # E0.6 [3] (🙅..🙇) person gesturing NO..person bowing
1F64B ; Emoji_Gender_Base # E0.6 [1] (🙋) person raising hand
1F64D..1F64E ; Emoji_Gender_Base # E0.6 [2] (🙍..🙎) person frowning..person pouting
1F6A3 ; Emoji_Gender_Base # E1.0 [1] (🚣) person rowing boat
1F6B4..1F6B5 ; Emoji_Gender_Base # E1.0 [2] (🚴..🚵) person biking..person mountain biking
1F6B6 ; Emoji_Gender_Base # E0.6 [1] (🚶) person walking
1F926 ; Emoji_Gender_Base # E3.0 [1] (🤦) person facepalming
1F935 ; Emoji_Gender_Base # E3.0 [1] (🤵) person in tuxedo
1F937..1F939 ; Emoji_Gender_Base # E3.0 [3] (🤷..🤹) person shrugging..person juggling
1F93C..1F93E ; Emoji_Gender_Base # E3.0 [3] (🤼..🤾) people wrestling..person playing handball
1F9B8..1F9B9 ; Emoji_Gender_Base # E11.0 [2] (🦸..🦹) superhero..supervillain
1F9CD..1F9CF ; Emoji_Gender_Base # E12.0 [3] (🧍..🧏) person standing..deaf person
1F9D4 ; Emoji_Gender_Base # E5.0 [1] (🧔) person: beard
1F9D6..1F9DF ; Emoji_Gender_Base # E5.0 [10] (🧖..🧟) person in steamy room..zombie

# Total elements: 51

# ================================================

# All omitted code points have Emoji_Explicit_Gender=No

1F385 ; Emoji_Explicit_Gender # E0.6 [1] (🎅) Santa Claus
1F466..1F469 ; Emoji_Explicit_Gender # E0.6 [4] (👦..👩) boy..woman
1F46B ; Emoji_Explicit_Gender # E0.6 [1] (👫) woman and man holding hands
1F46C..1F46D ; Emoji_Explicit_Gender # E1.0 [2] (👬..👭) men holding hands..women holding hands
1F470 ; Emoji_Explicit_Gender # E0.6 [1] (👰) person with veil
1F472 ; Emoji_Explicit_Gender # E0.6 [1] (👲) person with skullcap
1F474..1F475 ; Emoji_Explicit_Gender # E0.6 [2] (👴..👵) old man..old woman
1F478 ; Emoji_Explicit_Gender # E0.6 [1] (👸) princess
1F483 ; Emoji_Explicit_Gender # E0.6 [1] (💃) woman dancing
1F574 ; Emoji_Explicit_Gender # E0.7 [1] (🕴️) person in suit levitating
1F57A ; Emoji_Explicit_Gender # E3.0 [1] (🕺) man dancing
1F930 ; Emoji_Explicit_Gender # E3.0 [1] (🤰) pregnant woman
1F931 ; Emoji_Explicit_Gender # E5.0 [1] (🤱) breast-feeding
1F934..1F936 ; Emoji_Explicit_Gender # E3.0 [3] (🤴..🤶) prince..Mrs. Claus
1F9D5 ; Emoji_Explicit_Gender # E5.0 [1] (🧕) woman with headscarf

# Total elements: 22

# ================================================

# All omitted code points have Multi_Person_Groupings=No

1F46A..1F46B ; Multi_Person_Groupings # E0.6 [2] (👪..👫) family..woman and man holding hands
1F46C..1F46D ; Multi_Person_Groupings # E1.0 [2] (👬..👭) men holding hands..women holding hands
1F46F ; Multi_Person_Groupings # E0.6 [1] (👯) people with bunny ears
1F48F ; Multi_Person_Groupings # E0.6 [1] (💏) kiss
1F491 ; Multi_Person_Groupings # E0.6 [1] (💑) couple with heart
1F91D ; Multi_Person_Groupings # E3.0 [1] (🤝) handshake
1F93C ; Multi_Person_Groupings # E3.0 [1] (🤼) people wrestling

# Total elements: 9

#EOF
Loading