[po] auto sync

github-actions[bot] · github-actions[bot] · commit 43a5cccaaff7 · 2021-06-12T14:09:08.000Z
diff --git a/howto/unicode.po b/howto/unicode.po
@@ -195,7 +195,7 @@ msgstr "与现有的 C 函数（如 ``strlen()`` ）不兼容，因此需要采
 msgid ""
 "Therefore this encoding isn't used very much, and people instead choose "
 "other encodings that are more efficient and convenient, such as UTF-8."
-msgstr ""
+msgstr "因此这种编码用得不多，人们转而选择其他更高效、更方便的编码，比如 UTF-8。 "
 
 #: ../../howto/unicode.rst:126
 msgid ""
@@ -205,26 +205,28 @@ msgid ""
 "and UTF-32 encodings, but they are less frequently used than UTF-8.)  UTF-8 "
 "uses the following rules:"
 msgstr ""
+"UTF-8 是最常用的编码之一，Python 往往默认会采用它。UTF 代表“Unicode Transformation Format”，'8' "
+"表示编码采用 8 位数。（UTF-16 和 UTF-32 编码也是存在的，但其使用频率不如 UTF-8。）UTF-8 的规则如下： "
 
 #: ../../howto/unicode.rst:132
 msgid ""
 "If the code point is < 128, it's represented by the corresponding byte "
 "value."
-msgstr ""
+msgstr "如果码位 < 128，则直接用对应的字节值表示。"
 
 #: ../../howto/unicode.rst:133
 msgid ""
 "If the code point is >= 128, it's turned into a sequence of two, three, or "
 "four bytes, where each byte of the sequence is between 128 and 255."
-msgstr ""
+msgstr "如果码位  >= 128，则转换为 2、3、4 个字节的序列，每个字节值都位于 128 和 255 之间。"
 
 #: ../../howto/unicode.rst:136
 msgid "UTF-8 has several convenient properties:"
-msgstr ""
+msgstr "UTF-8 有几个很方便的特性："
 
 #: ../../howto/unicode.rst:138
 msgid "It can handle any Unicode code point."
-msgstr ""
+msgstr "可以处理任何 Unicode 码位。"
 
 #: ../../howto/unicode.rst:139
 msgid ""
@@ -234,23 +236,26 @@ msgid ""
 " sent through protocols that can't handle zero bytes for anything other than"
 " end-of-string markers."
 msgstr ""
+"Unicode 字符串被转换为一个字节序列，仅在表示空（null ）字符（U+0000）时才会包含零值的字节。这意味着 ``strcpy()`` "
+"之类的C 函数可以处理 UTF-8 字符串，而且用那些不能处理字符串结束符之外的零值字节的协议也能发送。 "
 
 #: ../../howto/unicode.rst:144
 msgid "A string of ASCII text is also valid UTF-8 text."
-msgstr ""
+msgstr "ASCII 字符串也是也是也是合法的 UTF-8 文本。"
 
 #: ../../howto/unicode.rst:145
 msgid ""
 "UTF-8 is fairly compact; the majority of commonly used characters can be "
 "represented with one or two bytes."
-msgstr ""
+msgstr "UTF-8 相当紧凑；大多数常用字符均可用一两个字节表示。"
 
 #: ../../howto/unicode.rst:147
 msgid ""
 "If bytes are corrupted or lost, it's possible to determine the start of the "
 "next UTF-8-encoded code point and resynchronize.  It's also unlikely that "
 "random 8-bit data will look like valid UTF-8."
 msgstr ""
+"如果字节数据被损坏或丢失，则可以找出下一个 UTF-8 码点的开始位置并重新开始同步。随机的 8 位数据也不太可能像是有效的 UTF-8 编码。"
 
 #: ../../howto/unicode.rst:150
 msgid ""
@@ -260,6 +265,8 @@ msgid ""
 "oriented encodings, like UTF-16 and UTF-32, where the sequence of bytes "
 "varies depending on the hardware on which the string was encoded."
 msgstr ""
+"UTF-8 是一种面向字节的编码。编码规定了每个字符由一个或多个字节的序列表示。这避免了整数和双字节编码（如 UTF-16 和 "
+"UTF-32）可能出现的字节顺序问题，那时的字节序列会因执行编码的硬件而异。"
 
 #: ../../howto/unicode.rst:158 ../../howto/unicode.rst:514
 #: ../../howto/unicode.rst:735