📄 rfc1922.txt
字号:
0xA140-0xA1F5 <-> 0x2121-0x2256 0xA1F6 <-> 0x2258 0xA1F7 <-> 0x2257 0xA1F8-0xA2AE <-> 0x2259-0x234E 0xA2AF-0xA3BF <-> 0x2421-0x2570 0xA3C0-0xA3E0 <-> 0x4221-0x4241 (ETen and Microsoft defined as reserved area)A.2. Big5 (ETen, IBM, and Microsoft version) Level 1 correspondence to CNS 11643-1992 Plane 1: 0xA440-0xACFD <-> 0x4421-0x5322 0xACFE <-> 0x5753 0xAD40-0xAFCF <-> 0x5323-0x5752 0xAFD0-0xBBC7 <-> 0x5754-0x6B4F 0xBBC8-0xBE51 <-> 0x6B51-0x6F5B 0xBE52 <-> 0x6B50 0xBE53-0xC1AA <-> 0x6F5C-0x7534 0xC1AB-0xC2CA <-> 0x7536-0x7736 0xC2CB <-> 0x7535 0xC2CC-0xC360 <-> 0x7737-0x782C 0xC361-0xC3B8 <-> 0x782E-0x7863 0xC3B9 <-> 0x7865 0xC3BA <-> 0x7864 0xC3BB-0xC455 <-> 0x7866-0x7961 0xC456 <-> 0x782D 0xC457-0xC67E <-> 0x7962-0x7D4BA.3. Big5 (ETen, IBM, and Microsoft version) Level 2 correspondence to CNS 11643-1992 Plane 2: 0xC940-0xC949 <-> 0x2121-0x212A 0xC94A <-> 0x4442 # duplicate of Level 1's 0xA461 0xC94B-0xC96B <-> 0x212B-0x214B 0xC96C-0xC9BD <-> 0x214D-0x217C 0xC9BE <-> 0x214C 0xC9BF-0xC9EC <-> 0x217D-0x224CZhu, et al Informational [Page 21]RFC 1922 Chinese Character Encoding March 1996 0xC9ED-0xCAF6 <-> 0x224E-0x2438 0xCAF7 <-> 0x224D 0xCAF8-0xD779 <-> 0x2439-0x387D 0xD77A <-> 0x3F6A 0xD77B-0xDBA6 <-> 0x387E-0x3F69 0xDBA7-0xDDFB <-> 0x3F6B-0x4423 0xDDFC <-> 0x4176 # duplicate of 0xDCD1 0xDDFD-0xE8A2 <-> 0x4424-0x554A 0xE8A3-0xE975 <-> 0x554C-0x5721 0xE976-0xEB5A <-> 0x5723-0x5A27 0xEB5B-0xEBF0 <-> 0x5A29-0x5B3E 0xEBF1 <-> 0x554B 0xEBF2-0xECDD <-> 0x5B3F-0x5C69 0xECDE <-> 0x5722 0xECDF-0xEDA9 <-> 0x5C6A-0x5D73 0xEDAA-0xEEEA <-> 0x5D75-0x6038 0xEEEB <-> 0x642F 0xEEEC-0xF055 <-> 0x6039-0x6242 0xF056 <-> 0x5D74 0xF057-0xF0CA <-> 0x6243-0x6336 0xF0CB <-> 0x5A28 0xF0CC-0xF162 <-> 0x6337-0x642E 0xF163-0xF16A <-> 0x6430-0x6437 0xF16B <-> 0x6761 0xF16C-0xF267 <-> 0x6438-0x6572 0xF268 <-> 0x6934 0xF269-0xF2C2 <-> 0x6573-0x664C 0xF2C3-0xF374 <-> 0x664E-0x6760 0xF375-0xF465 <-> 0x6762-0x6933 0xF466-0xF4B4 <-> 0x6935-0x6961 0xF4B5 <-> 0x664D 0xF4B6-0xF4FC <-> 0x6962-0x6A4A 0xF4FD-0xF662 <-> 0x6A4C-0x6C51 0xF663 <-> 0x6A4B 0xF664-0xF976 <-> 0x6C52-0x7165 0xF977-0xF9C3 <-> 0x7167-0x7233 0xF9C4 <-> 0x7166 0xF9C5 <-> 0x7234 0xF9C6 <-> 0x7240 0xF9C7-0xF9D1 <-> 0x7235-0x723F 0xF9D2-0xF9D5 <-> 0x7241-0x7244A.4. Big5 (ETen and IBM Version) specific numeric symbols correspondence to CNS 11643 Plane 1: (Microsoft version defined this area as UDC - User Defined Character)Zhu, et al Informational [Page 22]RFC 1922 Chinese Character Encoding March 1996 0xC6A1-0xC6BE <-> 0x2621 - 0x263EA.5. Big5 (ETen and IBM Version) specific KangXi radicals correspondence to CNS 11643 Plane 1: (Microsoft version defined as UDC - User Definable Character) 0xC6BF <-> 0x2723 0xC6C0 <-> 0x2724 0xC6C1 <-> 0x2726 0xC6C2 <-> 0x2728 0xC6C3 <-> 0x272D 0xC6C4 <-> 0x272E 0xC6C5 <-> 0x272F 0xC6C6 <-> 0x2734 0xC6C7 <-> 0x2737 0xC6C8 <-> 0x273A 0xC6C9 <-> 0x273C 0xC6CA <-> 0x2742 0xC6CB <-> 0x2747 0xC6CC <-> 0x274E 0xC6CD <-> 0x2753 0xC6CE <-> 0x2754 0xC6CF <-> 0x2755 0xC6D0 <-> 0x2759 0xC6D1 <-> 0x275A 0xC6D2 <-> 0x2761 0xC6D3 <-> 0x2766 0xC6D4 <-> 0x2829 0xC6D5 <-> 0x282A 0xC6D6 <-> 0x2863 0xC6D7 <-> 0x286CA.6. Big5 (ETen and Microsoft version) specific Ideographs correspondence to CNS 11643 Plane 3: (IBM version defined as UDC) 0xF9D6 <-> 0x4337 0xF9D7 <-> 0x4F50 0xF9D8 <-> 0x444E 0xF9D9 <-> 0x504A 0xF9DA <-> 0x2C5D 0xF9DB <-> 0x3D7E 0xF9DC <-> 0x4B5CA.7. Big5 (ETen version only) specific symbols correspondence to CNS 11643 Plane 4: 0xC879 <-> 0x2123Zhu, et al Informational [Page 23]RFC 1922 Chinese Character Encoding March 1996 0xC87B <-> 0x2124 0xC87D <-> 0x212A 0xC8A2 <-> 0x2152A.8. Other Big5 specific symbols which cannot mapping to CNS 11643: 0xC6D8-0xC878 <-> none (ETen and IBM Version) 0xC87A <-> none (ETen version only) 0xC87C <-> none (ETen version only) 0xC87E-0xC8A1 <-> none (ETen version only) 0xC8A3-0xC8CC <-> none (ETen version only) 0xC8CD-0xC8D3 <-> none (ETen and IBM version) 0xF9DD-0xF9FE <-> none (ETen and Microsoft version) Note: However, most of them can be mapped to GB-2312 too. For example, Big5(ETen and IBM version) Hiragana, Katakana, and Cyrillic symbols correspondence to GB-2312: 0xC6E7-0xC77A <-> 0x2421-0x2473 # Japanese Hiragana 0xC77B-0xC7F2 <-> 0x2521-0x2576 # Japanese Katakana 0xC7F3-0xC854 <-> 0xA7A1-0xA7C1 # Cyrillic uppercase 0xC855-0xC875 <-> 0xA7D1-0xA7F1 # Cyrillic lowercase Please notice that there are also many symbols that could be supported by GB-2312, for detail, please refer to the ftp sites in section 1.4 of the "Specification" part of this document.Zhu, et al Informational [Page 24]RFC 1922 Chinese Character Encoding March 1996References [ASCII] American National Standards Institute, "Coded character set: 7-bit American National Standard Code for Information Interchange", ANSI X3.4-1986. [BIG5] Institute for Information Industry, "Chinese Coded Character Set in Computer ", March, 1984 [CJKINF] Ken Lunde, On-line documentation of Chinese/Japanese/Korean Information Processing, 1995, available at: ftp://ftp.ora.com/pub/examples/nutshell/ujip/doc/cjk.inf [CNS-5205] "Information processing: 7-Bit Coded Character Set For Information Interchange", CNS-5205. [CNS-11643] "Chinese Standard Interchange Code", CNS-11643 version 1992; "Standard Interchange Code for Generally-Used Chinese Characters", CNS 11643 version 1986. [GB-1988] "7-bit Coding Character Set for Information Interchange", GB 1988-89. [GB-2312] "Coding of Chinese Ideogram Set for Information Interchange Basic Set", GB 2312-80. [GB-7589] "Code of Chinese Ideograms Set for Information Interchange, the 2nd Supplementary Set", UDC 681.3.048, GB 7589-87. [GB-7590] "Code of Chinese Ideogram Set for Information Interchange, the 4th Supplementary Set", UDC 681.3.048, GB 7590-87. [GB-8565] "Information Processing Coded Character Sets for Text Communication", UDC 681.3, GB 8565-88. [GB-12345] "Code of Chinese Ideogram Set for Information Interchange Supplementary Set", GB/T 12345-90. [GB-13000] "Information Technology: Universal Multiple-Octet Coded Character Set(UCS) Part 1: Architecture and Basic Multilingual Plane", GB13000.1 [GB-13131] "Code of Chinese Ideogram Set for Information Interchange, the 3rd Supplementary Set", GB 13131-91. [GB-13132] "Code of Chinese Ideogram Set for Information Interchange, the 5th Supplementary Set", GB 13132-91.Zhu, et al Informational [Page 25]RFC 1922 Chinese Character Encoding March 1996 [ISO-646] International Organization for Standardization (ISO), "Information Technology: ISO 7-bit Coded Character Set for Information Interchange", International Standard, Ref. No. ISO/IEC 646:1991. [ISO-2022] International Organization for Standardization (ISO), "Information Processing: ISO 7-bit and 8-bit coded character sets: Code extension techniques", International Standard, Ref. No. ISO 2022-1986 (E). [ISO-10021] Information Technology: Text communication: Message-Oriented Text Interchange Systems (MOTIS), ISO 10021, October 1988. [ISO-10646] ISO/IEC 10646-1:1993(E) Information Technology: Universal Multiple-octet Coded Character Set (UCS) Part 1: Architecture and Basic Multilingual Plane" [ISOREG] International Organization for Standardization (ISO), "International Register of Coded Character Sets To Be Used With Escape Sequences". [MIME-1] Borenstein, N., and Freed, N., "MIME (Multipurpose Internet Mail Extensions) Part One: Mechanisms for Specifying and Describing the Format of Internet Message Bodies", RFC 1521, Bellcore, Innosoft, September 1993. [MIME-2] Moore, K., "MIME (Multipurpose Internet Mail Extensions) Part Two: Message Header Extensions for Non-ASCII Text", RFC 1522, University of Tennessee, September 1993. [RFC-822] Crocker, D., "Standard for the Format of ARPA Internet Text Messages", STD 11, RFC 822, University of Delaware, August 1982. [RFC-854] Postel, J., Reynolds J., Telnet Protocol Specification, RFC 854, ISI, May 1983. [RFC-1036] Horton, M., and Adams, R., "Standard for Interchange of USENET Messages", RFC 1036, AT&T Bell Laboratories, Center for Seismic Studies, December 1987. [RFC-1468] Murai J., Crispin, M., and van der Poel, E., Japanese Character Encoding for Internet Messages, June 1993. [RFC-1557] Choi U., Chon K., and Park H., Korean Character Encoding for Internet Messages, December 1993.Zhu, et al Informational [Page 26]RFC 1922 Chinese Character Encoding March 1996 [RFC-1641] Goldsmith D., and Davis M., "Using Unicode with MIME", RFC 1641, Taligent Inc., July 1994 [RFC-1642] Goldsmith D., and Davis M.," UTF-7, A Mail-Safe Transformation Format of Unicode", July 1994 [RFC-1700] Reynolds J., and Postel J., "Assigned Numbers",RFC 1700, STD 2, ISI, October 1994 [SMTP] Postel, J. B. "Simple Mail Transfer Protocol", STD 10, RFC 821, USC/Information Sciences Institute, August 1982. [SMTPEXT] Klensin J., Freed N., Rose M., Stefferud E., and Crocker D., "SMTP Service Extensions", RFC 1651, July 1994. [Unicode 1.1] "The Unicode Standard, Version 1.1", Addison-Wesley, Reading, MA (to be published; the contents of this standard is currently available by combining [Unicode92], [Unicode93], and [Unicode4]). [Unicode92] The Unicode Consortium, "The Unicode Standard: Worldwide Character Encoding: Version 1.0", Volume 1, Addison-Wesley, Reading, MA, 1992 (ISBN 0-201-56788-1). [Unicode93] The Unicode Consortium, "The Unicode Standard: Worldwide Character Encoding: Version 1.0", Volume 2, Addison-Wesley, Reading, MA, 1992 (ISBN 0-201-60845-6). [Unicode4] The Unicode Consortium, "The Unicode Standard: Version 1.1 (Prepublication Edition)", Unicode Technical Report #4 (avaliable from the Unicode Consortium).Zhu, et al Informational [Page 27]
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -