📄 character-sets.txt
字号:
Alias: Latin-9Name: ISO-8859-16MIBenum: 112Source: ISOAlias: iso-ir-226Alias: ISO_8859-16:2001Alias: ISO_8859-16Alias: latin10Alias: l10 Name: GBK MIBenum: 113Source: Chinese IT Standardization Technical Committee Please see: <http://www.iana.org/assignments/charset-reg/GBK>Alias: CP936Alias: MS936Alias: windows-936Name: GB18030MIBenum: 114Source: Chinese IT Standardization Technical Committee Please see: <http://www.iana.org/assignments/charset-reg/GB18030>Alias: NoneName: OSD_EBCDIC_DF04_15MIBenum: 115Source: Fujitsu-Siemens standard mainframe EBCDIC encoding Please see: <http://www.iana.org/assignments/charset-reg/OSD-EBCDIC-DF04-15>Alias: NoneName: OSD_EBCDIC_DF03_IRVMIBenum: 116Source: Fujitsu-Siemens standard mainframe EBCDIC encoding Please see: <http://www.iana.org/assignments/charset-reg/OSD-EBCDIC-DF03-IRV>Alias: NoneName: OSD_EBCDIC_DF04_1MIBenum: 117Source: Fujitsu-Siemens standard mainframe EBCDIC encoding Please see: <http://www.iana.org/assignments/charset-reg/OSD-EBCDIC-DF04-1>Alias: None Name: JIS_EncodingMIBenum: 16Source: JIS X 0202-1991. Uses ISO 2022 escape sequences to shift code sets as documented in JIS X 0202-1991.Alias: csJISEncodingName: Shift_JIS (preferred MIME name)MIBenum: 17Source: This charset is an extension of csHalfWidthKatakana by adding graphic characters in JIS X 0208. The CCS's are JIS X0201:1997 and JIS X0208:1997. The complete definition is shown in Appendix 1 of JIS X0208:1997. This charset can be used for the top-level media type "text".Alias: MS_Kanji Alias: csShiftJISName: Extended_UNIX_Code_Packed_Format_for_JapaneseMIBenum: 18Source: Standardized by OSF, UNIX International, and UNIX Systems Laboratories Pacific. Uses ISO 2022 rules to select code set 0: US-ASCII (a single 7-bit byte set) code set 1: JIS X0208-1990 (a double 8-bit byte set) restricted to A0-FF in both bytes code set 2: Half Width Katakana (a single 7-bit byte set) requiring SS2 as the character prefix code set 3: JIS X0212-1990 (a double 7-bit byte set) restricted to A0-FF in both bytes requiring SS3 as the character prefixAlias: csEUCPkdFmtJapaneseAlias: EUC-JP (preferred MIME name)Name: Extended_UNIX_Code_Fixed_Width_for_JapaneseMIBenum: 19Source: Used in Japan. Each character is 2 octets. code set 0: US-ASCII (a single 7-bit byte set) 1st byte = 00 2nd byte = 20-7E code set 1: JIS X0208-1990 (a double 7-bit byte set) restricted to A0-FF in both bytes code set 2: Half Width Katakana (a single 7-bit byte set) 1st byte = 00 2nd byte = A0-FF code set 3: JIS X0212-1990 (a double 7-bit byte set) restricted to A0-FF in the first byte and 21-7E in the second byteAlias: csEUCFixWidJapaneseName: ISO-10646-UCS-BasicMIBenum: 1002Source: ASCII subset of Unicode. Basic Latin = collection 1 See ISO 10646, Appendix AAlias: csUnicodeASCIIName: ISO-10646-Unicode-Latin1MIBenum: 1003Source: ISO Latin-1 subset of Unicode. Basic Latin and Latin-1 Supplement = collections 1 and 2. See ISO 10646, Appendix A. See RFC 1815.Alias: csUnicodeLatin1Alias: ISO-10646Name: ISO-10646-J-1Source: ISO 10646 Japanese, see RFC 1815.Name: ISO-Unicode-IBM-1261MIBenum: 1005Source: IBM Latin-2, -3, -5, Extended Presentation Set, GCSGID: 1261Alias: csUnicodeIBM1261Name: ISO-Unicode-IBM-1268MIBenum: 1006Source: IBM Latin-4 Extended Presentation Set, GCSGID: 1268Alias: csUnicodeIBM1268Name: ISO-Unicode-IBM-1276MIBenum: 1007Source: IBM Cyrillic Greek Extended Presentation Set, GCSGID: 1276Alias: csUnicodeIBM1276Name: ISO-Unicode-IBM-1264MIBenum: 1008Source: IBM Arabic Presentation Set, GCSGID: 1264Alias: csUnicodeIBM1264Name: ISO-Unicode-IBM-1265MIBenum: 1009Source: IBM Hebrew Presentation Set, GCSGID: 1265Alias: csUnicodeIBM1265Name: ISO-8859-1-Windows-3.0-Latin-1 [HP-PCL5] MIBenum: 2000Source: Extended ISO 8859-1 Latin-1 for Windows 3.0. PCL Symbol Set id: 9UAlias: csWindows30Latin1Name: ISO-8859-1-Windows-3.1-Latin-1 [HP-PCL5] MIBenum: 2001Source: Extended ISO 8859-1 Latin-1 for Windows 3.1. PCL Symbol Set id: 19UAlias: csWindows31Latin1Name: ISO-8859-2-Windows-Latin-2 [HP-PCL5] MIBenum: 2002Source: Extended ISO 8859-2. Latin-2 for Windows 3.1. PCL Symbol Set id: 9EAlias: csWindows31Latin2Name: ISO-8859-9-Windows-Latin-5 [HP-PCL5] MIBenum: 2003Source: Extended ISO 8859-9. Latin-5 for Windows 3.1 PCL Symbol Set id: 5TAlias: csWindows31Latin5Name: Adobe-Standard-Encoding [Adobe]MIBenum: 2005Source: PostScript Language Reference Manual PCL Symbol Set id: 10JAlias: csAdobeStandardEncodingName: Ventura-US [HP-PCL5]MIBenum: 2006Source: Ventura US. ASCII plus characters typically used in publishing, like pilcrow, copyright, registered, trade mark, section, dagger, and double dagger in the range A0 (hex) to FF (hex). PCL Symbol Set id: 14JAlias: csVenturaUS Name: Ventura-International [HP-PCL5]MIBenum: 2007Source: Ventura International. ASCII plus coded characters similar to Roman8. PCL Symbol Set id: 13JAlias: csVenturaInternationalName: PC8-Danish-Norwegian [HP-PCL5]MIBenum: 2012Source: PC Danish Norwegian 8-bit PC set for Danish Norwegian PCL Symbol Set id: 11UAlias: csPC8DanishNorwegianName: PC8-Turkish [HP-PCL5]MIBenum: 2014Source: PC Latin Turkish. PCL Symbol Set id: 9TAlias: csPC8TurkishName: IBM-Symbols [IBM-CIDT] MIBenum: 2015Source: Presentation Set, CPGID: 259Alias: csIBMSymbolsName: IBM-Thai [IBM-CIDT] MIBenum: 2016Source: Presentation Set, CPGID: 838Alias: csIBMThaiName: HP-Legal [HP-PCL5]MIBenum: 2017Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 1UAlias: csHPLegalName: HP-Pi-font [HP-PCL5]MIBenum: 2018Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 15UAlias: csHPPiFontName: HP-Math8 [HP-PCL5]MIBenum: 2019Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 8MAlias: csHPMath8Name: Adobe-Symbol-Encoding [Adobe]MIBenum: 2020Source: PostScript Language Reference Manual PCL Symbol Set id: 5MAlias: csHPPSMathName: HP-DeskTop [HP-PCL5]MIBenum: 2021Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 7JAlias: csHPDesktopName: Ventura-Math [HP-PCL5]MIBenum: 2022Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 6MAlias: csVenturaMathName: Microsoft-Publishing [HP-PCL5]MIBenum: 2023Source: PCL 5 Comparison Guide, Hewlett-Packard, HP part number 5961-0510, October 1992 PCL Symbol Set id: 6JAlias: csMicrosoftPublishingName: Windows-31JMIBenum: 2024Source: Windows Japanese. A further extension of Shift_JIS to include NEC special characters (Row 13), NEC selection of IBM extensions (Rows 89 to 92), and IBM extensions (Rows 115 to 119). The CCS's are JIS X0201:1997, JIS X0208:1997, and these extensions. This charset can be used for the top-level media type "text", but it is of limited or specialized use (see RFC2278). PCL Symbol Set id: 19KAlias: csWindows31JName: GB2312 (preferred MIME name)MIBenum: 2025Source: Chinese for People's Republic of China (PRC) mixed one byte, two byte set: 20-7E = one byte ASCII A1-FE = two byte PRC Kanji See GB 2312-80 PCL Symbol Set Id: 18CAlias: csGB2312Name: Big5 (preferred MIME name)MIBenum: 2026Source: Chinese for Taiwan Multi-byte set. PCL Symbol Set Id: 18TAlias: csBig5Name: windows-1250MIBenum: 2250Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1250) [Lazhintseva]Alias: NoneName: windows-1251MIBenum: 2251Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1251) [Lazhintseva]Alias: NoneName: windows-1252MIBenum: 2252Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1252) [Wendt]Alias: NoneName: windows-1253MIBenum: 2253Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1253) [Lazhintseva]Alias: NoneName: windows-1254MIBenum: 2254Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1254) [Lazhintseva]Alias: NoneName: windows-1255MIBenum: 2255Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1255) [Lazhintseva]Alias: NoneName: windows-1256MIBenum: 2256Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1256) [Lazhintseva]Alias: None Name: windows-1257MIBenum: 2257Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1257) [Lazhintseva]Alias: NoneName: windows-1258MIBenum: 2258Source: Microsoft (http://www.iana.org/assignments/charset-reg/windows-1258) [Lazhintseva]Alias: NoneName: TIS-620MIBenum: 2259Source: Thai Industrial Standards Institute (TISI) [Tantsetthi]Name: HZ-GB-2312MIBenum: 2085Source: RFC 1842, RFC 1843 [RFC1842, RFC1843]REFERENCES----------[RFC1345] Simonsen, K., "Character Mnemonics & Character Sets", RFC 1345, Rationel Almen Planlaegning, Rationel Almen Planlaegning, June 1992.[RFC1428] Vaudreuil, G., "Transition of Internet Mail from Just-Send-8 to 8bit-SMTP/MIME", RFC1428, CNRI, February 1993.[RFC1456] Vietnamese Standardization Working Group, "Conventions for Encoding the Vietnamese Language VISCII: VIetnamese Standard Code for Information Interchange VIQR: VIetnamese Quoted-Readable Specification Revision 1.1", RFC 1456, May 1993.[RFC1468] Murai, J., Crispin, M., and E. van der Poel, "Japanese Character Encoding for Internet Messages", RFC 1468, Keio University, Panda Programming, June 1993.[RFC1489] Chernov, A., "Registration of a Cyrillic Character Set", RFC1489, RELCOM Development Team, July 1993. [RFC1554] Ohta, M., and K. Handa, "ISO-2022-JP-2: Multilingual Extension of ISO-2022-JP", RFC1554, Tokyo Institute of Technology, ETL, December 1993. [RFC1556] Nussbacher, H., "Handling of Bi-directional Texts in MIME", RFC1556, Israeli Inter-University, December 1993. [RFC1557] Choi, U., Chon, K., and H. Park, "Korean Character Encoding for Internet Messages", KAIST, Solvit Chosun Media, December 1993.[RFC1641] Goldsmith, D., and M. Davis, "Using Unicode with MIME", RFC1641, Taligent, Inc., July 1994. [RFC1642] Goldsmith, D., and M. Davis, "UTF-7", RFC1642, Taligent, Inc., July 1994.[RFC1815] Ohta, M., "Character Sets ISO-10646 and ISO-10646-J-1", RFC 1815, Tokyo Institute of Technology, July 1995.[Adobe] Adobe Systems Incorporated, PostScript Language Reference Manual, second edition, Addison-Wesley Publishing Company, Inc., 1990.[ECMA Registry] ISO-IR: International Register of Escape Sequences http://www.itscj.ipsj.or.jp/ISO-IE/ Note: The current registration authority is IPSJ/ITSCJ, Japan.[HP-PCL5] Hewlett-Packard Company, "HP PCL 5 Comparison Guide", (P/N 5021-0329) pp B-13, 1996.[IBM-CIDT] IBM Corporation, "ABOUT TYPE: IBM's Technical Reference for Core Interchange Digitized Type", Publication number S544-3708-01[RFC1842] Wei, Y., J. Li, and Y. Jiang, "ASCII Printable Characters-Based Chinese Character Encoding for Internet Messages", RFC 1842, Harvard University, Rice University, University of Maryland, August 1995.[RFC1843] Lee, F., "HZ - A Data Format for Exchanging Files of Arbitrarily Mixed Chinese and ASCII Characters", RFC 1843, Stanford University, August 1995.[RFC2152] Goldsmith, D., M. Davis, "UTF-7: A Mail-Safe Transformation Format of Unicode", RFC 2152, Apple Computer, Inc., Taligent Inc., May 1997.[RFC2279] Yergeau, F., "UTF-8, A Transformation Format of ISO 10646", RFC 2279, Alis Technologies, January, 1998.[RFC2781] Hoffman, P., Yergeau, F., "UTF-16, an encoding of ISO 10646", RFC 2781, February 2000.[RFC3629] Yergeau, F., "UTF-8, a transformation format of ISO 10646", RFC3629, November 2003.PEOPLE------[KXS2] Keld Simonsen <Keld.Simonsen@dkuug.dk>[Choi] Woohyong Choi <whchoi@cosmos.kaist.ac.kr>[Davis] Mark Davis, <mark@unicode.org>, April 2002.[Lazhintseva] Katya Lazhintseva, <katyal@MICROSOFT.com>, May 1996.[Mahdi] Tamer Mahdi, <tamer@ca.ibm.com>, August 2000.[Malyshev] Michael Malyshev, <michael_malyshev@mail.ru>, January 2004[Murai] Jun Murai <jun@wide.ad.jp>[Nussbacher] Hank Nussbacher, <hank@vm.tau.ac.il>[Ohta] Masataka Ohta, <mohta@cc.titech.ac.jp>, July 1995.[Phipps] Toby Phipps, <tphipps@peoplesoft.com>, March 2002.[Pond] Rick Pond, <rickpond@vnet.ibm.com>, March 1997.[Robrigado] Reuel Robrigado, <reuelr@ca.ibm.com>, September 2002.[Scherer] Markus Scherer, <markus.scherer@jtcsv.com>, August 2000, September 2002.[Simonsen] Keld Simonsen, <Keld.Simonsen@rap.dk>, August 2000.[Tantsetthi] Trin Tantsetthi, <trin@mozart.inet.co.th>, September 1998.[Tumasonis] Vladas Tumasonis, <vladas.tumasonis@maf.vu.lt>, August 2000.[Uskov] Alexander Uskov, <auskov@idc.kz>, September 2002.[Wendt] Chris Wendt, <christw@microsoft.com>, December 1999.[Yick] Nicky Yick, <cliac@itsd.gcn.gov.hk>, October 2000.[]
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -