⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 character-sets.txt

📁 linux下开源浏览器WebKit的源码,市面上的很多商用浏览器都是移植自WebKit
💻 TXT
📖 第 1 页 / 共 4 页
字号:
Alias: Latin-9Name: ISO-8859-16MIBenum: 112Source: ISOAlias: iso-ir-226Alias: ISO_8859-16:2001Alias: ISO_8859-16Alias: latin10Alias: l10 Name: GBK                                                 MIBenum: 113Source: Chinese IT Standardization Technical Committee          Please see: <http://www.iana.org/assignments/charset-reg/GBK>Alias: CP936Alias: MS936Alias: windows-936Name: GB18030MIBenum: 114Source: Chinese IT Standardization Technical Committee        Please see: <http://www.iana.org/assignments/charset-reg/GB18030>Alias: NoneName:  OSD_EBCDIC_DF04_15MIBenum:  115Source:  Fujitsu-Siemens standard mainframe EBCDIC encoding         Please see: <http://www.iana.org/assignments/charset-reg/OSD-EBCDIC-DF04-15>Alias:   NoneName:  OSD_EBCDIC_DF03_IRVMIBenum:  116Source:  Fujitsu-Siemens standard mainframe EBCDIC encoding         Please see: <http://www.iana.org/assignments/charset-reg/OSD-EBCDIC-DF03-IRV>Alias:  NoneName:  OSD_EBCDIC_DF04_1MIBenum:  117Source:  Fujitsu-Siemens standard mainframe EBCDIC encoding         Please see: <http://www.iana.org/assignments/charset-reg/OSD-EBCDIC-DF04-1>Alias:  None   Name: JIS_EncodingMIBenum: 16Source: JIS X 0202-1991.  Uses ISO 2022 escape sequences to        shift code sets as documented in JIS X 0202-1991.Alias: csJISEncodingName: Shift_JIS  (preferred MIME name)MIBenum: 17Source: This charset is an extension of csHalfWidthKatakana by        adding graphic characters in JIS X 0208.  The CCS's are        JIS X0201:1997 and JIS X0208:1997.  The        complete definition is shown in Appendix 1 of JIS        X0208:1997.        This charset can be used for the top-level media type "text".Alias: MS_Kanji Alias: csShiftJISName: Extended_UNIX_Code_Packed_Format_for_JapaneseMIBenum: 18Source: Standardized by OSF, UNIX International, and UNIX Systems        Laboratories Pacific.  Uses ISO 2022 rules to select               code set 0: US-ASCII (a single 7-bit byte set)               code set 1: JIS X0208-1990 (a double 8-bit byte set)                           restricted to A0-FF in both bytes               code set 2: Half Width Katakana (a single 7-bit byte set)                           requiring SS2 as the character prefix               code set 3: JIS X0212-1990 (a double 7-bit byte set)                           restricted to A0-FF in both bytes                           requiring SS3 as the character prefixAlias: csEUCPkdFmtJapaneseAlias: EUC-JP  (preferred MIME name)Name: Extended_UNIX_Code_Fixed_Width_for_JapaneseMIBenum: 19Source: Used in Japan.  Each character is 2 octets.                code set 0: US-ASCII (a single 7-bit byte set)                              1st byte = 00                              2nd byte = 20-7E                code set 1: JIS X0208-1990 (a double 7-bit byte set)                            restricted  to A0-FF in both bytes                 code set 2: Half Width Katakana (a single 7-bit byte set)                              1st byte = 00                              2nd byte = A0-FF                code set 3: JIS X0212-1990 (a double 7-bit byte set)                            restricted to A0-FF in                             the first byte                and 21-7E in the second byteAlias: csEUCFixWidJapaneseName: ISO-10646-UCS-BasicMIBenum: 1002Source: ASCII subset of Unicode.  Basic Latin = collection 1        See ISO 10646, Appendix AAlias: csUnicodeASCIIName: ISO-10646-Unicode-Latin1MIBenum: 1003Source: ISO Latin-1 subset of Unicode. Basic Latin and Latin-1          Supplement  = collections 1 and 2.  See ISO 10646,          Appendix A.  See RFC 1815.Alias: csUnicodeLatin1Alias: ISO-10646Name: ISO-10646-J-1Source: ISO 10646 Japanese, see RFC 1815.Name: ISO-Unicode-IBM-1261MIBenum: 1005Source: IBM Latin-2, -3, -5, Extended Presentation Set, GCSGID: 1261Alias: csUnicodeIBM1261Name: ISO-Unicode-IBM-1268MIBenum: 1006Source: IBM Latin-4 Extended Presentation Set, GCSGID: 1268Alias: csUnicodeIBM1268Name: ISO-Unicode-IBM-1276MIBenum: 1007Source: IBM Cyrillic Greek Extended Presentation Set, GCSGID: 1276Alias: csUnicodeIBM1276Name: ISO-Unicode-IBM-1264MIBenum: 1008Source: IBM Arabic Presentation Set, GCSGID: 1264Alias: csUnicodeIBM1264Name: ISO-Unicode-IBM-1265MIBenum: 1009Source: IBM Hebrew Presentation Set, GCSGID: 1265Alias: csUnicodeIBM1265Name: ISO-8859-1-Windows-3.0-Latin-1                           [HP-PCL5] MIBenum: 2000Source: Extended ISO 8859-1 Latin-1 for Windows 3.0.          PCL Symbol Set id: 9UAlias: csWindows30Latin1Name: ISO-8859-1-Windows-3.1-Latin-1                           [HP-PCL5] MIBenum: 2001Source: Extended ISO 8859-1 Latin-1 for Windows 3.1.          PCL Symbol Set id: 19UAlias: csWindows31Latin1Name: ISO-8859-2-Windows-Latin-2                               [HP-PCL5] MIBenum: 2002Source: Extended ISO 8859-2.  Latin-2 for Windows 3.1.        PCL Symbol Set id: 9EAlias: csWindows31Latin2Name: ISO-8859-9-Windows-Latin-5                               [HP-PCL5] MIBenum: 2003Source: Extended ISO 8859-9.  Latin-5 for Windows 3.1        PCL Symbol Set id: 5TAlias: csWindows31Latin5Name: Adobe-Standard-Encoding                                    [Adobe]MIBenum: 2005Source: PostScript Language Reference Manual        PCL Symbol Set id: 10JAlias: csAdobeStandardEncodingName: Ventura-US                                               [HP-PCL5]MIBenum: 2006Source: Ventura US.  ASCII plus characters typically used in         publishing, like pilcrow, copyright, registered, trade mark,         section, dagger, and double dagger in the range A0 (hex)         to FF (hex).          PCL Symbol Set id: 14JAlias: csVenturaUS  Name: Ventura-International                                    [HP-PCL5]MIBenum: 2007Source: Ventura International.  ASCII plus coded characters similar         to Roman8.        PCL Symbol Set id: 13JAlias: csVenturaInternationalName: PC8-Danish-Norwegian                                     [HP-PCL5]MIBenum: 2012Source: PC Danish Norwegian        8-bit PC set for Danish Norwegian        PCL Symbol Set id: 11UAlias: csPC8DanishNorwegianName: PC8-Turkish                                              [HP-PCL5]MIBenum: 2014Source: PC Latin Turkish.  PCL Symbol Set id: 9TAlias: csPC8TurkishName: IBM-Symbols                                             [IBM-CIDT] MIBenum: 2015Source: Presentation Set, CPGID: 259Alias: csIBMSymbolsName: IBM-Thai                                                [IBM-CIDT] MIBenum: 2016Source: Presentation Set, CPGID: 838Alias: csIBMThaiName: HP-Legal                                                 [HP-PCL5]MIBenum: 2017Source: PCL 5 Comparison Guide, Hewlett-Packard,        HP part number 5961-0510, October 1992        PCL Symbol Set id: 1UAlias: csHPLegalName: HP-Pi-font                                               [HP-PCL5]MIBenum: 2018Source: PCL 5 Comparison Guide, Hewlett-Packard,        HP part number 5961-0510, October 1992        PCL Symbol Set id: 15UAlias: csHPPiFontName: HP-Math8                                                 [HP-PCL5]MIBenum: 2019Source: PCL 5 Comparison Guide, Hewlett-Packard,        HP part number 5961-0510, October 1992        PCL Symbol Set id: 8MAlias: csHPMath8Name: Adobe-Symbol-Encoding                                      [Adobe]MIBenum: 2020Source: PostScript Language Reference Manual        PCL Symbol Set id: 5MAlias: csHPPSMathName: HP-DeskTop                                               [HP-PCL5]MIBenum: 2021Source: PCL 5 Comparison Guide, Hewlett-Packard,        HP part number 5961-0510, October 1992        PCL Symbol Set id: 7JAlias: csHPDesktopName: Ventura-Math                                             [HP-PCL5]MIBenum: 2022Source: PCL 5 Comparison Guide, Hewlett-Packard,        HP part number 5961-0510, October 1992        PCL Symbol Set id: 6MAlias: csVenturaMathName: Microsoft-Publishing                                     [HP-PCL5]MIBenum: 2023Source: PCL 5 Comparison Guide, Hewlett-Packard,        HP part number 5961-0510, October 1992        PCL Symbol Set id: 6JAlias: csMicrosoftPublishingName: Windows-31JMIBenum: 2024Source: Windows Japanese.  A further extension of Shift_JIS        to include NEC special characters (Row 13), NEC        selection of IBM extensions (Rows 89 to 92), and IBM        extensions (Rows 115 to 119).  The CCS's are        JIS X0201:1997, JIS X0208:1997, and these extensions.        This charset can be used for the top-level media type "text",        but it is of limited or specialized use (see RFC2278).        PCL Symbol Set id: 19KAlias: csWindows31JName: GB2312  (preferred MIME name)MIBenum: 2025Source: Chinese for People's Republic of China (PRC) mixed one byte,         two byte set:           20-7E = one byte ASCII           A1-FE = two byte PRC Kanji         See GB 2312-80         PCL Symbol Set Id: 18CAlias: csGB2312Name: Big5  (preferred MIME name)MIBenum: 2026Source: Chinese for Taiwan Multi-byte set.        PCL Symbol Set Id: 18TAlias: csBig5Name: windows-1250MIBenum: 2250Source: Microsoft  (http://www.iana.org/assignments/charset-reg/windows-1250) [Lazhintseva]Alias: NoneName: windows-1251MIBenum: 2251Source: Microsoft  (http://www.iana.org/assignments/charset-reg/windows-1251) [Lazhintseva]Alias: NoneName: windows-1252MIBenum: 2252Source: Microsoft  (http://www.iana.org/assignments/charset-reg/windows-1252)       [Wendt]Alias: NoneName: windows-1253MIBenum: 2253Source: Microsoft  (http://www.iana.org/assignments/charset-reg/windows-1253) [Lazhintseva]Alias: NoneName: windows-1254MIBenum: 2254Source: Microsoft  (http://www.iana.org/assignments/charset-reg/windows-1254) [Lazhintseva]Alias: NoneName: windows-1255MIBenum: 2255Source: Microsoft  (http://www.iana.org/assignments/charset-reg/windows-1255) [Lazhintseva]Alias: NoneName: windows-1256MIBenum: 2256Source: Microsoft  (http://www.iana.org/assignments/charset-reg/windows-1256) [Lazhintseva]Alias: None Name: windows-1257MIBenum: 2257Source: Microsoft  (http://www.iana.org/assignments/charset-reg/windows-1257) [Lazhintseva]Alias: NoneName: windows-1258MIBenum: 2258Source: Microsoft  (http://www.iana.org/assignments/charset-reg/windows-1258) [Lazhintseva]Alias: NoneName: TIS-620MIBenum: 2259Source: Thai Industrial Standards Institute (TISI)	     [Tantsetthi]Name: HZ-GB-2312MIBenum: 2085Source: RFC 1842, RFC 1843                              [RFC1842, RFC1843]REFERENCES----------[RFC1345]  Simonsen, K., "Character Mnemonics & Character Sets",           RFC 1345, Rationel Almen Planlaegning, Rationel Almen           Planlaegning, June 1992.[RFC1428]  Vaudreuil, G., "Transition of Internet Mail from           Just-Send-8 to 8bit-SMTP/MIME", RFC1428, CNRI, February           1993.[RFC1456]  Vietnamese Standardization Working Group, "Conventions for           Encoding the Vietnamese Language VISCII: VIetnamese            Standard Code for Information Interchange VIQR: VIetnamese            Quoted-Readable Specification Revision 1.1", RFC 1456, May           1993.[RFC1468]  Murai, J., Crispin, M., and E. van der Poel, "Japanese           Character Encoding for Internet Messages", RFC 1468,           Keio University, Panda Programming, June 1993.[RFC1489]  Chernov, A., "Registration of a Cyrillic Character Set",           RFC1489, RELCOM Development Team, July 1993. [RFC1554]  Ohta, M., and K. Handa, "ISO-2022-JP-2: Multilingual           Extension of ISO-2022-JP", RFC1554, Tokyo Institute of           Technology, ETL, December 1993. [RFC1556]  Nussbacher, H., "Handling of Bi-directional Texts in MIME",           RFC1556, Israeli Inter-University, December 1993. [RFC1557]  Choi, U., Chon, K., and H. Park, "Korean Character Encoding           for Internet Messages", KAIST, Solvit Chosun Media,           December 1993.[RFC1641]  Goldsmith, D., and M. Davis, "Using Unicode with MIME",           RFC1641, Taligent, Inc., July 1994. [RFC1642]  Goldsmith, D., and M. Davis, "UTF-7", RFC1642, Taligent,           Inc., July 1994.[RFC1815]  Ohta, M., "Character Sets ISO-10646 and ISO-10646-J-1",           RFC 1815, Tokyo Institute of Technology, July 1995.[Adobe]    Adobe Systems Incorporated, PostScript Language Reference           Manual, second edition, Addison-Wesley Publishing Company,           Inc., 1990.[ECMA Registry]  ISO-IR: International Register of Escape Sequences           http://www.itscj.ipsj.or.jp/ISO-IE/  Note: The current           registration authority is IPSJ/ITSCJ, Japan.[HP-PCL5]  Hewlett-Packard Company, "HP PCL 5 Comparison Guide",            (P/N 5021-0329) pp B-13, 1996.[IBM-CIDT] IBM Corporation, "ABOUT TYPE: IBM's Technical Reference           for Core Interchange Digitized Type", Publication number           S544-3708-01[RFC1842]  Wei, Y., J. Li, and Y. Jiang, "ASCII Printable           Characters-Based Chinese Character Encoding for Internet           Messages", RFC 1842, Harvard University, Rice University,           University of Maryland, August 1995.[RFC1843]  Lee, F., "HZ - A Data Format for Exchanging Files of           Arbitrarily Mixed Chinese and ASCII Characters", RFC 1843,           Stanford University, August 1995.[RFC2152]  Goldsmith, D., M. Davis, "UTF-7: A Mail-Safe Transformation	   Format of Unicode", RFC 2152, Apple Computer, Inc.,	   Taligent Inc., May 1997.[RFC2279]  Yergeau, F., "UTF-8, A Transformation Format of ISO 10646",           RFC 2279, Alis Technologies, January, 1998.[RFC2781]  Hoffman, P., Yergeau, F., "UTF-16, an encoding of ISO 10646",           RFC 2781, February 2000.[RFC3629]  Yergeau, F., "UTF-8, a transformation format of ISO 10646",           RFC3629, November 2003.PEOPLE------[KXS2] Keld Simonsen <Keld.Simonsen@dkuug.dk>[Choi] Woohyong Choi <whchoi@cosmos.kaist.ac.kr>[Davis] Mark Davis, <mark@unicode.org>, April 2002.[Lazhintseva] Katya Lazhintseva, <katyal@MICROSOFT.com>, May 1996.[Mahdi] Tamer Mahdi, <tamer@ca.ibm.com>, August 2000.[Malyshev] Michael Malyshev, <michael_malyshev@mail.ru>, January 2004[Murai] Jun Murai <jun@wide.ad.jp>[Nussbacher] Hank Nussbacher, <hank@vm.tau.ac.il>[Ohta] Masataka Ohta, <mohta@cc.titech.ac.jp>, July 1995.[Phipps] Toby Phipps, <tphipps@peoplesoft.com>, March 2002.[Pond] Rick Pond, <rickpond@vnet.ibm.com>, March 1997.[Robrigado] Reuel Robrigado, <reuelr@ca.ibm.com>, September 2002.[Scherer] Markus Scherer, <markus.scherer@jtcsv.com>, August 2000,           September 2002.[Simonsen] Keld Simonsen, <Keld.Simonsen@rap.dk>, August 2000.[Tantsetthi] Trin Tantsetthi, <trin@mozart.inet.co.th>, September 1998.[Tumasonis] Vladas Tumasonis, <vladas.tumasonis@maf.vu.lt>, August 2000.[Uskov] Alexander Uskov, <auskov@idc.kz>, September 2002.[Wendt] Chris Wendt, <christw@microsoft.com>, December 1999.[Yick] Nicky Yick, <cliac@itsd.gcn.gov.hk>, October 2000.[]

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -