📄 idcharsets.pas
字号:
{ $HDR$}
{**********************************************************************}
{ Unit archived using Team Coherence }
{ Team Coherence is Copyright 2002 by Quality Software Components }
{ }
{ For further information / comments, visit our WEB site at }
{ http://www.TeamCoherence.com }
{**********************************************************************}
{}
{ $Log: 13750: IdCharsets.pas
{
Rev 1.3 10/16/2003 10:49:18 PM DSiders
Added localization comments.
}
{
{ Rev 1.2 10/8/2003 9:49:02 PM GGrieve
{ merge all TIdCharset to here
}
{
{ Rev 1.1 10/3/2003 5:39:26 PM GGrieve
{ dotnet work
}
{
{ Rev 1.0 11/14/2002 02:14:14 PM JPMugaas
}
{
This file is automatically created from
http://www.iana.org/assignments/character-sets
All character set constants are prefixed with "idcs", this could lead
to having a constant named idcscs... because some IANA names are actually
cs...
All constants have been renamed to fit Delphi's naming scheme,
'-', '.', ':' and '+' are converted to '_'
If a collision occurs, a '_' is appended to the name.
Care is taken to
a) put the preferred charset first in a list of identical ones
b) not append a '_' to the preferred charset
Two functions can be found here:
1)
* function FindPreferredCharset(const Charset: TIdCharSet): TIdCharSet;
is provided to find the preferred identical charset from an arbitrary
charset given.
2)
* function FindCharset(const s: string): TIdCharset;
can be used to find a charset from a given string
(if not found idcsINVALID is returned)
For references and people see the end of the file (copied from above location)
Johannes Berg - 2002-08-22
-- header of the original file follows --
===================================================================
CHARACTER SETS
(last updated 2002-06-14)
These are the official names for character sets that may be used in
the Internet and may be referred to in Internet documentation. These
names are expressed in ANSI_X3.4-1968 which is commonly called
US-ASCII or simply ASCII. The character set most commonly use in the
Internet and used especially in protocol standards is US-ASCII, this
is strongly encouraged. The use of the name US-ASCII is also
encouraged.
The character set names may be up to 40 characters taken from the
printable characters of US-ASCII. However, no distinction is made
between use of upper and lower case letters.
The MIBenum value is a unique value for use in MIBs to identify coded
character sets.
The value space for MIBenum values has been divided into three
regions. The first region (3-999) consists of coded character sets
that have been standardized by some standard setting organization.
This region is intended for standards that do not have subset
implementations. The second region (1000-1999) is for the Unicode and
ISO/IEC 10646 coded character sets together with a specification of a
(set of) sub-repertoires that may occur. The third region (>1999) is
intended for vendor specific coded character sets.
Assigned MIB enum Numbers
-------------------------
0-2 Reserved
3-999 Set By Standards Organizations
1000-1999 Unicode / 10646
2000-2999 Vendor
The aliases that start with "cs" have been added for use with the
Printer MIB (see RFC 1759) and contain the standard numbers along with
suggestive names in order to facilitate applications that want to
display the names in user interfaces. The "cs" stands for character
set and is provided for applications that need a lower case first
letter but want to use mixed case thereafter that cannot contain any
special characters, such as underbar ("_") and dash ("-"). If the
character set is from an ISO standard, its cs alias is the ISO
standard number or name. If the character set is not from an ISO
standard, but is registered with ISO (ECMA is the current ISO
Registration Authority), the ISO Registry number is specified as
ISOnnn followed by letters suggestive of the name or standards number
of the code set. When a national or international standard is
revised, the year of revision is added to the cs alias of the new
character set entry in the IANA Registry in order to distinguish the
revised character set from the original character set.
Character Set Reference
}
unit IdCharsets;
interface
// once upon a time Indy had 3 different declarations of TIdCharSet
// now all use this one. For reference, one of the more widely used
// enums and the equivalents in the full enum is listed here:
//
// csGB2312 idcsGB2312 *
// csBig5 idcsBig5 *
// csIso2022jp idcsISO_2022_JP *
// csEucKR idcsEUC_KR *
// csIso88591 idcsISO_8859_1 *
// csWindows1251 idcswindows_1251 *
// csKOI8r idcsKOI8_R *
// csKOI8u idcsKOI8_U *
// csUnicode idcsUNICODE_1_1
//
//
// Classic UTF-8 is idcsUTF_8
//
type
TIdCharSet = (
idcsINVALID, { signifies an invalid character was found when searching
for a charset by name }
idcsUS_ASCII,
idcsANSI_X3_4_1968, {Alias of idcsUS_ASCII}
idcsiso_ir_6, {Alias of idcsUS_ASCII}
idcsANSI_X3_4_1986, {Alias of idcsUS_ASCII}
idcsISO_646_irv_1991, {Alias of idcsUS_ASCII}
idcsASCII, {Alias of idcsUS_ASCII}
idcsISO646_US, {Alias of idcsUS_ASCII}
idcsus, {Alias of idcsUS_ASCII}
idcsIBM367, {Alias of idcsUS_ASCII}
idcscp367, {Alias of idcsUS_ASCII}
idcscsASCII, {Alias of idcsUS_ASCII}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsISO_10646_UTF_1,
idcscsISO10646UTF1, {Alias of idcsISO_10646_UTF_1}
{ Source:
Universal Transfer Format (1), this is the multibyte encoding, that
subsets ASCII-7. It does not have byte ordering issues. }
idcsISO_646_basic_1983,
idcsref, {Alias of idcsISO_646_basic_1983}
idcscsISO646basic1983, {Alias of idcsISO_646_basic_1983}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsINVARIANT,
idcscsINVARIANT, {Alias of idcsINVARIANT}
{ References: RFC1345,KXS2 }
idcsISO_646_irv_1983,
idcsiso_ir_2, {Alias of idcsISO_646_irv_1983}
idcsirv, {Alias of idcsISO_646_irv_1983}
idcscsISO2IntlRefVersion, {Alias of idcsISO_646_irv_1983}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsBS_4730,
idcsiso_ir_4, {Alias of idcsBS_4730}
idcsISO646_GB, {Alias of idcsBS_4730}
idcsgb, {Alias of idcsBS_4730}
idcsuk, {Alias of idcsBS_4730}
idcscsISO4UnitedKingdom, {Alias of idcsBS_4730}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsNATS_SEFI,
idcsiso_ir_8_1, {Alias of idcsNATS_SEFI}
idcscsNATSSEFI, {Alias of idcsNATS_SEFI}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsNATS_SEFI_ADD,
idcsiso_ir_8_2, {Alias of idcsNATS_SEFI_ADD}
idcscsNATSSEFIADD, {Alias of idcsNATS_SEFI_ADD}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsNATS_DANO,
idcsiso_ir_9_1, {Alias of idcsNATS_DANO}
idcscsNATSDANO, {Alias of idcsNATS_DANO}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsNATS_DANO_ADD,
idcsiso_ir_9_2, {Alias of idcsNATS_DANO_ADD}
idcscsNATSDANOADD, {Alias of idcsNATS_DANO_ADD}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsSEN_850200_B,
idcsiso_ir_10, {Alias of idcsSEN_850200_B}
idcsFI, {Alias of idcsSEN_850200_B}
idcsISO646_FI, {Alias of idcsSEN_850200_B}
idcsISO646_SE, {Alias of idcsSEN_850200_B}
idcsse, {Alias of idcsSEN_850200_B}
idcscsISO10Swedish, {Alias of idcsSEN_850200_B}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsSEN_850200_C,
idcsiso_ir_11, {Alias of idcsSEN_850200_C}
idcsISO646_SE2, {Alias of idcsSEN_850200_C}
idcsse2, {Alias of idcsSEN_850200_C}
idcscsISO11SwedishForNames, {Alias of idcsSEN_850200_C}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsKS_C_5601_1987,
idcsiso_ir_149, {Alias of idcsKS_C_5601_1987}
idcsKS_C_5601_1989, {Alias of idcsKS_C_5601_1987}
idcsKSC_5601, {Alias of idcsKS_C_5601_1987}
idcskorean, {Alias of idcsKS_C_5601_1987}
idcscsKSC56011987, {Alias of idcsKS_C_5601_1987}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcscsISO2022KR,
idcsISO_2022_KR, {Alias of idcscsISO2022KR}
{ References: RFC1557,Choi }
{ Source:
RFC-1557 (see also KS_C_5601-1987) }
idcscsEUCKR,
idcsEUC_KR, {Alias of idcscsEUCKR}
{ References: RFC1557,Choi }
{ Source:
RFC-1557 (see also KS_C_5861-1992) }
idcscsISO2022JP,
idcsISO_2022_JP, {Alias of idcscsISO2022JP}
{ References: RFC1468,Murai }
{ Source:
RFC-1468 (see also RFC-2237) }
idcscsISO2022JP2,
idcsISO_2022_JP_2, {Alias of idcscsISO2022JP2}
{ References: RFC1554,Ohta }
{ Source:
RFC-1554 }
idcsISO_2022_CN,
{ References: RFC1922 }
{ Source:
RFC-1922 }
idcsISO_2022_CN_EXT,
{ References: RFC1922 }
{ Source:
RFC-1922 }
idcsJIS_C6220_1969_jp,
idcsJIS_C6220_1969, {Alias of idcsJIS_C6220_1969_jp}
idcsiso_ir_13, {Alias of idcsJIS_C6220_1969_jp}
idcskatakana, {Alias of idcsJIS_C6220_1969_jp}
idcsx0201_7, {Alias of idcsJIS_C6220_1969_jp}
idcscsISO13JISC6220jp, {Alias of idcsJIS_C6220_1969_jp}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsJIS_C6220_1969_ro,
idcsiso_ir_14, {Alias of idcsJIS_C6220_1969_ro}
idcsjp, {Alias of idcsJIS_C6220_1969_ro}
idcsISO646_JP, {Alias of idcsJIS_C6220_1969_ro}
idcscsISO14JISC6220ro, {Alias of idcsJIS_C6220_1969_ro}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsIT,
idcsiso_ir_15, {Alias of idcsIT}
idcsISO646_IT, {Alias of idcsIT}
idcscsISO15Italian, {Alias of idcsIT}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsPT,
idcsiso_ir_16, {Alias of idcsPT}
idcsISO646_PT, {Alias of idcsPT}
idcscsISO16Portuguese, {Alias of idcsPT}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsES,
idcsiso_ir_17, {Alias of idcsES}
idcsISO646_ES, {Alias of idcsES}
idcscsISO17Spanish, {Alias of idcsES}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsgreek7_old,
idcsiso_ir_18, {Alias of idcsgreek7_old}
idcscsISO18Greek7Old, {Alias of idcsgreek7_old}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcslatin_greek,
idcsiso_ir_19, {Alias of idcslatin_greek}
idcscsISO19LatinGreek, {Alias of idcslatin_greek}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsDIN_66003,
idcsiso_ir_21, {Alias of idcsDIN_66003}
idcsde, {Alias of idcsDIN_66003}
idcsISO646_DE, {Alias of idcsDIN_66003}
idcscsISO21German, {Alias of idcsDIN_66003}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcscsISO25French,
idcsNF_Z_62_010_, {Alias of idcscsISO25French}
idcsiso_ir_25, {Alias of idcscsISO25French}
idcsISO646_FR1, {Alias of idcscsISO25French}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsLatin_greek_1,
idcsiso_ir_27, {Alias of idcsLatin_greek_1}
idcscsISO27LatinGreek1, {Alias of idcsLatin_greek_1}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsISO_5427,
idcsiso_ir_37, {Alias of idcsISO_5427}
idcscsISO5427Cyrillic, {Alias of idcsISO_5427}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsJIS_C6226_1978,
idcsiso_ir_42, {Alias of idcsJIS_C6226_1978}
idcscsISO42JISC62261978, {Alias of idcsJIS_C6226_1978}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsBS_viewdata,
idcsiso_ir_47, {Alias of idcsBS_viewdata}
idcscsISO47BSViewdata, {Alias of idcsBS_viewdata}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsINIS,
idcsiso_ir_49, {Alias of idcsINIS}
idcscsISO49INIS, {Alias of idcsINIS}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsINIS_8,
idcsiso_ir_50, {Alias of idcsINIS_8}
idcscsISO50INIS8, {Alias of idcsINIS_8}
{ References: RFC1345,KXS2 }
{ Source:
ECMA registry }
idcsINIS_cyrillic,
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -