# CaseFolding-5.0.0.txt
# Date: 2006-03-03, 08:22:43 GMT [MD]
#
# Unicode Character Database
# Copyright (c) 1991-2006 Unicode, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
# For documentation, see UCD.html
#
# Case Folding Properties
#
# This file is a supplement to the UnicodeData file.
# It provides a case folding mapping generated from the Unicode Character Database.
# If all characters are mapped according to the full mapping below, then
# case differences (according to UnicodeData.txt and SpecialCasing.txt)
# are eliminated.
#
# The data supports both implementations that require simple case foldings
# (where string lengths don't change), and implementations that allow full case folding
# (where string lengths may grow). Note that where they can be supported, the
# full case foldings are superior: for example, they allow "MASSE" and "Maße" to match.
#
# All code points not listed in this file map to themselves.
#
# NOTE: case folding does not preserve normalization formats!
#
# For information on case folding, see
# UTR #21 Case Mappings, at http://www.unicode.org/unicode/reports/tr21/
#
# ================================================================================
# Format
# ================================================================================
# The entries in this file are in the following machine-readable format:
#
# <code>; <status>; <mapping>; # <name>
#
# The status field is:
# C: common case folding, common mappings shared by both simple and full mappings.
# F: full case folding, mappings that cause strings to grow in length. Multiple characters are separated by spaces.
# S: simple case folding, mappings to single characters where different from F.
# T: special case for uppercase I and dotted uppercase I
#    - For non-Turkic languages, this mapping is normally not used.
#    - For Turkic languages (tr, az), this mapping can be used instead of the normal mapping for these characters.
#      Note that the Turkic mappings do not maintain canonical equivalence without additional processing.
#      See the discussions of case mapping in the Unicode Standard for more information.
#
# Usage:
#  A. To do a simple case folding, use the mappings with status C + S.
#  B. To do a full case folding, use the mappings with status C + F.
#
#    The mappings with status T can be used or omitted depending on the desired case-folding
#    behavior. (The default option is to exclude them.)
#
# =================================================================
0041; C; 0061; # LATIN CAPITAL LETTER A
0042; C; 0062; # LATIN CAPITAL LETTER B
0043; C; 0063; # LATIN CAPITAL LETTER C
0044; C; 0064; # LATIN CAPITAL LETTER D
0045; C; 0065; # LATIN CAPITAL LETTER E
0046; C; 0066; # LATIN CAPITAL LETTER F
0047; C; 0067; # LATIN CAPITAL LETTER G
0048; C; 0068; # LATIN CAPITAL LETTER H
0049; C; 0069; # LATIN CAPITAL LETTER I
0049; T; 0131; # LATIN CAPITAL LETTER I
004A; C; 006A; # LATIN CAPITAL LETTER J
004B; C; 006B; # LATIN CAPITAL LETTER K
004C; C; 006C; # LATIN CAPITAL LETTER L
004D; C; 006D; # LATIN CAPITAL LETTER M
004E; C; 006E; # LATIN CAPITAL LETTER N
004F; C; 006F; # LATIN CAPITAL LETTER O
0050; C; 0070; # LATIN CAPITAL LETTER P
0051; C; 0071; # LATIN CAPITAL LETTER Q
0052; C; 0072; # LATIN CAPITAL LETTER R
0053; C; 0073; # LATIN CAPITAL LETTER S
0054; C; 0074; # LATIN CAPITAL LETTER T
0055; C; 0075; # LATIN CAPITAL LETTER U
0056; C; 0076; # LATIN CAPITAL LETTER V
0057; C; 0077; # LATIN CAPITAL LETTER W
0058; C; 0078; # LATIN CAPITAL LETTER X
0059; C; 0079; # LATIN CAPITAL LETTER Y
005A; C; 007A; # LATIN CAPITAL LETTER Z
00B5; C; 03BC; # MICRO SIGN
00C0; C; 00E0; # LATIN CAPITAL LETTER A WITH GRAVE
00C1; C; 00E1; # LATIN CAPITAL LETTER A WITH ACUTE
00C2; C; 00E2; # LATIN CAPITAL LETTER A WITH CIRCUMFLEX
00C3; C; 00E3; # LATIN CAPITAL LETTER A WITH TILDE
00C4; C; 00E4; # LATIN CAPITAL LETTER A WITH DIAERESIS
00C5; C; 00E5; # LATIN CAPITAL LETTER A WITH RING ABOVE
00C6; C; 00E6; # LATIN CAPITAL LETTER AE
00C7; C; 00E7; # LATIN CAPITAL LETTER C WITH CEDILLA
00C8; C; 00E8; # LATIN CAPITAL LETTER E WITH GRAVE
00C9; C; 00E9; # LATIN CAPITAL LETTER E WITH ACUTE
00CA; C; 00EA; # LATIN CAPITAL LETTER E WITH CIRCUMFLEX
00CB; C; 00EB; # LATIN CAPITAL LETTER E WITH DIAERESIS
00CC; C; 00EC; # LATIN CAPITAL LETTER I WITH GRAVE
00CD; C; 00ED; # LATIN CAPITAL LETTER I WITH ACUTE
00CE; C; 00EE; # LATIN CAPITAL LETTER I WITH CIRCUMFLEX
00CF; C; 00EF; # LATIN CAPITAL LETTER I WITH DIAERESIS
00D0; C; 00F0; # LATIN CAPITAL LETTER ETH
00D1; C; 00F1; # LATIN CAPITAL LETTER N WITH TILDE
00D2; C; 00F2; # LATIN CAPITAL LETTER O WITH GRAVE
00D3; C; 00F3; # LATIN CAPITAL LETTER O WITH ACUTE
00D4; C; 00F4; # LATIN CAPITAL LETTER O WITH CIRCUMFLEX
00D5; C; 00F5; # LATIN CAPITAL LETTER O WITH TILDE
00D6; C; 00F6; # LATIN CAPITAL LETTER O WITH DIAERESIS
00D8; C; 00F8; # LATIN CAPITAL LETTER O WITH STROKE
00D9; C; 00F9; # LATIN CAPITAL LETTER U WITH GRAVE
00DA; C; 00FA; # LATIN CAPITAL LETTER U WITH ACUTE
00DB; C; 00FB; # LATIN CAPITAL LETTER U WITH CIRCUMFLEX
00DC; C; 00FC; # LATIN CAPITAL LETTER U WITH DIAERESIS
00DD; C; 00FD; # LATIN CAPITAL LETTER Y WITH ACUTE
00DE; C; 00FE; # LATIN CAPITAL LETTER THORN
00DF; F; 0073 0073; # LATIN SMALL LETTER SHARP S
0100; C; 0101; # LATIN CAPITAL LETTER A WITH MACRON
0102; C; 0103; # LATIN CAPITAL LETTER A WITH BREVE
0104; C; 0105; # LATIN CAPITAL LETTER A WITH OGONEK
0106; C; 0107; # LATIN CAPITAL LETTER C WITH ACUTE
0108; C; 0109; # LATIN CAPITAL LETTER C WITH CIRCUMFLEX
010A; C; 010B; # LATIN CAPITAL LETTER C WITH DOT ABOVE
010C; C; 010D; # LATIN CAPITAL LETTER C WITH CARON
010E; C; 010F; # LATIN CAPITAL LETTER D WITH CARON
0110; C; 0111; # LATIN CAPITAL LETTER D WITH STROKE
0112; C; 0113; # LATIN CAPITAL LETTER E WITH MACRON
0114; C; 0115; # LATIN CAPITAL LETTER E WITH BREVE
0116; C; 0117; # LATIN CAPITAL LETTER E WITH DOT ABOVE
0118; C; 0119; # LATIN CAPITAL LETTER E WITH OGONEK
011A; C; 011B; # LATIN CAPITAL LETTER E WITH CARON
011C; C; 011D; # LATIN CAPITAL LETTER G WITH CIRCUMFLEX
011E; C; 011F; # LATIN CAPITAL LETTER G WITH BREVE
0120; C; 0121; # LATIN CAPITAL LETTER G WITH DOT ABOVE
0122; C; 0123; # LATIN CAPITAL LETTER G WITH CEDILLA
0124; C; 0125; # LATIN CAPITAL LETTER H WITH CIRCUMFLEX
0126; C; 0127; # LATIN CAPITAL LETTER H WITH STROKE
0128; C; 0129; # LATIN CAPITAL LETTER I WITH TILDE
012A; C; 012B; # LATIN CAPITAL LETTER I WITH MACRON
012C; C; 012D; # LATIN CAPITAL LETTER I WITH BREVE
012E; C; 012F; # LATIN CAPITAL LETTER I WITH OGONEK
0130; F; 0069 0307; # LATIN CAPITAL LETTER I WITH DOT ABOVE
0130; T; 0069; # LATIN CAPITAL LETTER I WITH DOT ABOVE
0132; C; 0133; # LATIN CAPITAL LIGATURE IJ
0134; C; 0135; # LATIN CAPITAL LETTER J WITH CIRCUMFLEX
0136; C; 0137; # LATIN CAPITAL LETTER K WITH CEDILLA
0139; C; 013A; # LATIN CAPITAL LETTER L WITH ACUTE
013B; C; 013C; # LATIN CAPITAL LETTER L WITH CEDILLA
013D; C; 013E; # LATIN CAPITAL LETTER L WITH CARON
013F; C; 0140; # LATIN CAPITAL LETTER L WITH MIDDLE DOT
0141; C; 0142; # LATIN CAPITAL LETTER L WITH STROKE
0143; C; 0144; # LATIN CAPITAL LETTER N WITH ACUTE
0145; C; 0146; # LATIN CAPITAL LETTER N WITH CEDILLA
0147; C; 0148; # LATIN CAPITAL LETTER N WITH CARON
0149; F; 02BC 006E; # LATIN SMALL LETTER N PRECEDED BY APOSTROPHE
014A; C; 014B; # LATIN CAPITAL LETTER ENG
014C; C; 014D; # LATIN CAPITAL LETTER O WITH MACRON
014E; C; 014F; # LATIN CAPITAL LETTER O WITH BREVE
0150; C; 0151; # LATIN CAPITAL LETTER O WITH DOUBLE ACUTE
0152; C; 0153; # LATIN CAPITAL LIGATURE OE
0154; C; 0155; # LATIN CAPITAL LETTER R WITH ACUTE
0156; C; 0157; # LATIN CAPITAL LETTER R WITH CEDILLA
0158; C; 0159; # LATIN CAPITAL LETTER R WITH CARON
015A; C; 015B; # LATIN CAPITAL LETTER S WITH ACUTE
015C; C; 015D; # LATIN CAPITAL LETTER S WITH CIRCUMFLEX
015E; C; 015F; # LATIN CAPITAL LETTER S WITH CEDILLA
0160; C; 0161; # LATIN CAPITAL LETTER S WITH CARON
0162; C; 0163; # LATIN CAPITAL LETTER T WITH CEDILLA
0164; C; 0165; # LATIN CAPITAL LETTER T WITH CARON
0166; C; 0167; # LATIN CAPITAL LETTER T WITH STROKE
0168; C; 0169; # LATIN CAPITAL LETTER U WITH TILDE
016A; C; 016B; # LATIN CAPITAL LETTER U WITH MACRON
016C; C; 016D; # LATIN CAPITAL LETTER U WITH BREVE
016E; C; 016F; # LATIN CAPITAL LETTER U WITH RING ABOVE
0170; C; 0171; # LATIN CAPITAL LETTER U WITH DOUBLE ACUTE
0172; C; 0173; # LATIN CAPITAL LETTER U WITH OGONEK
0174; C; 0175; # LATIN CAPITAL LETTER W WITH CIRCUMFLEX
0176; C; 0177; # LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
0178; C; 00FF; # LATIN CAPITAL LETTER Y WITH DIAERESIS
0179; C; 017A; # LATIN CAPITAL LETTER Z WITH ACUTE
017B; C; 017C; # LATIN CAPITAL LETTER Z WITH DOT ABOVE
017D; C; 017E; # LATIN CAPITAL LETTER Z WITH CARON
017F; C; 0073; # LATIN SMALL LETTER LONG S
0181; C; 0253; # LATIN CAPITAL LETTER B WITH HOOK
0182; C; 0183; # LATIN CAPITAL LETTER B WITH TOPBAR
0184; C; 0185; # LATIN CAPITAL LETTER TONE SIX
0186; C; 0254; # LATIN CAPITAL LETTER OPEN O
0187; C; 0188; # LATIN CAPITAL LETTER C WITH HOOK
0189; C; 0256; # LATIN CAPITAL LETTER AFRICAN D
018A; C; 0257; # LATIN CAPITAL LETTER D WITH HOOK
018B; C; 018C; # LATIN CAPITAL LETTER D WITH TOPBAR
018E; C; 01DD; # LATIN CAPITAL LETTER REVERSED E
018F; C; 0259; # LATIN CAPITAL LETTER SCHWA
0190; C; 025B; # LATIN CAPITAL LETTER OPEN E
0191; C; 0192; # LATIN CAPITAL LETTER F WITH HOOK
0193; C; 0260; # LATIN CAPITAL LETTER G WITH HOOK
0194; C; 0263; # LATIN CAPITAL LETTER GAMMA
0196; C; 0269; # LATIN CAPITAL LETTER IOTA
0197; C; 0268; # LATIN CAPITAL LETTER I WITH STROKE
0198; C; 0199; # LATIN CAPITAL LETTER K WITH HOOK
019C; C; 026F; # LATIN CAPITAL LETTER TURNED M
019D; C; 0272; # LATIN CAPITAL LETTER N WITH LEFT HOOK
019F; C; 0275; # LATIN CAPITAL LETTER O WITH MIDDLE TILDE
01A0; C; 01A1; # LATIN CAPITAL LETTER O WITH HORN
01A2; C; 01A3; # LATIN CAPITAL LETTER OI
01A4; C; 01A5; # LATIN CAPITAL LETTER P WITH HOOK
01A6; C; 0280; # LATIN LETTER YR
01A7; C; 01A8; # LATIN CAPITAL LETTER TONE TWO
01A9; C; 0283; # LATIN CAPITAL LETTER ESH
01AC; C; 01AD; # LATIN CAPITAL LETTER T WITH HOOK
01AE; C; 0288; # LATIN CAPITAL LETTER T WITH RETROFLEX HOOK
01AF; C; 01B0; # LATIN CAPITAL LETTER U WITH HORN
01B1; C; 028A; # LATIN CAPITAL LETTER UPSILON
01B2; C; 028B; # LATIN CAPITAL LETTER V WITH HOOK
01B3; C; 01B4; # LATIN CAPITAL LETTER Y WITH HOOK
01B5; C; 01B6; # LATIN CAPITAL LETTER Z WITH STROKE
01B7; C; 0292; # LATIN CAPITAL LETTER EZH
01B8; C; 01B9; # LATIN CAPITAL LETTER EZH REVERSED
01BC; C; 01BD; # LATIN CAPITAL LETTER TONE FIVE
01C4; C; 01C6; # LATIN CAPITAL LETTER DZ WITH CARON
01C5; C; 01C6; # LATIN CAPITAL LETTER D WITH SMALL LETTER Z WITH CARON
01C7; C; 01C9; # LATIN CAPITAL LETTER LJ
01C8; C; 01C9; # LATIN CAPITAL LETTER L WITH SMALL LETTER J
01CA; C; 01CC; # LATIN CAPITAL LETTER NJ
01CB; C; 01CC; # LATIN CAPITAL LETTER N WITH SMALL LETTER J
01CD; C; 01CE; # LATIN CAPITAL LETTER A WITH CARON
01CF; C; 01D0; # LATIN CAPITAL LETTER I WITH CARON
01D1; C; 01D2; # LATIN CAPITAL LETTER O WITH CARON
01D3; C; 01D4; # LATIN CAPITAL LETTER U WITH CARON
01D5; C; 01D6; # LATIN CAPITAL LETTER U WITH DIAERESIS AND MACRON
01D7; C; 01D8; # LATIN CAPITAL LETTER U WITH DIAERESIS AND ACUTE
01D9; C; 01DA; # LATIN CAPITAL LETTER U WITH DIAERESIS AND CARON
01DB; C; 01DC; # LATIN CAPITAL LETTER U WITH DIAERESIS AND GRAVE
01DE; C; 01DF; # LATIN CAPITAL LETTER A WITH DIAERESIS AND MACRON
01E0; C; 01E1; # LATIN CAPITAL LETTER A WITH DOT ABOVE AND MACRON
01E2; C; 01E3; # LATIN CAPITAL LETTER AE WITH MACRON
01E4; C; 01E5; # LATIN CAPITAL LETTER G WITH STROKE
01E6; C; 01E7; # LATIN CAPITAL LETTER G WITH CARON
01E8; C; 01E9; # LATIN CAPITAL LETTER K WITH CARON
01EA; C; 01EB; # LATIN CAPITAL LETTER O WITH OGONEK
01EC; C; 01ED; # LATIN CAPITAL LETTER O WITH OGONEK AND MACRON
01EE; C; 01EF; # LATIN CAPITAL LETTER EZH WITH CARON
01F0; F; 006A 030C; # LATIN SMALL LETTER J WITH CARON
01F1; C; 01F3; # LATIN CAPITAL LETTER DZ
01F2; C; 01F3; # LATIN CAPITAL LETTER D WITH SMALL LETTER Z
01F4; C; 01F5; # LATIN CAPITAL LETTER G WITH ACUTE
01F6; C; 0195; # LATIN CAPITAL LETTER HWAIR
01F7; C; 01BF; # LATIN CAPITAL LETTER WYNN
01F8; C; 01F9; # LATIN CAPITAL LETTER N WITH GRAVE
01FA; C; 01FB; # LATIN CAPITAL LETTER A WITH RING ABOVE AND ACUTE
01FC; C; 01FD; # LATIN CAPITAL LETTER AE WITH ACUTE
01FE; C; 01FF; # LATIN CAPITAL LETTER O WITH STROKE AND ACUTE
0200; C; 0201; # LATIN CAPITAL LETTER A WITH DOUBLE GRAVE
0202; C; 0203; # LATIN CAPITAL LETTER A WITH INVERTED BREVE
0204; C; 0205; # LATIN CAPITAL LETTER E WITH DOUBLE GRAVE
0206; C; 0207; # LATIN CAPITAL LETTER E WITH INVERTED BREVE
0208; C; 0209; # LATIN CAPITAL LETTER I WITH DOUBLE GRAVE
020A; C; 020B; # LATIN CAPITAL LETTER I WITH INVERTED BREVE
020C; C; 020D; # LATIN CAPITAL LETTER O WITH DOUBLE GRAVE
020E; C; 020F; # LATIN CAPITAL LETTER O WITH INVERTED BREVE
0210; C; 0211; # LATIN CAPITAL LETTER R WITH DOUBLE GRAVE
0212; C; 0213; # LATIN CAPITAL LETTER R WITH INVERTED BREVE
0214; C; 0215; # LATIN CAPITAL LETTER U WITH DOUBLE GRAVE
0216; C; 0217; # LATIN CAPITAL LETTER U WITH INVERTED BREVE
0218; C; 0219; # LATIN CAPITAL LETTER S WITH COMMA BELOW
021A; C; 021B; # LATIN CAPITAL LETTER T WITH COMMA BELOW
021C; C; 021D; # LATIN CAPITAL LETTER YOGH
021E; C; 021F; # LATIN CAPITAL LETTER H WITH CARON
0220; C; 019E; # LATIN CAPITAL LETTER N WITH LONG RIGHT LEG
0222; C; 0223; # LATIN CAPITAL LETTER OU
0224; C; 0225; # LATIN CAPITAL LETTER Z WITH HOOK
0226; C; 0227; # LATIN CAPITAL LETTER A WITH DOT ABOVE
0228; C; 0229; # LATIN CAPITAL LETTER E WITH CEDILLA
022A; C; 022B; # LATIN CAPITAL LETTER O WITH DIAERESIS AND MACRON
022C; C; 022D; # LATIN CAPITAL LETTER O WITH TILDE AND MACRON
022E; C; 022F; # LATIN CAPITAL LETTER O WITH DOT ABOVE
0230; C; 0231; # LATIN CAPITAL LETTER O WITH DOT ABOVE AND MACRON
0232; C; 0233; # LATIN CAPITAL LETTER Y WITH MACRON
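
The usage notes in the file header (status C + S for simple folding, C + F for full folding, T optional and excluded by default) can be sketched in Python roughly as follows. This is an illustration only and not part of the Unicode data file: the names `SAMPLE`, `load_foldings`, and `fold` are hypothetical, and `SAMPLE` embeds just a handful of entries copied from the listing above rather than the whole file.

```python
# Hypothetical sample: a few entries copied from the listing above.
SAMPLE = """\
# <code>; <status>; <mapping>; # <name>
0041; C; 0061; # LATIN CAPITAL LETTER A
0045; C; 0065; # LATIN CAPITAL LETTER E
0049; C; 0069; # LATIN CAPITAL LETTER I
0049; T; 0131; # LATIN CAPITAL LETTER I
004D; C; 006D; # LATIN CAPITAL LETTER M
0053; C; 0073; # LATIN CAPITAL LETTER S
00DF; F; 0073 0073; # LATIN SMALL LETTER SHARP S
0130; F; 0069 0307; # LATIN CAPITAL LETTER I WITH DOT ABOVE
0130; T; 0069; # LATIN CAPITAL LETTER I WITH DOT ABOVE
"""


def load_foldings(text, full=True, turkic=False):
    """Build a {character: folded string} table from CaseFolding-style text.

    full=True  selects status C + F (strings may grow in length).
    full=False selects status C + S (string lengths do not change).
    turkic=True additionally applies status T, overriding C/F/S entries.
    """
    wanted = {"C", "F" if full else "S"}
    table, turkic_table = {}, {}
    for line in text.splitlines():
        data = line.split("#", 1)[0].strip()  # drop the trailing comment
        if not data:
            continue  # comment-only or blank line
        code, status, mapping = (f.strip() for f in data.split(";")[:3])
        src = chr(int(code, 16))
        dst = "".join(chr(int(cp, 16)) for cp in mapping.split())
        if status in wanted:
            table[src] = dst
        elif status == "T" and turkic:
            turkic_table[src] = dst
    table.update(turkic_table)  # T mappings replace the normal ones
    return table


def fold(s, table):
    """Fold each character; code points not listed map to themselves."""
    return "".join(table.get(ch, ch) for ch in s)


full = load_foldings(SAMPLE, full=True)
simple = load_foldings(SAMPLE, full=False)
print(fold("MASSE", full) == fold("Maße", full))      # full folding matches
print(fold("MASSE", simple) == fold("Maße", simple))  # simple folding does not
```

With the sample entries, full folding maps both "MASSE" and "Maße" to "masse", whereas simple folding leaves U+00DF unchanged because the listing has no S entry for it. Passing `turkic=True` makes the T entries take precedence for U+0049 and U+0130.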