⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 unicode::collate.3

📁 视频监控网络部分的协议ddns,的模块的实现代码,请大家大胆指正.
💻 3
📖 第 1 页 / 共 3 页
字号:
.\" Automatically generated by Pod::Man 2.16 (Pod::Simple 3.05).\".\" Standard preamble:.\" ========================================================================.de Sh \" Subsection heading.br.if t .Sp.ne 5.PP\fB\\$1\fR.PP...de Sp \" Vertical space (when we can't use .PP).if t .sp .5v.if n .sp...de Vb \" Begin verbatim text.ft CW.nf.ne \\$1...de Ve \" End verbatim text.ft R.fi...\" Set up some character translations and predefined strings.  \*(-- will.\" give an unbreakable dash, \*(PI will give pi, \*(L" will give a left.\" double quote, and \*(R" will give a right double quote.  \*(C+ will.\" give a nicer C++.  Capital omega is used to do unbreakable dashes and.\" therefore won't be available.  \*(C` and \*(C' expand to `' in nroff,.\" nothing in troff, for use with C<>..tr \(*W-.ds C+ C\v'-.1v'\h'-1p'\s-2+\h'-1p'+\s0\v'.1v'\h'-1p'.ie n \{\.    ds -- \(*W-.    ds PI pi.    if (\n(.H=4u)&(1m=24u) .ds -- \(*W\h'-12u'\(*W\h'-12u'-\" diablo 10 pitch.    if (\n(.H=4u)&(1m=20u) .ds -- \(*W\h'-12u'\(*W\h'-8u'-\"  diablo 12 pitch.    ds L" "".    ds R" "".    ds C` "".    ds C' ""'br\}.el\{\.    ds -- \|\(em\|.    ds PI \(*p.    ds L" ``.    ds R" '''br\}.\".\" Escape single quotes in literal strings from groff's Unicode transform..ie \n(.g .ds Aq \(aq.el       .ds Aq '.\".\" If the F register is turned on, we'll generate index entries on stderr for.\" titles (.TH), headers (.SH), subsections (.Sh), items (.Ip), and index.\" entries marked with X<> in POD.  Of course, you'll have to process the.\" output yourself in some meaningful fashion..ie \nF \{\.    de IX.    tm Index:\\$1\t\\n%\t"\\$2"...    nr % 0.    rr F.\}.el \{\.    de IX...\}.\".\" Accent mark definitions (@(#)ms.acc 1.5 88/02/08 SMI; from UCB 4.2)..\" Fear.  Run.  Save yourself.  No user-serviceable parts..    \" fudge factors for nroff and troff.if n \{\.    ds #H 0.    ds #V .8m.    ds #F .3m.    ds #[ \f1.    ds #] \fP.\}.if t \{\.    ds #H ((1u-(\\\\n(.fu%2u))*.13m).    ds #V .6m.    ds #F 0.    ds #[ \&.    ds #] \&.\}.    \" simple accents for nroff and troff.if n \{\.    ds ' \&.    ds ` \&.    ds ^ \&.    ds , \&.    ds ~ ~.    ds /.\}.if t \{\.    ds ' \\k:\h'-(\\n(.wu*8/10-\*(#H)'\'\h"|\\n:u".    ds ` \\k:\h'-(\\n(.wu*8/10-\*(#H)'\`\h'|\\n:u'.    ds ^ \\k:\h'-(\\n(.wu*10/11-\*(#H)'^\h'|\\n:u'.    ds , \\k:\h'-(\\n(.wu*8/10)',\h'|\\n:u'.    ds ~ \\k:\h'-(\\n(.wu-\*(#H-.1m)'~\h'|\\n:u'.    ds / \\k:\h'-(\\n(.wu*8/10-\*(#H)'\z\(sl\h'|\\n:u'.\}.    \" troff and (daisy-wheel) nroff accents.ds : \\k:\h'-(\\n(.wu*8/10-\*(#H+.1m+\*(#F)'\v'-\*(#V'\z.\h'.2m+\*(#F'.\h'|\\n:u'\v'\*(#V'.ds 8 \h'\*(#H'\(*b\h'-\*(#H'.ds o \\k:\h'-(\\n(.wu+\w'\(de'u-\*(#H)/2u'\v'-.3n'\*(#[\z\(de\v'.3n'\h'|\\n:u'\*(#].ds d- \h'\*(#H'\(pd\h'-\w'~'u'\v'-.25m'\f2\(hy\fP\v'.25m'\h'-\*(#H'.ds D- D\\k:\h'-\w'D'u'\v'-.11m'\z\(hy\v'.11m'\h'|\\n:u'.ds th \*(#[\v'.3m'\s+1I\s-1\v'-.3m'\h'-(\w'I'u*2/3)'\s-1o\s+1\*(#].ds Th \*(#[\s+2I\s-2\h'-\w'I'u*3/5'\v'-.3m'o\v'.3m'\*(#].ds ae a\h'-(\w'a'u*4/10)'e.ds Ae A\h'-(\w'A'u*4/10)'E.    \" corrections for vroff.if v .ds ~ \\k:\h'-(\\n(.wu*9/10-\*(#H)'\s-2\u~\d\s+2\h'|\\n:u'.if v .ds ^ \\k:\h'-(\\n(.wu*10/11-\*(#H)'\v'-.4m'^\v'.4m'\h'|\\n:u'.    \" for low resolution devices (crt and lpr).if \n(.H>23 .if \n(.V>19 \\{\.    ds : e.    ds 8 ss.    ds o a.    ds d- d\h'-1'\(ga.    ds D- D\h'-1'\(hy.    ds th \o'bp'.    ds Th \o'LP'.    ds ae ae.    ds Ae AE.\}.rm #[ #] #H #V #F C.\" ========================================================================.\".IX Title "Unicode::Collate 3".TH Unicode::Collate 3 "2007-12-18" "perl v5.10.0" "Perl Programmers Reference Guide".\" For nroff, turn off justification.  Always turn off hyphenation; it makes.\" way too many mistakes in technical documents..if n .ad l.nh.SH "NAME"Unicode::Collate \- Unicode Collation Algorithm.SH "SYNOPSIS".IX Header "SYNOPSIS".Vb 1\&  use Unicode::Collate;\&\&  #construct\&  $Collator = Unicode::Collate\->new(%tailoring);\&\&  #sort\&  @sorted = $Collator\->sort(@not_sorted);\&\&  #compare\&  $result = $Collator\->cmp($a, $b); # returns 1, 0, or \-1.\&\&  # If %tailoring is false (i.e. empty),\&  # $Collator should do the default collation..Ve.SH "DESCRIPTION".IX Header "DESCRIPTION"This module is an implementation of Unicode Technical Standard #10(a.k.a. \s-1UTS\s0 #10) \- Unicode Collation Algorithm (a.k.a. \s-1UCA\s0)..Sh "Constructor and Tailoring".IX Subsection "Constructor and Tailoring"The \f(CW\*(C`new\*(C'\fR method returns a collator object..PP.Vb 10\&   $Collator = Unicode::Collate\->new(\&      UCA_Version => $UCA_Version,\&      alternate => $alternate, # deprecated: use of \*(Aqvariable\*(Aq is recommended.\&      backwards => $levelNumber, # or \e@levelNumbers\&      entry => $element,\&      hangul_terminator => $term_primary_weight,\&      ignoreName => qr/$ignoreName/,\&      ignoreChar => qr/$ignoreChar/,\&      katakana_before_hiragana => $bool,\&      level => $collationLevel,\&      normalization  => $normalization_form,\&      overrideCJK => \e&overrideCJK,\&      overrideHangul => \e&overrideHangul,\&      preprocess => \e&preprocess,\&      rearrange => \e@charList,\&      table => $filename,\&      undefName => qr/$undefName/,\&      undefChar => qr/$undefChar/,\&      upper_before_lower => $bool,\&      variable => $variable,\&   );.Ve.IP "UCA_Version" 4.IX Item "UCA_Version"If the tracking version number of \s-1UCA\s0 is given,behavior of that tracking version is emulated on collating.If omitted, the return value of \f(CW\*(C`UCA_Version()\*(C'\fR is used.\&\f(CW\*(C`UCA_Version()\*(C'\fR should return the latest tracking version supported..SpThe supported tracking version: 8, 9, 11, or 14..Sp.Vb 6\&     UCA       Unicode Standard         DUCET (@version)\&     \-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\&      8              3.1                3.0.1 (3.0.1d9)\&      9     3.1 with Corrigendum 3      3.1.1 (3.1.1)\&     11              4.0                4.0.0 (4.0.0)\&     14             4.1.0               4.1.0 (4.1.0).Ve.SpNote: Recent \s-1UTS\s0 #10 renames \*(L"Tracking Version\*(R" to \*(L"Revision.\*(R".IP "alternate" 4.IX Item "alternate"\&\-\- see 3.2.2 Alternate Weighting, version 8 of \s-1UTS\s0 #10.SpFor backward compatibility, \f(CW\*(C`alternate\*(C'\fR (old name) can be usedas an alias for \f(CW\*(C`variable\*(C'\fR..IP "backwards" 4.IX Item "backwards"\&\-\- see 3.1.2 French Accents, \s-1UTS\s0 #10..Sp.Vb 1\&     backwards => $levelNumber or \e@levelNumbers.Ve.SpWeights in reverse order; ex. level 2 (diacritic ordering) in French.If omitted, forwards at all the levels..IP "entry" 4.IX Item "entry"\&\-\- see 3.1 Linguistic Features; 3.2.1 File Format, \s-1UTS\s0 #10..SpIf the same character (or a sequence of characters) existsin the collation element table through \f(CW\*(C`table\*(C'\fR,mapping to collation elements is overrided.If it does not exist, the mapping is defined additionally..Sp.Vb 12\&    entry => <<\*(AqENTRY\*(Aq, # for DUCET v4.0.0 (allkeys\-4.0.0.txt)\&0063 0068 ; [.0E6A.0020.0002.0063] # ch\&0043 0068 ; [.0E6A.0020.0007.0043] # Ch\&0043 0048 ; [.0E6A.0020.0008.0043] # CH\&006C 006C ; [.0F4C.0020.0002.006C] # ll\&004C 006C ; [.0F4C.0020.0007.004C] # Ll\&004C 004C ; [.0F4C.0020.0008.004C] # LL\&00F1      ; [.0F7B.0020.0002.00F1] # n\-tilde\&006E 0303 ; [.0F7B.0020.0002.00F1] # n\-tilde\&00D1      ; [.0F7B.0020.0008.00D1] # N\-tilde\&004E 0303 ; [.0F7B.0020.0008.00D1] # N\-tilde\&ENTRY\&\&    entry => <<\*(AqENTRY\*(Aq, # for DUCET v4.0.0 (allkeys\-4.0.0.txt)\&00E6 ; [.0E33.0020.0002.00E6][.0E8B.0020.0002.00E6] # ae ligature as <a><e>\&00C6 ; [.0E33.0020.0008.00C6][.0E8B.0020.0008.00C6] # AE ligature as <A><E>\&ENTRY.Ve.Sp\&\fB\s-1NOTE:\s0\fR The code point in the \s-1UCA\s0 file format (before \f(CW\*(Aq;\*(Aq\fR)\&\fBmust\fR be a Unicode code point (defined as hexadecimal),but not a native code point.So \f(CW0063\fR must always denote \f(CW\*(C`U+0063\*(C'\fR,but not a character of \f(CW"\ex63"\fR..SpWeighting may vary depending on collation element table.So ensure the weights defined in \f(CW\*(C`entry\*(C'\fR will be consistent withthose in the collation element table loaded via \f(CW\*(C`table\*(C'\fR..SpIn \s-1DUCET\s0 v4.0.0, primary weight of \f(CW\*(C`C\*(C'\fR is \f(CW0E60\fRand that of \f(CW\*(C`D\*(C'\fR is \f(CW\*(C`0E6D\*(C'\fR. So setting primary weight of \f(CW\*(C`CH\*(C'\fR to \f(CW\*(C`0E6A\*(C'\fR(as a value between \f(CW0E60\fR and \f(CW\*(C`0E6D\*(C'\fR)makes ordering as \f(CW\*(C`C < CH < D\*(C'\fR.Exactly speaking \s-1DUCET\s0 already has some characters between \f(CW\*(C`C\*(C'\fR and \f(CW\*(C`D\*(C'\fR:\&\f(CW\*(C`small capital C\*(C'\fR (\f(CW\*(C`U+1D04\*(C'\fR) with primary weight \f(CW0E64\fR,\&\f(CW\*(C`c\-hook/C\-hook\*(C'\fR (\f(CW\*(C`U+0188/U+0187\*(C'\fR) with \f(CW0E65\fR,and \f(CW\*(C`c\-curl\*(C'\fR (\f(CW\*(C`U+0255\*(C'\fR) with \f(CW0E69\fR.Then primary weight \f(CW\*(C`0E6A\*(C'\fR for \f(CW\*(C`CH\*(C'\fR makes \f(CW\*(C`CH\*(C'\fRordered between \f(CW\*(C`c\-curl\*(C'\fR and \f(CW\*(C`D\*(C'\fR..IP "hangul_terminator" 4.IX Item "hangul_terminator"\&\-\- see 7.1.4 Trailing Weights, \s-1UTS\s0 #10..SpIf a true value is given (non-zero but should be positive),it will be added as a terminator primary weight to the end ofevery standard Hangul syllable. Secondary and any higher weightsfor terminator are set to zero.If the value is false or \f(CW\*(C`hangul_terminator\*(C'\fR key does not exist,insertion of terminator weights will not be performed..SpBoundaries of Hangul syllables are determinedaccording to conjoining Jamo behavior in \fIthe Unicode Standard\fRand \fIHangulSyllableType.txt\fR..Sp\&\fBImplementation Note:\fR(1) For expansion mapping (Unicode character mappedto a sequence of collation elements), a terminator will not be addedbetween collation elements, even if Hangul syllable boundary exists there.Addition of terminator is restricted to the next positionto the last collation element..Sp(2) Non-conjoining Hangul letters(Compatibility Jamo, halfwidth Jamo, and enclosed letters) are notautomatically terminated with a terminator primary weight.These characters may need terminator included in a collation elementtable beforehand..IP "ignoreChar" 4.IX Item "ignoreChar".PD 0

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -