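特征模板.html (Feature Templates)

An introduction to CRFs (conditional random fields), using pinyin-to-character conversion as a worked example to introduce the CRF model.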
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD><TITLE></TITLE>
<META http-equiv=Content-Type content="text/html; charset=GB2312">
<META content="MSHTML 6.00.2900.3157" name=GENERATOR></HEAD>
<BODY>
<P class=MsoNormal 
style="MARGIN: 0cm 0cm 0pt; TEXT-INDENT: 24pt; mso-char-indent-count: 2.0"><SPAN 
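style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt">A feature template specifies which positions in the context, and which pieces of information at those positions, the model takes into account. Templates can be designed as needed, which lets human guidance enter the model-building process naturally.</SPAN><SPAN 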
lang=EN-US 
style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt"><?xml:namespace 
prefix = o ns = "urn:schemas-microsoft-com:office:office" 
/><o:p></o:p></SPAN></P>
<P class=MsoNormal 
style="MARGIN: 0cm 0cm 0pt; TEXT-INDENT: 24pt; mso-char-indent-count: 2.0"><SPAN 
style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt">Two kinds of feature template are described below:</SPAN></P>
<OL>
  <LI>
  <DIV class=MsoNormal 
  style="MARGIN: 0cm 0cm 0pt; TEXT-INDENT: 24pt; mso-char-indent-count: 2.0"><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"><STRONG>Unigram feature templates</STRONG> are shown in the </SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt">figure below:<BR><BR><IMG 
  src="CRF帮助文档/feature.JPG" 
  align=baseline><BR><BR>For example, in the template </SPAN><SPAN lang=EN-US 
  style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt">U00:%x[-2,0]</SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt">, </SPAN><SPAN 
  lang=EN-US 
  style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt">U00</SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"> is the template identifier, and </SPAN><SPAN 
  lang=EN-US 
  style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt">-2</SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"> and </SPAN><SPAN 
  lang=EN-US 
  style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt">0</SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"> are the row and column offsets relative to the current position, so </SPAN><SPAN 
  lang=EN-US 
  style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt">[-2,0]</SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"> refers to the pinyin syllable two rows before the current position, in the same column as the current position. In the figure, the feature 
  "U00:shi:被" means that the current character is “被” and the pinyin syllable two positions before it is "shi". Its weight in the figure is <IMG 
  src="CRF帮助文档/mu1.JPG" align=baseline>.</SPAN></DIV></LI>
  <LI>
  <DIV class=MsoNormal 
  style="MARGIN: 0cm 0cm 0pt; TEXT-INDENT: 24pt; mso-char-indent-count: 2.0"><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"><STRONG>Bigram feature templates</STRONG> are shown in the </SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt">figure below:<BR><BR><IMG 
  src="CRF帮助文档/B_features.JPG" 
  align=baseline><BR><BR></SPAN></SPAN><FONT face=宋体>For example, in the template </FONT><SPAN lang=EN-US 
  style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt">B00:%x[-2,0]</SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt">, B</SPAN><SPAN 
  lang=EN-US 
  style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt">00</SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"> is the template identifier, and </SPAN><SPAN 
  lang=EN-US 
  style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt">-2</SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"> and </SPAN><SPAN 
  lang=EN-US 
  style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt">0</SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"> are the row and column offsets relative to the current position, so </SPAN><SPAN 
  lang=EN-US 
  style="FONT-SIZE: 12pt; mso-bidi-font-size: 9.0pt; mso-font-kerning: 0pt">[-2,0]</SPAN><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-font-kerning: 0pt"> refers to the pinyin syllable two rows before the current position, in the same column as the current position. In the figure, the feature 
  "B00:shi:悲观" means that the current character and the next character are “悲观” and the pinyin syllable two positions before the current one is "shi". Its weight in the figure is <IMG 
  src="CRF帮助文档/lameda1.JPG" align=baseline>.</SPAN></DIV></LI></OL></BODY></HTML>
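
The two template kinds above can be illustrated with a minimal Python sketch. It is not the tool's own implementation: the names `expand`, `unigram_feature`, `bigram_feature` and the `PAD` placeholder are hypothetical, the four-syllable toy sentence is made up rather than taken from the figures, and the second number in `%x[row,col]` is assumed to index a column of the observation row. The bigram pairing of the current character with the next one follows the description above.

```python
import re

# Matches one %x[row,col] macro inside a template such as "U00:%x[-2,0]".
MACRO = re.compile(r"%x\[(-?\d+),(\d+)\]")

# Hypothetical placeholder for positions outside the sentence (an assumption).
PAD = "<PAD>"


def expand(template, rows, i):
    """Replace each %x[row,col] macro with the observation at row i+row, column col."""
    def lookup(match):
        row = i + int(match.group(1))   # row offset relative to position i
        col = int(match.group(2))       # column of the observation row (assumed)
        if 0 <= row < len(rows):
            return rows[row][col]
        return PAD
    return MACRO.sub(lookup, template)


def unigram_feature(template, rows, i, label):
    """Pair the expanded observation with a candidate character at position i."""
    return expand(template, rows, i) + ":" + label


def bigram_feature(template, rows, i, label, next_label):
    """Pair the expanded observation with the characters at positions i and i+1."""
    return expand(template, rows, i) + ":" + label + next_label


# Toy input: one pinyin syllable per row, a single observation column (made up).
rows = [("shi",), ("qing",), ("bei",), ("guan",)]

print(unigram_feature("U00:%x[-2,0]", rows, 2, "被"))       # U00:shi:被
print(bigram_feature("B00:%x[-2,0]", rows, 2, "悲", "观"))  # B00:shi:悲观
```

Each distinct feature string produced this way would carry its own learned weight, which is what the weight symbols in the figures stand for.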
