⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 语料格式.html

📁 crf(condintional random fields)简介 用音字转换实例来介绍crf模型
💻 HTML
字号:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD><TITLE></TITLE>
<META http-equiv=Content-Type content="text/html; charset=GB2312">
<META content="MSHTML 6.00.2900.3157" name=GENERATOR></HEAD>
<BODY>
<P><FONT size=4><FONT face=@KNLe><SPAN 
style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-language: ZH-CN; mso-bidi-language: AR-SA"><SPAN 
style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-language: ZH-CN; mso-bidi-language: AR-SA"><SPAN 
style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-family: :ZLe; mso-font-kerning: 0pt"><SPAN lang=EN-US>下面以音字转换作为例子介绍训练和测试语料的格式。在音字转换中,观察序列X是用户输入的拼音序列,例如:wo lai zi 
ha er bin gong ye da xue. 
状态序列是每个音所对应的汉字构成的序列。因此语料的格式如下:</SPAN></SPAN></SPAN></SPAN></FONT></FONT></P>
<OL>
  <LI>
  <DIV><FONT size=4><FONT face=@KNLe 
  ><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-language: ZH-CN; mso-bidi-language: AR-SA" 
  ><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-language: ZH-CN; mso-bidi-language: AR-SA" 
  ><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-family: :ZLe; mso-font-kerning: 0pt" 
  ><SPAN lang=EN-US 
  >训练语料<BR><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<IMG 
  src="file://C:\Documents and Settings\Administrator\My Documents\CRF帮助文档\format1.JPG" 
  align=baseline>&nbsp;&nbsp;&nbsp;&nbsp;<BR></SPAN></SPAN></SPAN></SPAN></FONT></FONT></DIV>
  <LI>
  <DIV><FONT size=4><FONT face=@KNLe 
  ><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-language: ZH-CN; mso-bidi-language: AR-SA" 
  ><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-language: ZH-CN; mso-bidi-language: AR-SA" 
  ><SPAN 
  style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-family: :ZLe; mso-font-kerning: 0pt" 
  ><SPAN lang=EN-US 
  >测试语料<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 
  <BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <IMG 
  src="file://C:\Documents and Settings\Administrator\My Documents\CRF帮助文档\format2.JPG" 
  align=baseline>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 
  &nbsp;&nbsp;&nbsp;&nbsp; <IMG 
  src="file://C:\Documents and Settings\Administrator\My Documents\CRF帮助文档\format3.JPG" 
  align=baseline><BR><BR 
  >说明:左边的测试语料格式是为了计算通过建立的CRF模型进行测试后的识别率。右边的则不能计算。</SPAN></SPAN></SPAN></SPAN></FONT></FONT></DIV></LI></OL>
<P>&nbsp;</P>
<P><FONT size=4><FONT face=@KNLe><SPAN 
style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-language: ZH-CN; mso-bidi-language: AR-SA"><SPAN 
style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-language: ZH-CN; mso-bidi-language: AR-SA"></SPAN></SPAN></FONT></FONT>&nbsp;</P>
<P><FONT size=4><FONT face=@KNLe><SPAN 
style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-language: ZH-CN; mso-bidi-language: AR-SA"><SPAN 
style="FONT-SIZE: 12pt; FONT-FAMILY: KNLe; mso-bidi-font-size: 9.0pt; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-language: ZH-CN; mso-bidi-language: AR-SA">&nbsp;</P>
<P class=MsoNormal 
style="MARGIN: 0cm 0cm 0pt; TEXT-INDENT: 24pt; TEXT-ALIGN: left; mso-char-indent-count: 2.0; mso-layout-grid-align: none" 
align=left>&nbsp;</P>
<P>
<P>&nbsp;</P>
<P></SPAN></SPAN></FONT></FONT>&nbsp;</P>
<P></P></BODY></HTML>

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -