nbest-format.5

来自「这是一款很好用的工具包」· 5 代码 · 共 80 行

80 行

.\" $Id: nbest-format.5,v 1.4 2001/08/11 20:03:03 stolcke Exp $.TH nbest-format 5 "$Date: 2001/08/11 20:03:03 $" "SRILM File Formats".SH NAMEnbest-format \- File formats for N-best hypotheses lists.SH DESCRIPTIONSRILM currently understands three different formats for lists of N-best hypotheses for rescoring or 1-best hypothesis extraction.The first two formats originated in the SRI Decipher(TM) recognitionsystem, the third format is particular to SRILM..PPThe first format consists of the header.br	NBestList1.0.brfollowed by one or more lines of the form.br	(\fIscore\fP) \fIw1 w2 w3\fP ....brwhere.I scoreis a composite acoustic/language model scorefrom the recognizer, on the bytelog scale.(A bytelog is a logarithm to base 1.0001, divided by 1024 and rounded to an integer.)This format is output by the SRI Decipher(TM) recognizer,by the.BR "ngram \-nbest" ,and by.BR "nbest-lattice \-write-nbest \-decipher-nbest" ..PPThe second Decipher(TM) format is an extension of the first formatthat encodes word-level scores and time alignments.It is marked by a header of the form.br	NBestList2.0.brThe hypotheses are in the format.br	(\fIscore\fP) \fIw1\fP ( st: \fIst1\fP et: \fIet1\fP g: \fIg1\fP a: \fIa1\fP ) \fIw2\fP ....brwhere words are followed by start and end times, language model and acoustic scores (bytelog-scaled), respectively.  This format may also contain scores and time marks for sub-word units(phones and HMM states), in the same format as above, but with the.IR w 'sdenoting phone and state names.  Sub-word units will have time marks that are contained in the duration of the preceding word units,and may thus be easily identified..PPThe third format understood by SRILM listshypotheses in the format.br	\fIascore\fP \fIlscore\fP \fInwords\fP \fIw1 w2 w3\fP ....brwhere the first three columns contain theacoustic model log probability, the language model log probability,and the number of words in the hypothesis string, respectively.All scores are logarithms base 10.(This format must not be preceded by an ``NBestList'' header.)This format is output by the.B "ngram \-rescore"and by.B nbest-lattice \-write-nbest without the.B \-decipher-nbestoption..SH "SEE ALSO"ngram(1), nbest-lattice(1), segment-nbest(1), nbest-scripts(1), pfsg-scripts(1)..SH BUGSAll these formats are somewhat ad hoc and could use a more rationaldesign.The ``NBestList1.0'' format is particularly cumbersome because it conflates acoustic and language model scores..brA generalization to an arbitrary number of separate scores would be nice..SH AUTHORManual page written by Andreas Stolcke <stolcke@speech.sri.com>..brCopyright 1999-2001 SRI International

nbest-format.5 - 源码说明

本页面展示了「这是一款很好用的工具包」中的 nbest-format.5 源码文件，采用 5 编程语言编写，共 80 行代码。您可以在线阅读完整代码内容，也可以返回资源详情页下载完整源码包进行本地学习和开发。

虫虫下载站收录了大量与工具包相关的技术资源，包括源代码、技术文档、电路图等，是电子工程师和嵌入式开发者的专业学习平台。

⌨️ 快捷键说明

复制代码Ctrl + C

搜索代码Ctrl + F

全屏模式F11

增大字号Ctrl + =

减小字号Ctrl + -

显示快捷键?