
📄 ppl-from-log.gawk

📁 This is a very useful toolkit
💻 GAWK
#!/usr/local/bin/gawk -f
#
# ppl-from-log --
#    Recomputes perplexity from (a subset of) the output of
#
#        ngram -debug 2 -ppl
#
#    This is useful if one wants to analyse predictability of certain
#    words/contexts.
#
# usage: ppl-from-log [howmany=<numsents>] ppl-log-file
#
# Copyright (c) 1995, SRI International.  All Rights Reserved
#
# $Header: /home/speech/stolcke/project/srilm/devel/utils/src/RCS/ppl-from-log,v 1.3 1995/11/05 03:13:01 stolcke Exp $
#

# Print the summary: counts, total log10 probability, and perplexity.
function result () {
    ppl = exp(-sum/(sentences + words - oovs) * M_LN10);

    printf "file %s: %d sentences, %d words, %d oovs\n", \
        FILENAME, sentences, words, oovs;
    printf "%d zeroprobs, logprob= %f, ppl= %f\n", \
        0, sum , ppl;
}

BEGIN {
    M_LN10 = 2.30258509299404568402;    # from <math.h>
}

# Per-word probability line: a leading tab followed by "p( ".
/^\tp\( / {
    if ($0 ~ /\[ -[Ii]nf/) {
        # log probability of -inf marks an out-of-vocabulary word
        oovs ++;
    } else {
        # field 10 is the log10 probability
        sum += $10;
    }

    if ($2 == "</s>") {
        sentences ++;
    } else {
        words ++;
    }
    next;
}

# Per-sentence summary line: count sentences and stop after howmany.
/ ppl= / {
    sents ++;
    if (howmany > 0 && sents == howmany) {
        # exit still runs the END rule, which prints the result once
        exit 0;
    }
    next;
}

{
    next;
}

END {
    result();
}
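
The arithmetic in result() can be checked by hand on a tiny input. The sketch below is only an illustration: the three log lines are made up and merely shaped to match the patterns the script tests for (a leading tab, "p( ", the word in field 2, the log10 probability in field 10), and the helper names sample_log and sample_ppl are hypothetical. It uses plain awk rather than gawk and repeats only the counting and the perplexity formula.

```sh
#!/bin/sh
# Hypothetical excerpt shaped like the per-word lines the script matches.
# The probabilities are invented for illustration.
sample_log() {
    printf '\tp( the | <s> ) \t= [2gram] 0.1 [ -1 ]\n'
    printf '\tp( cat | the ...) \t= [3gram] 0.1 [ -1 ]\n'
    printf '\tp( </s> | cat ...) \t= [3gram] 0.1 [ -1 ]\n'
}

# Same counting and formula as the script:
#   ppl = exp(-sum / (sentences + words - oovs) * ln 10)
sample_ppl() {
    sample_log | awk '
        /^\tp\( / {
            if ($0 ~ /\[ -[Ii]nf/) oovs++; else sum += $10
            if ($2 == "</s>") sentences++; else words++
        }
        END { printf "%.4f\n", exp(-sum / (sentences + words - oovs) * log(10)) }
    '
}

sample_ppl
```

Here sum is -3 over 1 sentence end and 2 words with no OOVs, so the exponent is (3 / 3) * ln 10 and the perplexity is 10. This is the same as 10 raised to the negated average log10 probability per scored token.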
