dropize_phrase_table.pl.svn-base

来自「moses开源的机器翻译系统」· SVN-BASE 代码 · 共 30 行

SVN-BASE
30
字号
#!/usr/bin/perl#add_empties_to_phrase_table: go through an old-style pharaoh phrase table (no empty target sources) and add one such line for each single-word source phrase in the table,#complete with factors (note the number and type of factors are hardcoded here);#also add deletion-cost factors as necessary to all lines#Evan Herbst 7 / 11 / 06#usage: aetpt INPUT_PTABLE OUTPUT_PTABLEmy ($inputFile, $outputFile) = @ARGV;my $DELETIONCOST = 2.718; #weight for an individual deletionopen(INFILE, "<$inputFile") or die "couldn't open '$inputFile' for read\n";open(OUTFILE, ">$outputFile") or die "couldn't open '$outputFile' for write\n";my ($lastSrcPhrase, $srcPhrase);while(my $line = <INFILE>){	chop($line);	$lastSrcPhrase = $srcPhrase;	my @tokens = split(/\|\|\|/, $line);	$srcPhrase = $tokens[0];	if($srcPhrase ne $lastSrcPhrase && $srcPhrase =~ /^\s*\S+\s*$/) #new source phrase of a single word; add deletion line	{		print OUTFILE "$srcPhrase |||  ||| 1 1 1 1 2.718 $DELETIONCOST\n";	}	print OUTFILE "$line 1\n";}close(INFILE);close(OUTFILE);

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?