readme.txt.svn-base

来自「moses开源的机器翻译系统」· SVN-BASE 代码 · 共 41 行

SVN-BASE
41
字号
Re-implementation of Johnson et al. (2007)'s phrasetable filtering strategy.This implementation relies on Joy Zhang's SALM Suffix Array toolkit. It isavailable here:  http://projectile.is.cs.cmu.edu/research/public/tools/salm/salm.htm--Chris Dyer <redpony@umd.edu>BUILD INSTRUCTIONS---------------------------------1. Download and build SALM.2. make SALMDIR=/path/to/SALMUSAGE INSTRUCTIONS---------------------------------1. Using the SALM/Bin/Linux/Index/IndexSA.O32, create a suffix array index   of the source and target sides of your training bitext.2. cat phrase-table.txt | ./filter-pt -e TARG.suffix -f SOURCE.suffix \    -l <FILTER-VALUE>   FILTER-VALUE is the -log prob threshold described in Johnson et al.     (2007)'s paper.  It may be either 'a+e', 'a-e', or a positive real     value.3. Run with no options to see more use-cases.REFERENCES---------------------------------H. Johnson, J. Martin, G. Foster and R. Kuhn. (2007) Improving Translation  Quality by Discarding Most of the Phrasetable. In Proceedings of the 2007  Joint Conference on Empirical Methods in Natural Language Processing and  Computational Natural Language Learning (EMNLP-CoNLL), pp. 967-975.

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?