📄 julian.txt
字号:
frequency cut-off.
(default: -1 = disabled)
-sscalc
Perform spectral subtraction using head part of each file. With
this option, Julius assume there are certain length of silence
at each input file. Valid only for rawfile input. Conflict
with "-ssload".
-sscalclen
With "-sscalc", specify the length of head part silence in mil-
liseconds (default: 300)
-ssload filename
Perform spectral subtraction for speech input using pre-esti-
mated noise spectrum from file. The noise spectrum data should
be computed beforehand by mkss. Valid for all speech input.
Conflict with "-sscalc".
-ssalpha value
Alpha coefficient of spectral subtraction. Noise will be sub-
tracted stronger as this value gets larger, but distortion of
the resulting signal also becomes remarkable. (default: 2.0)
-ssfloor value
Flooring coefficient of spectral subtraction. The spectral
parameters that go under zero after subtraction will be substi-
tuted by the source signal with this coefficient multiplied.
(default: 0.5)
GMM-based Input Verification and Rejection
-gmm filename
GMM definition file in HTK format. If specified, GMM-based input
verification will be performed concurrently with the 1st pass,
and you can reject the input according to the result as speci-
fied by "-gmmreject". Note that the GMM should be defined as
one-state HMMs, and their training parameter should be the same
as the acoustic model you want to use with.
-gmmnum N
Number of Gaussian components to be computed per frame on GMM
calculation. Only the N-best Gaussians will be computed for
rapid calculation. The default is 10 and specifying smaller
value will speed up GMM calculation, but too small value (1 or
2) may cause degradation of identification performance.
-gmmreject string
Comma-separated list of GMM names to be rejected as invalid
input. When recognition, the log likelihoods of GMMs accumu-
lated for the entire input will be computed concurrently with
the 1st pass. If the GMM name of the maximum score is within
this string, the 2nd pass will not be executed and the input
will be rejected.
Language Model (Finite State Grammar)
The recognition grammar can be specified in three ways: "-gram",
"-gramlist" or combination of "-dfa" and "-v".
Multiple grammars can be specified by using "-gram" and "-gramlist".
When you use these options several times, all of them will be read at
startup. Note that this is a different behavior from other options
(last one override previous ones). You can use "-nogram" to reset the
already specified grammars at that point.
-gram gramprefix1[,gramprefix2[,gramprefix3,...]]
Comma-separated list of grammars to be used. the argument
should be prefix of a grammar, i.e. if you have "foo.dfa" and
"foo.dict", you can specify them by single argument "foo". Mul-
tiple grammars can be specified as comma-separated list.
-gramlist listfile
Specify a grammar list file that contains list of grammars to be
used. The list file should contain the prefixs of grammars,
each per line. A relative path in the list file will be treated
as relative to the list file, not the current path or configura-
tion file.
-dfa dfa_filename
Finite state automaton grammar file.
-v dictionary_file
Word dictionary file (required)
-nogram
Remove the current list of grammars already specified by the
options above.
-penalty1 float
Word insertion penalty for the first pass. (default: 0.0)
-penalty2 float
Word insertion penalty for the second pass. (default: 0.0)
-spmodel {WORD|WORD[OUTSYM]|#num}
Name of short pause model as defined in the hmmdefs. In Julian,
a word whose pronunciation consists of only this short pause
model is called 'short pause word', and handled especially in
recognition: even if its appearance in a sentence is explicitly
specified in the grammar, it can be skipped while parsing. This
behavior is for dealing with insertion and deletion of short
pause that often appear unintensionally in user utterances.
They can be specified in a style as shown below (default: "sp").
Example
Word_name <s>
Word_name[output_symbol] <s>[silB]
#Word_ID #14
(Word_ID is the word position in the dictionary
file starting from 0)
-forcedict
Ignore dictionary errors and force running. Words with errors
will be dropped from dictionary at startup.
Acoustic Model (HMM)
-h hmmfilename
HMM definition file to use. Format (ascii/binary) will be auto-
matically detected. (required)
-hlist HMMlistfilename
HMMList file to use. Required when using triphone based HMMs.
This file provides a mapping between the logical triphones names
genertated from the phonetic representation in the dictionary
and the HMM definition names.
-iwcd1 {best N|max|avg}
When using a triphone model, select method to handle inter-word
triphone context on the first and last phone of a word in the
first pass.
best N: use average likelihood of N-best scores from the same
context triphones
max: use maximum likelihood of the same
context triphones
avg: use average likelihood of the same
context triphones (default)
-force_ccd / -no_ccd
Normally Julius determines whether the specified acoustic model
is a context-dependent model from the model names, i.e., whether
the model names contain character '+' and '-'. You can explic-
itly specify by these options to avoid mis-detection. These
will override the automatic detection result.
-notypecheck
Disable checking of the input parameter type. (default: enabled)
Acoustic Computation
Gaussian Pruning will be automatically enabled when using tied-mixture
based acoutic model. It is disabled by default for non tied-mixture
models, but you can activate pruning to those models by explicitly
specifying "-gprune". Gaussian Selection needs a monophone model con-
verted by mkgshmm.
-gprune {safe|heuristic|beam|none}
Set the Gaussian pruning technique to use.
(default: 'safe' (setup=standard), 'beam' (setup=fast) for tied
mixture model, 'none' for non tied-mixture model)
-tmix K
With Gaussian Pruning, specify the number of Gaussians to com-
pute per mixture codebook. Small value will speed up computa-
tion, but likelihood error will grow larger. (default: 2)
-gshmm hmmdefs
Specify monophone hmmdefs to use for Gaussian Mixture Selectio.
Monophone model for GMS is generated from an ordinary monophone
HMM model using mkgshmm. This option is disabled by default.
(no GMS applied)
-gsnum N
When using GMS, specify number of monophone state to select from
whole monophone states. (default: 24)
Inter-word Short Pause Handling
-iwsp (Multi-path version only) Enable inter-word context-free short
pause handling. This option appends a skippable short pause
model for every word end. The added model will be skipped on
inter-word context handling. The HMM model to be appended can
be specified by "-spmodel" option.
Search Parameters (First Pass)
-b beamwidth
Beam width (number of HMM nodes) on the first pass. This value
defines search width on the 1st pass, and has great effect on
the total processing time. Smaller width will speed up the
decoding, but too small value will result in a substantial
increase of recognition errors due to search failure. Larger
value will make the search stable and will lead to failure-free
search, but processing time and memory usage will grow in pro-
portion to the width.
default value: acoustic model dependent
400 (monophone)
800 (triphone,PTM)
1000 (triphone,PTM, setup=v2.1)
-1pass Only perform the first pass search.
-realtime
-norealtime
Explicitly specify whether real-time (pipeline) processing will
be done in the first pass or not. For file input, the default
is OFF (-norealtime), for microphone, adinnet and NetAudio
input, the default is ON (-realtime). This option relates to
the way CMN is performed: when OFF, CMN is calculated for each
input using cepstral mean of the whole input. When the realtime
option is ON, MAP-CMN will be performed. When MAP-CMN, the cep-
stral mean of last 5 seconds are used as the initial cepstral
mean at the beginning of each input. Also refer to "-progout".
-cmnsave filename
Save last CMN parameters computed while recognition to the spec-
ified file. The parameters will be saved to the file in each
time a input is recognized, so the output file always keeps the
last CMN parameters. If output file already exist, it will be
overridden.
-cmnload filename
Load initial CMN parameters previously saved in a file by "-cmn-
save". Loading an initial CMN enables Julius to better
recognize the first utterance on a microphone / network input.
Also see "-cmnnoupdate".
-cmnmapweight
Specify weight of initial cepstral mean at the beginning of each
utterance for microphone / network input. Specify larger value
to retain the initial cepstral mean for a longer period, and
smaller value to rely more on the current input. (default:
100.0)
-cmnnoupdate
When microphone / network input, this option makes engine not to
update the cepstral mean at each input and force engine to use
the initial cepstral mean given by "-cmnload" parmanently.
Search Parameters (Second Pass)
-b2 hyponum
Beam width (number of hypothesis) in second pass. If the count
of word expantion at a certain length of hypothesis reaches this
limit while search, shorter hypotheses are not expanded further.
This prevents search to fall in breadth-first-like status stack-
ing on the same position, and improve search failure. (default:
30)
-n candidatenum
The search continues till 'candidate_num' sentence hypotheses
have been found. The obtained sentence hypotheses are sorted by
score, and final result is displayed in the order (see also the
"-output" option).
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -