📄 julian.txt

📁 about sound recognition.i want to downlod
💻 TXT
📖 第 1 页 / 共 3 页
字号:
              frequency cut-off.
              (default: -1 = disabled)

       -sscalc
              Perform spectral subtraction using head part of each file.  With
              this option, Julius assume there are certain length  of  silence
              at  each  input  file.   Valid only for rawfile input.  Conflict
              with "-ssload".

       -sscalclen
              With "-sscalc", specify the length of head part silence in  mil-
              liseconds (default: 300)

       -ssload filename
              Perform  spectral  subtraction  for speech input using pre-esti-
              mated noise spectrum from file.  The noise spectrum data  should
              be  computed  beforehand  by  mkss.  Valid for all speech input.
              Conflict with "-sscalc".

       -ssalpha value
              Alpha coefficient of spectral subtraction.  Noise will  be  sub-
              tracted  stronger  as  this value gets larger, but distortion of
              the resulting signal also becomes remarkable.  (default: 2.0)

       -ssfloor value
              Flooring coefficient  of  spectral  subtraction.   The  spectral
              parameters  that go under zero after subtraction will be substi-
              tuted by the source signal  with  this  coefficient  multiplied.
              (default: 0.5)

   GMM-based Input Verification and Rejection
       -gmm filename
              GMM definition file in HTK format. If specified, GMM-based input
              verification will be performed concurrently with the  1st  pass,
              and  you  can reject the input according to the result as speci-
              fied by "-gmmreject".  Note that the GMM should  be  defined  as
              one-state  HMMs, and their training parameter should be the same
              as the acoustic model you want to use with.

       -gmmnum N
              Number of Gaussian components to be computed per  frame  on  GMM
              calculation.   Only  the  N-best  Gaussians will be computed for
              rapid calculation.  The default is  10  and  specifying  smaller
              value  will  speed up GMM calculation, but too small value (1 or
              2) may cause degradation of identification performance.

       -gmmreject string
              Comma-separated list of GMM names  to  be  rejected  as  invalid
              input.   When  recognition,  the log likelihoods of GMMs accumu-
              lated for the entire input will be  computed  concurrently  with
              the  1st  pass.   If the GMM name of the maximum score is within
              this string, the 2nd pass will not be  executed  and  the  input
              will be rejected.

   Language Model (Finite State Grammar)
       The  recognition  grammar  can  be  specified  in  three ways: "-gram",
       "-gramlist" or combination of "-dfa" and "-v".
       Multiple grammars can be specified by using  "-gram"  and  "-gramlist".
       When  you  use these options several times, all of them will be read at
       startup.  Note that this is a different  behavior  from  other  options
       (last  one override previous ones).  You can use "-nogram" to reset the
       already specified grammars at that point.

       -gram gramprefix1[,gramprefix2[,gramprefix3,...]]
              Comma-separated list of  grammars  to  be  used.   the  argument
              should  be  prefix  of a grammar, i.e. if you have "foo.dfa" and
              "foo.dict", you can specify them by single argument "foo".  Mul-
              tiple grammars can be specified as comma-separated list.

       -gramlist listfile
              Specify a grammar list file that contains list of grammars to be
              used.  The list file should contain  the  prefixs  of  grammars,
              each per line.  A relative path in the list file will be treated
              as relative to the list file, not the current path or configura-
              tion file.

       -dfa dfa_filename
              Finite state automaton grammar file.

       -v dictionary_file
              Word dictionary file (required)

       -nogram
              Remove  the  current  list  of grammars already specified by the
              options above.

       -penalty1 float
              Word insertion penalty for the first pass. (default: 0.0)

       -penalty2 float
              Word insertion penalty for the second pass. (default: 0.0)

       -spmodel {WORD|WORD[OUTSYM]|#num}
              Name of short pause model as defined in the hmmdefs.  In Julian,
              a  word  whose  pronunciation  consists of only this short pause
              model is called 'short pause word', and  handled  especially  in
              recognition:  even if its appearance in a sentence is explicitly
              specified in the grammar, it can be skipped while parsing.  This
              behavior  is  for  dealing  with insertion and deletion of short
              pause that often  appear  unintensionally  in  user  utterances.
              They can be specified in a style as shown below (default: "sp").


                                       Example
           Word_name                     <s>
           Word_name[output_symbol]   <s>[silB]
           #Word_ID                      #14

            (Word_ID is the word position in the dictionary
             file starting from 0)

       -forcedict
              Ignore dictionary errors and force running.  Words  with  errors
              will be dropped from dictionary at startup.

   Acoustic Model (HMM)
       -h hmmfilename
              HMM  definition file to use. Format (ascii/binary) will be auto-
              matically detected. (required)

       -hlist HMMlistfilename
              HMMList file to use.  Required when using triphone  based  HMMs.
              This file provides a mapping between the logical triphones names
              genertated from the phonetic representation  in  the  dictionary
              and the HMM definition names.

       -iwcd1 {best N|max|avg}
              When  using a triphone model, select method to handle inter-word
              triphone context on the first and last phone of a  word  in  the
              first pass.

              best N: use average likelihood of N-best scores from the same
                      context triphones
              max: use maximum likelihood of the same
                   context triphones
              avg: use average likelihood of the same
                   context triphones (default)

       -force_ccd / -no_ccd
              Normally  Julius determines whether the specified acoustic model
              is a context-dependent model from the model names, i.e., whether
              the  model names contain character '+' and '-'.  You can explic-
              itly specify by these options  to  avoid  mis-detection.   These
              will override the automatic detection result.

       -notypecheck
              Disable checking of the input parameter type. (default: enabled)

   Acoustic Computation
       Gaussian Pruning will be automatically enabled when using  tied-mixture
       based  acoutic  model.   It is disabled by default for non tied-mixture
       models, but you can activate pruning  to  those  models  by  explicitly
       specifying  "-gprune".  Gaussian Selection needs a monophone model con-
       verted by mkgshmm.

       -gprune {safe|heuristic|beam|none}
              Set the Gaussian pruning technique to use.
              (default: 'safe' (setup=standard), 'beam' (setup=fast) for  tied
              mixture model, 'none' for non tied-mixture model)

       -tmix K
              With  Gaussian  Pruning, specify the number of Gaussians to com-
              pute per mixture codebook. Small value will  speed  up  computa-
              tion, but likelihood error will grow larger. (default: 2)

       -gshmm hmmdefs
              Specify  monophone hmmdefs to use for Gaussian Mixture Selectio.
              Monophone model for GMS is generated from an ordinary  monophone
              HMM  model  using  mkgshmm.  This option is disabled by default.
              (no GMS applied)

       -gsnum N
              When using GMS, specify number of monophone state to select from
              whole monophone states. (default: 24)

   Inter-word Short Pause Handling
       -iwsp  (Multi-path  version  only) Enable inter-word context-free short
              pause handling.  This option appends  a  skippable  short  pause
              model  for  every  word end.  The added model will be skipped on
              inter-word context handling.  The HMM model to be  appended  can
              be specified by "-spmodel" option.

   Search Parameters (First Pass)
       -b beamwidth
              Beam  width (number of HMM nodes) on the first pass.  This value
              defines search width on the 1st pass, and has  great  effect  on
              the  total  processing  time.   Smaller  width will speed up the
              decoding, but too small  value  will  result  in  a  substantial
              increase  of  recognition  errors due to search failure.  Larger
              value will make the search stable and will lead to  failure-free
              search,  but  processing time and memory usage will grow in pro-
              portion to the width.

              default value: acoustic model dependent
                400 (monophone)
                800 (triphone,PTM)
               1000 (triphone,PTM, setup=v2.1)

       -1pass Only perform the first pass search.

       -realtime

       -norealtime
              Explicitly specify whether real-time (pipeline) processing  will
              be  done  in the first pass or not.  For file input, the default
              is OFF  (-norealtime),  for  microphone,  adinnet  and  NetAudio
              input,  the  default  is ON (-realtime).  This option relates to
              the way CMN is performed: when OFF, CMN is calculated  for  each
              input using cepstral mean of the whole input.  When the realtime
              option is ON, MAP-CMN will be performed.  When MAP-CMN, the cep-
              stral  mean  of  last 5 seconds are used as the initial cepstral
              mean at the beginning of each input.  Also refer to  "-progout".

       -cmnsave filename
              Save last CMN parameters computed while recognition to the spec-
              ified file.  The parameters will be saved to the  file  in  each
              time  a input is recognized, so the output file always keeps the
              last CMN parameters.  If output file already exist, it  will  be
              overridden.

       -cmnload filename
              Load initial CMN parameters previously saved in a file by "-cmn-
              save".   Loading  an  initial  CMN  enables  Julius  to   better
              recognize  the  first utterance on a microphone / network input.
              Also see "-cmnnoupdate".

       -cmnmapweight
              Specify weight of initial cepstral mean at the beginning of each
              utterance  for microphone / network input.  Specify larger value
              to retain the initial cepstral mean for  a  longer  period,  and
              smaller  value  to  rely  more  on the current input.  (default:
              100.0)

       -cmnnoupdate
              When microphone / network input, this option makes engine not to
              update  the  cepstral mean at each input and force engine to use
              the initial cepstral mean given by "-cmnload" parmanently.

   Search Parameters (Second Pass)
       -b2 hyponum
              Beam width (number of hypothesis) in second pass.  If the  count
              of word expantion at a certain length of hypothesis reaches this
              limit while search, shorter hypotheses are not expanded further.
              This prevents search to fall in breadth-first-like status stack-
              ing on the same position, and improve search failure.  (default:
              30)

       -n candidatenum
              The  search  continues  till 'candidate_num' sentence hypotheses
              have been found.  The obtained sentence hypotheses are sorted by
              score,  and final result is displayed in the order (see also the
              "-output" option).
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -