%/* ----------------------------------------------------------- */
%/*                                                             */
%/*                          ___                                */
%/*                       |_| | |_/   SPEECH                    */
%/*                       | | | | \   RECOGNITION               */
%/*                       =========   SOFTWARE                  */
%/*                                                             */
%/*                                                             */
%/* ----------------------------------------------------------- */
%/*         Copyright: Microsoft Corporation                    */
%/*          1995-2000 Redmond, Washington USA                  */
%/*                    http://www.microsoft.com                */
%/*                                                             */
%/*   Use of this software is governed by a License Agreement   */
%/*    ** See the file License for the Conditions of Use  **    */
%/*    **     This banner notice must not be removed      **    */
%/*                                                             */
%/* ----------------------------------------------------------- */
%
% HTKBook - Steve Young  24/11/97
%
\newpage
\mysect{HCopy}{HCopy}

\subsection{Function}

\index{hcopy@\htool{HCopy}|(}
This program will copy one or more data files to a designated output
file, optionally converting the data into a parameterised form.  While
the source files can be in any supported format, the output format is
always \HTKFF.  By default, the whole of the source file is copied to
the target but options exist to only copy a specified segment.  Hence,
this program is used to convert data files in other formats to the \HTKFF\
format, to concatenate or segment data files, and to parameterise the
result.
If any option is set which leads to the extraction of a segment
of the source file rather than all of it, then segments will be
extracted from all source files and concatenated to the target.

Labels will be copied/concatenated if any of the options indicating
labels are specified (\texttt{-i -l -x -G -I -L -P -X}). In this case, each
source data file must have an associated label file, and a target label
file is created. The name of the target label file is the root name of
the target data file with the extension \texttt{.lab}, unless the \texttt{-X}
option is used.  This new label file will contain the appropriately
copied/truncated/concatenated labels to correspond with the target data
file; all start and end boundaries are recalculated if necessary.

When used in conjunction with \htool{HSLab}, \htool{HCopy} provides a facility for tasks
such as cropping silence surrounding recorded utterances.  Since input
files may be coerced, \htool{HCopy} can also be used to convert the parameter
kind of a file, for example from WAVEFORM to MFCC, depending on the
configuration options. Not all possible conversions can actually be performed;
see Table~\ref{t:validcons} for a list of valid conversions. Conversions must be specified
via a configuration file as described in chapter~\ref{c:speechio}. Note also that the
parameterisation qualifier \texttt{\_N} cannot be used when saving files to disk, and is
meant only for on-the-fly parameterisation.

\subsection{Use}

\htool{HCopy} is invoked by typing the command line
\begin{verbatim}
   HCopy [options] sa1 [ + sa2 + ... ] ta  [ sb1 [ + sb2 + ... ] tb ... ]
\end{verbatim}
This causes the contents of the one or more source files \texttt{sa1}, \texttt{sa2}, \ldots\ to be concatenated
and the result copied to the given target file \texttt{ta}.
To avoid the overhead
of reinvoking the tool when processing large databases, multiple
sources and targets may be specified, for example
\begin{verbatim}
   HCopy srcA.wav + srcB.wav tgtAB.wav srcC.wav tgtD.wav
\end{verbatim}
will create two new files \texttt{tgtAB.wav} and \texttt{tgtD.wav}.
\htool{HCopy} takes file arguments from a script specified using the \texttt{-S} option
exactly as from the command line, except that any newlines are ignored.

The allowable options to \htool{HCopy} are as follows where all times
and durations are given in 100 ns units and
are written as floating-point numbers.
\begin{optlist}
  \ttitem{-a i}  Use level \texttt{i} of associated label files with the \texttt{-n}
    and \texttt{-x} options.  Note that this is not the same as using the
    \texttt{TRANSLEVEL} configuration variable since the \texttt{-a} option
    still allows all levels to be copied through to the output files.
  \ttitem{-e f} End copying from the source file at time \texttt{f}.
    The default is the end of the file. If \texttt{f} is negative or zero,
    it is interpreted as a time relative to the end of the file, while a
    positive value indicates an absolute time from the start of the file.
  \ttitem{-i mlf} Output label files to master file \texttt{mlf}.
  \ttitem{-l s} Output label files to the directory \texttt{s}.
    The default is to output to the current directory.
  \ttitem{-m t} Set a margin of duration \texttt{t} around the
    segments defined by the \texttt{-n} and \texttt{-x} options.
  \ttitem{-n i [j]} Extract the speech segment corresponding to the
    \texttt{i}'th label in the source file.  If \texttt{j} is specified, then the
    segment corresponding to the sequence of labels \texttt{i} to \texttt{j}
    is extracted. Labels are numbered from their position in the
    label file.  A negative index can be used to count from the end
    of the label list.   Thus, \texttt{-n 1 -1} would specify the segment
    starting at the first label and ending at the last.
  \ttitem{-s f} Start copying from the source file at time \texttt{f}.
    The default is 0.0, i.e.\ the beginning of the file.
  \ttitem{-t n} Set the line width to \texttt{n} chars when formatting
    trace output.
  \ttitem{-x s [n]} Extract the speech segment corresponding to the
    first occurrence of label \texttt{s} in the source file.  If \texttt{n}
    is specified, then the \texttt{n}'th occurrence is extracted. If
    multiple files are being concatenated, segments are extracted
    from each file in turn, and the label must exist for each
    concatenated file.
\stdoptF
\stdoptG
\stdoptI
\stdoptL
\stdoptO
\stdoptP
\stdoptX
\end{optlist}
\stdopts{HCopy}

Note that the parameter kind conversion mechanisms described in chapter~\ref{c:speechio}
will be applied to all source files.  In particular, if an automatic
conversion is requested via the configuration file, then \htool{HCopy} will copy
or concatenate the converted source files, not the actual contents.
Similarly, automatic byte swapping may occur depending on the source
format and the configuration variable \texttt{BYTEORDER}. Because the
sampling rate may change during conversions, the options that specify a
position within a file, i.e.\ \texttt{-s} and \texttt{-e},
use absolute times rather than sample index numbers.  All times in \HTK\ are
given in units of 100 ns and
are written as floating-point numbers.
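
As a worked illustration of this unit convention (the symbols
$t_{\mathrm{s}}$ and $t_{\mathrm{HTK}}$ are introduced here purely for
illustration and are not \HTK\ notation), a time of $t_{\mathrm{s}}$ seconds
corresponds to a value
\[
   t_{\mathrm{HTK}} = 10^{7}\, t_{\mathrm{s}},
   \qquad \mbox{since } 1\,\mathrm{s} = 10^{7} \times 100\,\mathrm{ns},
\]
so that, for instance, 25\,ms becomes $0.025 \times 10^{7} = 250000$ units
and 2\,s becomes $2 \times 10^{7} = 20000000$ units.
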
To save writing long strings of zeros, standard exponential notation may be
used, for example \texttt{-s 1E6} indicates a
start time of 0.1 seconds from the beginning of the file.

\begin{center}
\begin{tabular}{|r||cccccccccc|} \cline{2-11}
 \multicolumn{1}{c}{} & \multicolumn{10}{|c|}{\it Outputs } \\
 \cline{2-11} \cline{2-11} \hline
          ~ & ~ & ~ & ~ & L  & ~ & ~ & ~ & ~ & ~ & ~  \\
          ~ & W & ~ & ~ & P  & ~ & ~ & ~ & ~ & ~ & D  \\
          ~ & A & ~ & ~ & C  & ~ & ~ & ~ & M & ~ & I  \\
          ~ & V & ~ & L & E  & I & ~ & ~ & E & ~ & S  \\
          ~ & E & ~ & P & P  & R & ~ & F & L & ~ & C  \\
          ~ & F & ~ & R & S  & E & M & B & S & U & R  \\
          ~ & O & L & E & T  & E & F & A & P & S & E  \\
          ~ & R & P & F & R  & F & C & N & E & E & T  \\
 {\it Inputs} & M & C & C & A  & C & C & K & C & R & E  \\ \hline
 ~WAVEFORM & $\surd$ & $\surd$ & $\surd$ & $\surd$ & $\surd$ & $\surd$ & $\surd$ & $\surd$ & ~ & $\surd$ \\
 ~~~~~~LPC & ~ & $\surd$ & $\surd$ & $\surd$ & $\surd$ & ~ & ~ & ~ & ~ & $\surd$ \\
 ~~~LPREFC & ~ & $\surd$ & $\surd$ & $\surd$ & $\surd$ & ~ & ~ & ~ & ~ & $\surd$ \\
 LPCEPSTRA & ~ & $\surd$ & $\surd$ & $\surd$ & $\surd$ & ~ & ~ & ~ & ~ & $\surd$ \\
 ~~~~IREFC & ~ & $\surd$ & $\surd$ & $\surd$ & $\surd$ & ~ & ~ & ~ & ~ & $\surd$ \\
 ~~~~~MFCC & ~ & ~ & ~ & ~ & ~ & $\surd$ & ~ & ~ & ~ & $\surd$ \\
 ~~~~FBANK & ~ & ~ & ~ & ~ & ~ & $\surd$ & $\surd$ & ~ & ~ & $\surd$ \\
 ~~MELSPEC & ~ & ~ & ~ & ~ & ~ & $\surd$ & $\surd$ & $\surd$ & ~ & $\surd$ \\
 ~~~~~USER & ~ & ~ & ~ & ~ & ~ & ~ & ~ & ~ & $\surd$ & $\surd$ \\
 ~DISCRETE & ~ & ~ & ~ & ~ & ~ & ~ & ~ & ~ & ~ & $\surd$ \\ \hline
\end{tabular}
\tabcap{validcons}{Valid Parameter Conversions}
\end{center}

Note that truncations are performed {\em after\/} any desired coding,
which may result in a loss of time resolution if the target file format
has a lower sampling rate. Also, because of windowing effects,
truncation, coding, and concatenation operations are not necessarily
interchangeable.
If in doubt, perform all truncation/concatenation in
the waveform domain and then perform parameterisation as a last, separate
invocation of \htool{HCopy}.

\subsection{Trace Output}

\htool{HCopy} supports the following trace options where each
trace flag is given using an octal base
\begin{optlist}
  \ttitem{00001}  basic progress reporting.
  \ttitem{00002}  source and target file formats and parameter kinds.
  \ttitem{00004}  segment boundaries computed from label files.
  \ttitem{00010}  display memory usage after processing each file.
\end{optlist}
Trace flags are set using the \texttt{-T} option or the \texttt{TRACE}
configuration variable.
\index{hcopy@\htool{HCopy}|)}

%%% Local Variables:
%%% mode: latex
%%% TeX-master: "../htkbook"
%%% End:
