⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 hslab.tex

📁 该压缩包为最新版htk的源代码,htk是现在比较流行的语音处理软件,请有兴趣的朋友下载使用
💻 TEX
字号:
%/* ----------------------------------------------------------- */%/*                                                             */%/*                          ___                                */%/*                       |_| | |_/   SPEECH                    */%/*                       | | | | \   RECOGNITION               */%/*                       =========   SOFTWARE                  */ %/*                                                             */%/*                                                             */%/* ----------------------------------------------------------- */%/*         Copyright: Microsoft Corporation                    */%/*          1995-2000 Redmond, Washington USA                  */%/*                    http://www.microsoft.com                */%/*                                                             */%/*   Use of this software is governed by a License Agreement   */%/*    ** See the file License for the Conditions of Use  **    */%/*    **     This banner notice must not be removed      **    */%/*                                                             */%/* ----------------------------------------------------------- */%% HTKBook - Steve Young  30/10/95%\newpage\mysect{HSLab}{HSLab}\newcommand{\hbut}[1]{\texttt{#1}}\newcommand{\hbbut}[1]{\fbox{\texttt{#1}}}\mysubsect{Function}{HSLab-Function}\index{hslab@\htool{HSLab}|(}\htool{HSLab} is an interactive label editor for manipulating speech label files. An example of using \htool{HSLab} would be to load a sampled waveform file, determine the boundaries of the speech units of interest and assign labels to them. Alternatively, an existing label file can be loaded and edited bychanging current label boundaries, deleting and creating new labels.\htool{HSLab} is the only tool in the HTK package which makes use of the graphics library \htool{HGraf}.When started \htool{HSLab} displays a window which is split into two parts: adisplay section and a control section (see Fig~\href{f:hslab}). The display section contains the plotted speech waveform with the associated labels. The control section consists of a palette of buttons which are used to invoke the various facilities available in the tool. The buttons are laid out into three different groups depending on the function they perform. Group one (top row) contains buttons related to basic input/output commands. Group two (middle row) implements the viewing and record/playback functions. The buttons in group three (bottom row) are used for labelling. To invoke a particular function, place the mouse pointer onto the corresponding button and click once. All commands which require further interaction with the user after invocation will display guiding text in the message area telling the user what he or she is expected to do next. For example, to delete a label, the user will click on \hbut{Delete}, the message ``Please select label to delete'' will appear in the message area and the user will be expected to click in that part of the display section corresponding to the label to be deleted (not on the label itself).A {\em marked region} is a slice of the waveform currently visible in the window. A region is marked by clicking on \hbut{Mark} and specifying two boundaries by clicking in the display section. When marked, a region will be displayed in inverse colours. In the presence of a marked region the commands \hbut{Play}, \hbut{Label} and \hbut{Label as} will be applied to the specified region rather than to the whole of the waveform visible on the screen. Part of the waveform can also be made into a marked region with the commands \hbut{Zoom Out} and \hbut{Select}. \hbut{Zoom Out} will take the user back to the previous level of magnification and the waveform being displayed before the execution of the command will become a marked region. \hbut{Select} will make the part of the waveform corresponding to a particular label into a marked region. This can be useful for playing back existing labels. Labelling is carried out with \hbut{Label } and \hbut{Label as}. \hbut{Label}will assign {\em The Current Label} to a specified slice of the waveform,whilst \hbut{Label as} will prompt the user to type-in the labelling string. {\em The Current Label} is shown in the button in the bottom right corner of the control section.  It defaults to ``Speech'' and it can be changed by clicking on the button it resides in. Multiple alternative transcriptions are manipulated using the \hbut{Set [?]} and \hbut{New} buttons. The former is used to select the desired transcription, the latter is used to create a new alternative transcription.\centrefig{hslab}{120}{HSLab display window}\mysubsect{Use}{HSLab-Use}\htool{HSLab} is invoked by typing the command line\begin{verbatim}   HSLab [options] dataFile\end{verbatim}where \texttt{dataFile} is a data file in any of the supported formats with a \texttt{WAVEFORM} sample kind.   If the given data file does not exist, then\htool{HSLab} will assume that a new file is to be recorded with this name.The available options for \htool{HSLab} are\begin{optlist}  \ttitem{-a} With this switch present, the numeric part of the global labelling  string is automatically incremented after every \texttt{Label} operation.  \ttitem{-i file} This option allows transcription files to be output to the  named master label file (MLF).   \ttitem{-n} Normally \htool{HSLab} expects to load an existing label file  whose name is derived from the speech data file. This option tells   \htool{HSLab} that a new empty transcription is to be created for the loaded   data-file.  \ttitem{-s string} This option allows the user to set the string displayed  in the ``command'' button used to trigger external commands.\stdoptF\stdoptG\stdoptI\stdoptL\stdoptX\end{optlist}\stdopts{HSLab}The following is a summary of the function of each \htool{HSLab}button.\begin{optlist}  \ttitem{\hbbut{Load}} Load a speech data file and associated transcription.      If changes have been made to the currently loaded transcription       since it was last saved the user will be prompted to save these       changes before loading the new file.  \ttitem{\hbbut{Save}} Save changes made to the transcription into a specified      label file.  \ttitem{\hbbut{About}} Print information about HSLab.  \ttitem{\hbbut{Quit}} Exit from \htool{HSLab}. If alterations have been made       to the currently loaded transcription since it was last saved, the user       will be prompted to save these changes before exiting.  \ttitem{\hbbut{Command}} This button is used to trigger an external command which       can process the waveform file currently loaded in \htool{HSLab}. This       is accomplished by setting the environment variable \texttt{HSLABCMD} to the       shell command required to perform the processing. When the \texttt{Command}      button is pressed, any occurrence of \texttt{\$} in the shell command is       replaced by the name of the currently loaded waveform file. Note that       only the filename without its extension is substituted. The string       displayed in the ``command'' button can be changed using the \texttt{-s}       option.  \ttitem{\hbbut{Mark}} Mark a region of the displayed waveform. The user       is prompted to specify the start and the end point of a region with      the mouse pointer. The marked region will be displayed in inverse       colours. Only one region can be marked at a time.  \ttitem{\hbbut{Unmark}} Unmark a previously marked region.  \ttitem{\hbbut{<--}} Scroll the display to the left.  \ttitem{\hbbut{-->}} Scroll the display to the right.  \ttitem{\hbbut{Z.In}} Zoom into a part of the displayed waveform. If there is       a currently marked region then that region will be zoomed into,       otherwise, the user will be prompted to select a slice of the       waveform by specifying two points using the mouse pointer.  \ttitem{\hbbut{Z.Out}} Restore the previous viewing level.    \ttitem{\hbbut{Restore}} Display the complete waveform into the window. Any       marked regions will be unmarked.    \ttitem{\hbbut{Play}} If there is a marked region of the waveform then      that portion of the signal will be played through the internal speaker.      Otherwise, the command will apply to the waveform visible on the       screen.  \ttitem{\hbbut{Rec}} This initiates recording from the audio input device. The maximum      duration of a recording is limited to 2 mins at 16KHz sampling rate. Two bar-graphs       are displayed: the first (red) shows the number of samples recorded, the second      bar (green) displays the energy of the incoming signal. Once pressed, the      \texttt{Rec} button changes into \texttt{Stop} which, in turn, is used to terminate      the operation. When finished, the audio data stored in the buffer is written      out to disk. Each recording is stored in alternating files       \texttt{dataFile\_0} and \texttt{dataFile\_1}.  \ttitem{\hbbut{Pause}} Clicking on this button pauses/un-pauses the recording operation.  \ttitem{\hbbut{Volume}} This button is used to select the playback volume of the      audio device.  \ttitem{\hbbut{x1}} This button selects the current level of waveform magnification.      The available factors are $\times 1$,  $\times 2$,  $\times 4$,  $\times 8$,        $\times 16$, and  $\times 32$.  \ttitem{\hbbut{Label}} If a marked region exists, then the waveform contained in the       region will be labelled with {\em The Current Label}. Otherwise, the       command will be applied to the waveform visible on the screen.  \ttitem{\hbbut{Labelas}} Same as above, however, the user is prompted to type in       the labelling string.  \ttitem{\hbbut{Delete}} Delete a label.  \ttitem{\hbbut{Edit}} Edit the string of a label.  \ttitem{\hbbut{Select}} Select a label as a marked region.  \ttitem{\hbbut{Adjust}} Adjust the boundaries of a label.  To select the label       boundary to adjust, click in the display near to the label boundary.  \ttitem{\hbbut{Set [?]}} This button is used to select the current alternative       transcription displayed and used in \htool{HSLab}.  \ttitem{\hbbut{New}} Creates a new alternative transcription. If an empty alternative       transcription already exists, then a new transcription is not created.  \ttitem{\hbbut{Undo}} Single level undo operation for labelling commands.  \ttitem{\hbbut{Speech}} Change the current labelling string (the button in the   bottom right of the control area).\end{optlist}The following ``mouse'' shortcuts are provided. To mark a region position the pointer atone of the desired boundaries, then press the left mouse button and while holding it downposition the pointer at the other region boundary. Upon releasing the mouse button the marked region will be hilighted. To play a label position the mouse cursor anywhere withinthe corresponding label ``slice'' in the label area of the display and click the leftmouse button.\mysubsect{Tracing}{HSLab-Tracing}\htool{HSLab} does not provide any trace options.\index{hslab@\htool{HSLab}|)}%%% Local Variables: %%% mode: latex%%% TeX-master: "../htkbook"%%% End: 

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -