⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 htkbook.out

📁 该压缩包为最新版htk的源代码,htk是现在比较流行的语音处理软件,请有兴趣的朋友下载使用
💻 OUT
📖 第 1 页 / 共 2 页
字号:
\BOOKMARK [-1][-]{part.1}{I Tutorial Overview}{}\BOOKMARK [0][-]{chapter.1}{The Fundamentals of HTK}{part.1}\BOOKMARK [1][-]{section.1.1}{General Principles of HMMs}{chapter.1}\BOOKMARK [1][-]{section.1.2}{Isolated Word Recognition}{chapter.1}\BOOKMARK [1][-]{section.1.3}{Output Probability Specification}{chapter.1}\BOOKMARK [1][-]{section.1.4}{Baum-Welch Re-Estimation}{chapter.1}\BOOKMARK [1][-]{section.1.5}{Recognition and Viterbi Decoding}{chapter.1}\BOOKMARK [1][-]{section.1.6}{Continuous Speech Recognition}{chapter.1}\BOOKMARK [1][-]{section.1.7}{Speaker Adaptation}{chapter.1}\BOOKMARK [0][-]{chapter.2}{An Overview of the HTK Toolkit}{part.1}\BOOKMARK [1][-]{section.2.1}{HTK Software Architecture}{chapter.2}\BOOKMARK [1][-]{section.2.2}{Generic Properties of a HTK Tool}{chapter.2}\BOOKMARK [1][-]{section.2.3}{The Toolkit}{chapter.2}\BOOKMARK [2][-]{subsection.2.3.1}{Data Preparation Tools}{section.2.3}\BOOKMARK [2][-]{subsection.2.3.2}{Training Tools}{section.2.3}\BOOKMARK [2][-]{subsection.2.3.3}{Recognition Tools}{section.2.3}\BOOKMARK [2][-]{subsection.2.3.4}{Analysis Tool}{section.2.3}\BOOKMARK [1][-]{section.2.4}{Whats New In Version 3.2}{chapter.2}\BOOKMARK [2][-]{subsection.2.4.1}{New In Version 3.1}{section.2.4}\BOOKMARK [2][-]{subsection.2.4.2}{New In Version 2.2}{section.2.4}\BOOKMARK [2][-]{subsection.2.4.3}{Features Added To Version 2.1}{section.2.4}\BOOKMARK [0][-]{chapter.3}{A Tutorial Example of Using HTK}{part.1}\BOOKMARK [1][-]{section.3.1}{Data Preparation}{chapter.3}\BOOKMARK [2][-]{subsection.3.1.1}{Step 1 - the Task Grammar}{section.3.1}\BOOKMARK [2][-]{subsection.3.1.2}{Step 2 - the Dictionary}{section.3.1}\BOOKMARK [2][-]{subsection.3.1.3}{Step 3 - Recording the Data}{section.3.1}\BOOKMARK [2][-]{subsection.3.1.4}{Step 4 - Creating the Transcription Files}{section.3.1}\BOOKMARK [2][-]{subsection.3.1.5}{Step 5 - Coding the Data}{section.3.1}\BOOKMARK [1][-]{section.3.2}{Creating Monophone HMMs}{chapter.3}\BOOKMARK [2][-]{subsection.3.2.1}{Step 6 - Creating Flat Start Monophones}{section.3.2}\BOOKMARK [2][-]{subsection.3.2.2}{Step 7 - Fixing the Silence Models}{section.3.2}\BOOKMARK [2][-]{subsection.3.2.3}{Step 8 - Realigning the Training Data}{section.3.2}\BOOKMARK [1][-]{section.3.3}{Creating Tied-State Triphones}{chapter.3}\BOOKMARK [2][-]{subsection.3.3.1}{Step 9 - Making Triphones from Monophones}{section.3.3}\BOOKMARK [2][-]{subsection.3.3.2}{Step 10 - Making Tied-State Triphones}{section.3.3}\BOOKMARK [1][-]{section.3.4}{Recogniser Evaluation}{chapter.3}\BOOKMARK [2][-]{subsection.3.4.1}{Step 11 - Recognising the Test Data}{section.3.4}\BOOKMARK [1][-]{section.3.5}{Running the Recogniser Live}{chapter.3}\BOOKMARK [1][-]{section.3.6}{Summary}{chapter.3}\BOOKMARK [-1][-]{part.2}{II HTK in Depth}{}\BOOKMARK [0][-]{chapter.4}{The Operating Environment}{part.2}\BOOKMARK [1][-]{section.4.1}{The Command Line}{chapter.4}\BOOKMARK [1][-]{section.4.2}{Script Files}{chapter.4}\BOOKMARK [1][-]{section.4.3}{Configuration Files}{chapter.4}\BOOKMARK [1][-]{section.4.4}{Standard Options}{chapter.4}\BOOKMARK [1][-]{section.4.5}{Error Reporting}{chapter.4}\BOOKMARK [1][-]{section.4.6}{Strings and Names}{chapter.4}\BOOKMARK [1][-]{section.4.7}{Memory Management}{chapter.4}\BOOKMARK [1][-]{section.4.8}{Input/Output via Pipes and Networks}{chapter.4}\BOOKMARK [1][-]{section.4.9}{Byte-swapping of HTK data files}{chapter.4}\BOOKMARK [1][-]{section.4.10}{Summary}{chapter.4}\BOOKMARK [0][-]{chapter.5}{Speech Input/Output}{part.2}\BOOKMARK [1][-]{section.5.1}{General Mechanism}{chapter.5}\BOOKMARK [1][-]{section.5.2}{Speech Signal Processing}{chapter.5}\BOOKMARK [1][-]{section.5.3}{Linear Prediction Analysis}{chapter.5}\BOOKMARK [1][-]{section.5.4}{Filterbank Analysis}{chapter.5}\BOOKMARK [1][-]{section.5.5}{Vocal Tract Length Normalisation}{chapter.5}\BOOKMARK [1][-]{section.5.6}{Cepstral Features}{chapter.5}\BOOKMARK [1][-]{section.5.7}{Perceptual Linear Prediction}{chapter.5}\BOOKMARK [1][-]{section.5.8}{Energy Measures}{chapter.5}\BOOKMARK [1][-]{section.5.9}{Delta, Acceleration and Third Differential Coefficients}{chapter.5}\BOOKMARK [1][-]{section.5.10}{Storage of Parameter Files}{chapter.5}\BOOKMARK [2][-]{subsection.5.10.1}{HTK Format Parameter Files}{section.5.10}\BOOKMARK [2][-]{subsection.5.10.2}{Esignal Format Parameter Files}{section.5.10}\BOOKMARK [1][-]{section.5.11}{Waveform File Formats}{chapter.5}\BOOKMARK [2][-]{subsection.5.11.1}{HTK File Format}{section.5.11}\BOOKMARK [2][-]{subsection.5.11.2}{Esignal File Format}{section.5.11}\BOOKMARK [2][-]{subsection.5.11.3}{TIMIT File Format}{section.5.11}\BOOKMARK [2][-]{subsection.5.11.4}{NIST File Format}{section.5.11}\BOOKMARK [2][-]{subsection.5.11.5}{SCRIBE File Format}{section.5.11}\BOOKMARK [2][-]{subsection.5.11.6}{SDES1 File Format}{section.5.11}\BOOKMARK [2][-]{subsection.5.11.7}{AIFF File Format}{section.5.11}\BOOKMARK [2][-]{subsection.5.11.8}{SUNAU8 File Format}{section.5.11}\BOOKMARK [2][-]{subsection.5.11.9}{OGI File Format}{section.5.11}\BOOKMARK [2][-]{subsection.5.11.10}{WAV File Format}{section.5.11}\BOOKMARK [2][-]{subsection.5.11.11}{ALIEN and NOHEAD File Formats}{section.5.11}\BOOKMARK [1][-]{section.5.12}{Direct Audio Input/Output}{chapter.5}\BOOKMARK [1][-]{section.5.13}{Multiple Input Streams}{chapter.5}\BOOKMARK [1][-]{section.5.14}{Vector Quantisation}{chapter.5}\BOOKMARK [1][-]{section.5.15}{Viewing Speech with HList}{chapter.5}\BOOKMARK [1][-]{section.5.16}{Copying and Coding using HCopy}{chapter.5}\BOOKMARK [1][-]{section.5.17}{Version 1.5 Compatibility}{chapter.5}\BOOKMARK [1][-]{section.5.18}{Summary}{chapter.5}\BOOKMARK [0][-]{chapter.6}{Transcriptions and Label Files}{part.2}\BOOKMARK [1][-]{section.6.1}{Label File Structure}{chapter.6}\BOOKMARK [1][-]{section.6.2}{Label File Formats}{chapter.6}\BOOKMARK [2][-]{subsection.6.2.1}{HTK Label Files}{section.6.2}\BOOKMARK [2][-]{subsection.6.2.2}{ESPS Label Files}{section.6.2}\BOOKMARK [2][-]{subsection.6.2.3}{TIMIT Label Files}{section.6.2}\BOOKMARK [2][-]{subsection.6.2.4}{SCRIBE Label Files}{section.6.2}\BOOKMARK [1][-]{section.6.3}{Master Label Files}{chapter.6}\BOOKMARK [2][-]{subsection.6.3.1}{General Principles of MLFs}{section.6.3}\BOOKMARK [2][-]{subsection.6.3.2}{Syntax and Semantics}{section.6.3}\BOOKMARK [2][-]{subsection.6.3.3}{MLF Search}{section.6.3}\BOOKMARK [2][-]{subsection.6.3.4}{MLF Examples}{section.6.3}\BOOKMARK [1][-]{section.6.4}{Editing Label Files}{chapter.6}\BOOKMARK [1][-]{section.6.5}{Summary}{chapter.6}\BOOKMARK [0][-]{chapter.7}{HMM Definition Files}{part.2}\BOOKMARK [1][-]{section.7.1}{The HMM Parameters}{chapter.7}\BOOKMARK [1][-]{section.7.2}{Basic HMM Definitions}{chapter.7}\BOOKMARK [1][-]{section.7.3}{Macro Definitions}{chapter.7}\BOOKMARK [1][-]{section.7.4}{HMM Sets}{chapter.7}\BOOKMARK [1][-]{section.7.5}{Tied-Mixture Systems}{chapter.7}\BOOKMARK [1][-]{section.7.6}{Discrete Probability HMMs}{chapter.7}\BOOKMARK [1][-]{section.7.7}{Input Linear Transforms}{chapter.7}\BOOKMARK [1][-]{section.7.8}{Tee Models}{chapter.7}\BOOKMARK [1][-]{section.7.9}{Regression Class Trees for Adaptation}{chapter.7}\BOOKMARK [1][-]{section.7.10}{Binary Storage Format}{chapter.7}\BOOKMARK [1][-]{section.7.11}{The HMM Definition Language}{chapter.7}\BOOKMARK [0][-]{chapter.8}{HMM Parameter Estimation}{part.2}\BOOKMARK [1][-]{section.8.1}{Training Strategies}{chapter.8}\BOOKMARK [1][-]{section.8.2}{Initialisation using HInit}{chapter.8}\BOOKMARK [1][-]{section.8.3}{Flat Starting with HCompV}{chapter.8}\BOOKMARK [1][-]{section.8.4}{Isolated Unit Re-Estimation using HRest}{chapter.8}\BOOKMARK [1][-]{section.8.5}{Embedded Training using HERest}{chapter.8}\BOOKMARK [1][-]{section.8.6}{Single-Pass Retraining}{chapter.8}\BOOKMARK [1][-]{section.8.7}{Two-model Re-Estimation}{chapter.8}\BOOKMARK [1][-]{section.8.8}{Parameter Re-Estimation Formulae}{chapter.8}\BOOKMARK [2][-]{subsection.8.8.1}{Viterbi Training \(HInit\)}{section.8.8}\BOOKMARK [2][-]{subsection.8.8.2}{Forward/Backward Probabilities}{section.8.8}\BOOKMARK [2][-]{subsection.8.8.3}{Single Model Reestimation\(HRest\)}{section.8.8}\BOOKMARK [2][-]{subsection.8.8.4}{Embedded Model Reestimation\(HERest\)}{section.8.8}\BOOKMARK [0][-]{chapter.9}{HMM Adaptation}{part.2}\BOOKMARK [1][-]{section.9.1}{Model Adaptation using Linear Transformations}{chapter.9}\BOOKMARK [2][-]{subsection.9.1.1}{Linear Transformations}{section.9.1}\BOOKMARK [2][-]{subsection.9.1.2}{Base Class Definitions}{section.9.1}\BOOKMARK [2][-]{subsection.9.1.3}{Regression Class Trees}{section.9.1}\BOOKMARK [2][-]{subsection.9.1.4}{Transform Model File Format}{section.9.1}\BOOKMARK [2][-]{subsection.9.1.5}{Hierarchy of Transform}{section.9.1}\BOOKMARK [2][-]{subsection.9.1.6}{Mutiple Stream Systems}{section.9.1}\BOOKMARK [1][-]{section.9.2}{Adaptive Training with Linear Transforms}{chapter.9}\BOOKMARK [1][-]{section.9.3}{Model Adaptation using MAP}{chapter.9}\BOOKMARK [1][-]{section.9.4}{MLLR Formulae}{chapter.9}\BOOKMARK [2][-]{subsection.9.4.1}{Estimation of the Mean Transformation Matrix}{section.9.4}\BOOKMARK [2][-]{subsection.9.4.2}{Estimation of the Diagonal Variance Transformation Matrix}{section.9.4}\BOOKMARK [0][-]{chapter.10}{HMM System Refinement}{part.2}\BOOKMARK [1][-]{section.10.1}{Using HHEd}{chapter.10}\BOOKMARK [1][-]{section.10.2}{Constructing Context-Dependent Models}{chapter.10}\BOOKMARK [1][-]{section.10.3}{Parameter Tying and Item Lists}{chapter.10}\BOOKMARK [1][-]{section.10.4}{Data-Driven Clustering}{chapter.10}\BOOKMARK [1][-]{section.10.5}{Tree-Based Clustering}{chapter.10}\BOOKMARK [1][-]{section.10.6}{Mixture Incrementing}{chapter.10}\BOOKMARK [1][-]{section.10.7}{Regression Class Tree Construction}{chapter.10}\BOOKMARK [1][-]{section.10.8}{Miscellaneous Operations}{chapter.10}\BOOKMARK [0][-]{chapter.11}{Discrete and Tied-Mixture Models}{part.2}\BOOKMARK [1][-]{section.11.1}{Modelling Discrete Sequences}{chapter.11}\BOOKMARK [1][-]{section.11.2}{Using Discrete Models with Speech}{chapter.11}\BOOKMARK [1][-]{section.11.3}{Tied Mixture Systems}{chapter.11}\BOOKMARK [1][-]{section.11.4}{Parameter Smoothing}{chapter.11}\BOOKMARK [0][-]{chapter.12}{Networks, Dictionaries and Language Models}{part.2}\BOOKMARK [1][-]{section.12.1}{How Networks are Used}{chapter.12}\BOOKMARK [1][-]{section.12.2}{Word Networks and Standard Lattice Format}{chapter.12}\BOOKMARK [1][-]{section.12.3}{Building a Word Network with HParse}{chapter.12}\BOOKMARK [1][-]{section.12.4}{Bigram Language Models}{chapter.12}\BOOKMARK [1][-]{section.12.5}{Building a Word Network with HBuild}{chapter.12}\BOOKMARK [1][-]{section.12.6}{Testing a Word Network using HSGen}{chapter.12}\BOOKMARK [1][-]{section.12.7}{Constructing a Dictionary}{chapter.12}\BOOKMARK [1][-]{section.12.8}{Word Network Expansion}{chapter.12}\BOOKMARK [1][-]{section.12.9}{Other Kinds of Recognition System}{chapter.12}\BOOKMARK [0][-]{chapter.13}{Decoding}{part.2}\BOOKMARK [1][-]{section.13.1}{Decoder Operation}{chapter.13}\BOOKMARK [1][-]{section.13.2}{Decoder Organisation}{chapter.13}\BOOKMARK [1][-]{section.13.3}{Recognition using Test Databases}{chapter.13}\BOOKMARK [1][-]{section.13.4}{Evaluating Recognition Results}{chapter.13}\BOOKMARK [1][-]{section.13.5}{Generating Forced Alignments}{chapter.13}\BOOKMARK [1][-]{section.13.6}{Decoding and Adaptation}{chapter.13}\BOOKMARK [2][-]{subsection.13.6.1}{Recognition with Adapted HMMs}{section.13.6}\BOOKMARK [2][-]{subsection.13.6.2}{Unsupervised Adaptation}{section.13.6}\BOOKMARK [1][-]{section.13.7}{Recognition using Direct Audio Input}{chapter.13}\BOOKMARK [1][-]{section.13.8}{N-Best Lists and Lattices}{chapter.13}\BOOKMARK [-1][-]{part.3}{III Language Modelling}{}\BOOKMARK [0][-]{chapter.14}{Fundamentals of language modelling}{part.3}\BOOKMARK [1][-]{section.14.1}{n-gram language models}{chapter.14}\BOOKMARK [2][-]{subsection.14.1.1}{Word n-gram models}{section.14.1}\BOOKMARK [2][-]{subsection.14.1.2}{Equivalence classes}{section.14.1}\BOOKMARK [2][-]{subsection.14.1.3}{Class n-gram models}{section.14.1}\BOOKMARK [1][-]{section.14.2}{Statistically-derived Class Maps}{chapter.14}\BOOKMARK [2][-]{subsection.14.2.1}{Word exchange algorithm}{section.14.2}\BOOKMARK [1][-]{section.14.3}{Robust model estimation}{chapter.14}\BOOKMARK [2][-]{subsection.14.3.1}{Estimating probabilities}{section.14.3}\BOOKMARK [2][-]{subsection.14.3.2}{Smoothing probabilities}{section.14.3}\BOOKMARK [1][-]{section.14.4}{Perplexity}{chapter.14}\BOOKMARK [1][-]{section.14.5}{Overview of n-Gram Construction Process}{chapter.14}\BOOKMARK [1][-]{section.14.6}{Class-Based Language Models}{chapter.14}\BOOKMARK [0][-]{chapter.15}{A Tutorial Example of Building Language Models}{part.3}\BOOKMARK [1][-]{section.15.1}{Database preparation}{chapter.15}\BOOKMARK [1][-]{section.15.2}{Mapping OOV words}{chapter.15}\BOOKMARK [1][-]{section.15.3}{Language model generation}{chapter.15}\BOOKMARK [1][-]{section.15.4}{Testing the LM perplexity}{chapter.15}\BOOKMARK [1][-]{section.15.5}{Generating and using count-based models}{chapter.15}\BOOKMARK [1][-]{section.15.6}{Model interpolation}{chapter.15}\BOOKMARK [1][-]{section.15.7}{Class-based models}{chapter.15}\BOOKMARK [1][-]{section.15.8}{Problem solving}{chapter.15}\BOOKMARK [2][-]{subsection.15.8.1}{File format problems}{section.15.8}

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -