📄 voicebox speech processing toolbox for matlab.htm
字号:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<!-- saved from url=(0125)http://216.239.35.100/search?q=cache:I5-mf9encRUC:www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html+matlab+toolbox&hl=zh-CN -->
<HTML><HEAD><TITLE>VOICEBOX: Speech Processing Toolbox for MATLAB</TITLE>
<META content="text/html; charset=ISO-8859-1" http-equiv=Content-Type>
<META content="MSHTML 5.00.2920.0" name=GENERATOR></HEAD>
<BODY link=#0000ff vLink=#800080>
<TABLE border=1 width="100%">
<TBODY>
<TR>
<TD>
<TABLE bgColor=#ffffff border=1 cellPadding=10 cellSpacing=0 width="100%"
color="#ffffff">
<TBODY>
<TR>
<TD><FONT color=black face=arial,sans-serif size=-1>This is <B><FONT
color=#0039b6>G</FONT> <FONT color=#c41200>o</FONT> <FONT
color=#f3c518>o</FONT> <FONT color=#0039b6>g</FONT> <FONT
color=#30a72f>l</FONT> <FONT color=#c41200>e</FONT></B>'s <A
href="http://www.google.com/help/features.html#cached"><FONT
color=blue>cache</FONT></A> of <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html"><FONT
color=blue>http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html</FONT></A>.<BR><B><FONT
color=#0039b6>G</FONT> <FONT color=#c41200>o</FONT> <FONT
color=#f3c518>o</FONT> <FONT color=#0039b6>g</FONT> <FONT
color=#30a72f>l</FONT> <FONT color=#c41200>e</FONT></B>'s cache is
the snapshot that we took of the page as we crawled the web.<BR>The
page may have changed since that time. Click here for the <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html"><FONT
color=blue>current page</FONT></A> without highlighting.<BR>To link
to or bookmark this page, use the following url:
<CODE>http://www.google.com/search?q=cache:I5-mf9encRUC:www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html+matlab+toolbox&hl=zh-CN</CODE></FONT><BR><BR>
<CENTER><FONT size=-1><I>Google is not affiliated with the authors
of this page nor responsible for its
content.</I></FONT></CENTER></TD></TR>
<TR>
<TD>
<TABLE border=0 cellPadding=0 cellSpacing=0>
<TBODY>
<TR>
<TD><FONT color=black face=arial,sans-serif size=-1>These
search terms have been highlighted: </FONT></TD>
<TD bgColor=#ffff66><B><FONT color=black face=arial,sans-serif
size=-1>matlab </FONT></B></TD>
<TD bgColor=#a0ffff><B><FONT color=black face=arial,sans-serif
size=-1>toolbox </FONT></B></TD></TR></TBODY></TABLE></TD></TR></TBODY></TABLE></TD></TR></TBODY></TABLE>
<HR>
<META content="Microsoft FrontPage 3.0" name=GENERATOR>
<META content="E:\Program Files\Microsoft Office\Office\html.dot" name=Template>
<H1>VOICEBOX: Speech Processing <B
style="BACKGROUND-COLOR: #a0ffff; COLOR: black">Toolbox</B> for <B
style="BACKGROUND-COLOR: #ffff66; COLOR: black">MATLAB</B></H1>
<H2>Introduction</H2>
<P>VOICEBOX is a speech processing <B
style="BACKGROUND-COLOR: #a0ffff; COLOR: black">toolbox</B> consists of <B
style="BACKGROUND-COLOR: #ffff66; COLOR: black">MATLAB</B> routines that are
maintained by and mostly written by <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/dmb.html">Mike Brookes</A>, <A
href="http://www.ee.ic.ac.uk/">Department of Electrical & Electronic
Engineering</A>, <A href="http://www.ic.ac.uk/">Imperial College</A>, Exhibition
Road, London SW7 2BT, UK. Several of the routines require <B
style="BACKGROUND-COLOR: #ffff66; COLOR: black">MATLAB</B> V5.</P>
<P>The routines are available as a <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.tar.Z">compressed
tar file</A> or as a <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.zip">zip archive</A>
and are made available under the terms of the <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/copying.txt">GNU Public
License</A>. </P>
<P>Please send any comments, suggestions, bug reports etc to <A
href="mailto:mike.brookes@ic.ac.uk">mike.brookes@ic.ac.uk</A>. </P>
<HR>
<H2>Contents</H2>
<HR>
<DL>
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#file">Audio
File Input/Output </A>
<DD>Read and write WAV and other speech file formats
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#frequency">Frequency
Scales </A>
<DD>Convert between Hz, Mel, Erb and MIDI frequency scales
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#fourier">Fourier/DCT/Hartley
Transforms</A>
<DD>Various related transforms
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#random">Random
Number Generation</A>
<DD>Generate random vectors and noise signals
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#distance">Vector
Distances</A>
<DD>Calculate distances between vector lists
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#analysis">Speech
Analysis</A>
<DD>Active level estimation, Spectrograms
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#lpc">LPC
Analysis of Speech</A>
<DD>Linear Predictive Coding routines
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#synthesis">Speech
Synthesis</A>
<DD>Glottal waveform models
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#enhance">Speech
Enhancement</A>
<DD>Spectral noise subtraction
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#coding">Speech
Coding</A>
<DD>PCM coding, Vector quantisation
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#recog">Speech
Recognition</A>
<DD>Front-end processing for recognition
<DT><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#utility">Utility
Functions</A>
<DD>Miscellaneous utility functions </DD></DL>
<HR>
<HR>
<H2><A name=file>Audio File Input/Output</A></H2>
<BLOCKQUOTE>
<P>Routines are available to read and, in some cases write, a variety of file
formats:</P>
<TABLE border=0 cellPadding=2 width="100%">
<TBODY>
<TR>
<TD width=50><B>Read</B></TD>
<TD width=50><B>Write</B></TD>
<TD width=30><B>Suffix</B></TD>
<TD> </TD></TR>
<TR>
<TD width=50><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/readwav.txt">readwav</A></TD>
<TD width=50><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/writewav.txt">writewav</A></TD>
<TD width=30>.wav</TD>
<TD>These routines allow an arbitrary number of channels and can deal
with linear PCM (any precision up to 32 bits), A-law PCM and Mu-law PCM.
Large files can be read and written in small chunks.</TD></TR>
<TR>
<TD width=50><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/readhtk.txt">readhtk</A></TD>
<TD width=50><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/writehtk.txt">writehtk</A></TD>
<TD width=30>.htk</TD>
<TD>Read and write waveform files used by Entropic's Hidden Markov
Toolkit.</TD></TR>
<TR>
<TD width=50><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/readsfs.txt">readsfs</A></TD>
<TD width=50> </TD>
<TD width=30>.sfs</TD>
<TD>Speech Filing system files from Mark Huckvale at UCL.</TD></TR>
<TR>
<TD width=50><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/readsph.txt">readsph</A></TD>
<TD width=50> </TD>
<TD width=30>.sph</TD>
<TD>NIST Sphere format files (including TIMIT).</TD></TR>
<TR>
<TD width=50><A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/readaif.txt">readaif</A></TD>
<TD width=50> </TD>
<TD width=30>.aif</TD>
<TD>Audio Interchange File Format used by Mac
users.</TD></TR></TBODY></TABLE></BLOCKQUOTE>
<HR>
<H2><A name=frequency>Frequency Scale Conversion</A></H2>
<UL>
<LI>The <I>mel scale</I> is based on the human perception of sinewave pitch.
The routines <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/mel2frq.txt">mel2frq</A>
and <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/frq2mel.txt">frq2mel</A>
convert between this scale and frequency in Hz.
<LI>The <I>erb</I> scale is based on the equivalent rectangular bandwidths of
the human ear. The routines <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/erb2frq.txt">erb2frq</A>
and <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/frq2erb.txt">frq2erb</A>
convert between the erb rate scale and frequency in Hz.
<LI>The <I>midi standard</I> specifies a numbering of <I>semitones</I> with
middle C being 60. The routines <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/frq2midi.txt">frq2midi</A>
and <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/midi2frq.txt">midi2frq</A>
convert between this musical frequency scale and Hz. <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/frq2midi.txt">frq2midi</A>
will in addition output note names in a character format. <A
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/midi2frq.txt">midi2frq</A>
can use the normal equal tempered scale or else the pythagorean scale of just
intonation. </LI></UL>
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -