⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 voicebox speech processing toolbox for matlab.htm

📁 MATLAB工具箱 内容挺丰富
💻 HTM
📖 第 1 页 / 共 2 页
字号:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<!-- saved from url=(0125)http://216.239.35.100/search?q=cache:I5-mf9encRUC:www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html+matlab+toolbox&hl=zh-CN -->
<HTML><HEAD><TITLE>VOICEBOX: Speech Processing Toolbox for MATLAB</TITLE>
<META content="text/html; charset=ISO-8859-1" http-equiv=Content-Type>
<META content="MSHTML 5.00.2920.0" name=GENERATOR></HEAD>
<BODY link=#0000ff vLink=#800080>
<TABLE border=1 width="100%">
  <TBODY>
  <TR>
    <TD>
      <TABLE bgColor=#ffffff border=1 cellPadding=10 cellSpacing=0 width="100%" 
      color="#ffffff">
        <TBODY>
        <TR>
          <TD><FONT color=black face=arial,sans-serif size=-1>This is <B><FONT 
            color=#0039b6>G</FONT> <FONT color=#c41200>o</FONT> <FONT 
            color=#f3c518>o</FONT> <FONT color=#0039b6>g</FONT> <FONT 
            color=#30a72f>l</FONT> <FONT color=#c41200>e</FONT></B>'s <A 
            href="http://www.google.com/help/features.html#cached"><FONT 
            color=blue>cache</FONT></A> of <A 
            href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html"><FONT 
            color=blue>http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html</FONT></A>.<BR><B><FONT 
            color=#0039b6>G</FONT> <FONT color=#c41200>o</FONT> <FONT 
            color=#f3c518>o</FONT> <FONT color=#0039b6>g</FONT> <FONT 
            color=#30a72f>l</FONT> <FONT color=#c41200>e</FONT></B>'s cache is 
            the snapshot that we took of the page as we crawled the web.<BR>The 
            page may have changed since that time. Click here for the <A 
            href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html"><FONT 
            color=blue>current page</FONT></A> without highlighting.<BR>To link 
            to or bookmark this page, use the following url: 
            <CODE>http://www.google.com/search?q=cache:I5-mf9encRUC:www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html+matlab+toolbox&amp;hl=zh-CN</CODE></FONT><BR><BR>
            <CENTER><FONT size=-1><I>Google is not affiliated with the authors 
            of this page nor responsible for its 
        content.</I></FONT></CENTER></TD></TR>
        <TR>
          <TD>
            <TABLE border=0 cellPadding=0 cellSpacing=0>
              <TBODY>
              <TR>
                <TD><FONT color=black face=arial,sans-serif size=-1>These 
                  search terms have been highlighted:&nbsp;</FONT></TD>
                <TD bgColor=#ffff66><B><FONT color=black face=arial,sans-serif 
                  size=-1>matlab&nbsp;</FONT></B></TD>
                <TD bgColor=#a0ffff><B><FONT color=black face=arial,sans-serif 
                  size=-1>toolbox&nbsp;</FONT></B></TD></TR></TBODY></TABLE></TD></TR></TBODY></TABLE></TD></TR></TBODY></TABLE>
<HR>

<META content="Microsoft FrontPage 3.0" name=GENERATOR>
<META content="E:\Program Files\Microsoft Office\Office\html.dot" name=Template>
<H1>VOICEBOX: Speech Processing <B 
style="BACKGROUND-COLOR: #a0ffff; COLOR: black">Toolbox</B> for <B 
style="BACKGROUND-COLOR: #ffff66; COLOR: black">MATLAB</B></H1>
<H2>Introduction</H2>
<P>VOICEBOX is a speech processing <B 
style="BACKGROUND-COLOR: #a0ffff; COLOR: black">toolbox</B> consists of <B 
style="BACKGROUND-COLOR: #ffff66; COLOR: black">MATLAB</B> routines that are 
maintained by and mostly written by <A 
href="http://www.ee.ic.ac.uk/hp/staff/dmb/dmb.html">Mike Brookes</A>, <A 
href="http://www.ee.ic.ac.uk/">Department of Electrical &amp; Electronic 
Engineering</A>, <A href="http://www.ic.ac.uk/">Imperial College</A>, Exhibition 
Road, London SW7 2BT, UK. Several of the routines require <B 
style="BACKGROUND-COLOR: #ffff66; COLOR: black">MATLAB</B> V5.</P>
<P>The routines are available as a <A 
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.tar.Z">compressed 
tar file</A> or as a <A 
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.zip">zip archive</A> 
and are made available under the terms of the <A 
href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/copying.txt">GNU Public 
License</A>. </P>
<P>Please send any comments, suggestions, bug reports etc to <A 
href="mailto:mike.brookes@ic.ac.uk">mike.brookes@ic.ac.uk</A>. </P>
<HR>

<H2>Contents</H2>
<HR>

<DL>
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#file">Audio 
  File Input/Output </A>
  <DD>Read and write WAV and other speech file formats 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#frequency">Frequency 
  Scales </A>
  <DD>Convert between Hz, Mel, Erb and MIDI frequency scales 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#fourier">Fourier/DCT/Hartley 
  Transforms</A> 
  <DD>Various related transforms 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#random">Random 
  Number Generation</A> 
  <DD>Generate random vectors and noise signals 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#distance">Vector 
  Distances</A> 
  <DD>Calculate distances between vector lists 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#analysis">Speech 
  Analysis</A> 
  <DD>Active level estimation, Spectrograms 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#lpc">LPC 
  Analysis of Speech</A> 
  <DD>Linear Predictive Coding routines 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#synthesis">Speech 
  Synthesis</A> 
  <DD>Glottal waveform models 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#enhance">Speech 
  Enhancement</A> 
  <DD>Spectral noise subtraction 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#coding">Speech 
  Coding</A> 
  <DD>PCM coding, Vector quantisation 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#recog">Speech 
  Recognition</A> 
  <DD>Front-end processing for recognition 
  <DT><A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html#utility">Utility 
  Functions</A> 
  <DD>Miscellaneous utility functions </DD></DL>
<HR>

<HR>

<H2><A name=file>Audio File Input/Output</A></H2>
<BLOCKQUOTE>
  <P>Routines are available to read and, in some cases write, a variety of file 
  formats:</P>
  <TABLE border=0 cellPadding=2 width="100%">
    <TBODY>
    <TR>
      <TD width=50><B>Read</B></TD>
      <TD width=50><B>Write</B></TD>
      <TD width=30><B>Suffix</B></TD>
      <TD>&nbsp;</TD></TR>
    <TR>
      <TD width=50><A 
        href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/readwav.txt">readwav</A></TD>
      <TD width=50><A 
        href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/writewav.txt">writewav</A></TD>
      <TD width=30>.wav</TD>
      <TD>These routines allow an arbitrary number of channels and can deal 
        with linear PCM (any precision up to 32 bits), A-law PCM and Mu-law PCM. 
        Large files can be read and written in small chunks.</TD></TR>
    <TR>
      <TD width=50><A 
        href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/readhtk.txt">readhtk</A></TD>
      <TD width=50><A 
        href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/writehtk.txt">writehtk</A></TD>
      <TD width=30>.htk</TD>
      <TD>Read and write waveform files used by Entropic's Hidden Markov 
        Toolkit.</TD></TR>
    <TR>
      <TD width=50><A 
        href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/readsfs.txt">readsfs</A></TD>
      <TD width=50>&nbsp;</TD>
      <TD width=30>.sfs</TD>
      <TD>Speech Filing system files from Mark Huckvale at UCL.</TD></TR>
    <TR>
      <TD width=50><A 
        href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/readsph.txt">readsph</A></TD>
      <TD width=50>&nbsp;</TD>
      <TD width=30>.sph</TD>
      <TD>NIST Sphere format files (including TIMIT).</TD></TR>
    <TR>
      <TD width=50><A 
        href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/readaif.txt">readaif</A></TD>
      <TD width=50>&nbsp;</TD>
      <TD width=30>.aif</TD>
      <TD>Audio Interchange File Format used by Mac 
users.</TD></TR></TBODY></TABLE></BLOCKQUOTE>
<HR>

<H2><A name=frequency>Frequency Scale Conversion</A></H2>
<UL>
  <LI>The <I>mel scale</I> is based on the human perception of sinewave pitch. 
  The routines <A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/mel2frq.txt">mel2frq</A> 
  and <A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/frq2mel.txt">frq2mel</A> 
  convert between this scale and frequency in Hz. 
  <LI>The <I>erb</I> scale is based on the equivalent rectangular bandwidths of 
  the human ear. The routines <A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/erb2frq.txt">erb2frq</A> 
  and <A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/frq2erb.txt">frq2erb</A> 
  convert between the erb rate scale and frequency in Hz. 
  <LI>The <I>midi standard</I> specifies a numbering of <I>semitones</I> with 
  middle C being 60. The routines <A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/frq2midi.txt">frq2midi</A> 
  and <A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/midi2frq.txt">midi2frq</A> 
  convert between this musical frequency scale and Hz. <A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/frq2midi.txt">frq2midi</A> 
  will in addition output note names in a character format. <A 
  href="http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/txt/midi2frq.txt">midi2frq</A> 
  can use the normal equal tempered scale or else the pythagorean scale of just 
  intonation. </LI></UL>

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -