<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html;CHARSET=iso-8859-1">
<TITLE>Fraunhofer IIS - Basics about MPEG Perceptual Audio Coding</TITLE>
</HEAD>
<BODY BACKGROUND="backgnd.gif" TEXT="#000000" BGCOLOR="#F9FBFB" LINK="#006666" VLINK="#4C4C4C" ALINK="#995500">
<P>
<TABLE BORDER="0">
<TR>
<TD><A HREF="../../index.html"><IMG SRC="iis_125.gif" WIDTH="125" HEIGHT="44" ALT="IIS-Logo" BORDER="0"></A><BR>
<IMG SRC="fhg_tran.gif" WIDTH="150" HEIGHT="1"></TD>
<TD><FONT SIZE="2" FACE="arial,helvetica">
<NOBR><A HREF="../../index.html">Home</A> |</NOBR>
<NOBR><A HREF="http://www.fhg.de/english.html">Fraunhofer Gesellschaft</A> |</NOBR>
<NOBR><A HREF="../../bf/index.html">Business Fields</A> |</NOBR>
<NOBR><A HREF="../../comp/index.html">Competences</A> |</NOBR>
<NOBR><A HREF="../../propro/index.html">Projects & Products</A> |</NOBR>
<A HREF="../../press/index.html">Press</A>
</FONT>
</TD>
</TR>
</TABLE>
</P>
<P>
<TABLE BORDER="0">
<TR>
<TD WIDTH="150" VALIGN="TOP" ROWSPAN="24"><IMG SRC="fhg_tran.gif" WIDTH="150" HEIGHT="1" ALIGN="TOP" BORDER="0"><BR>
<A HREF="../index.html"><FONT FACE="Arial, Helvetica"><B>AMM Home</B></FONT></A><BR><BR>
<A HREF="../techinf/index.html"><FONT FACE="Arial, Helvetica"><B>Technology</B></FONT></A><BR>
<P><TABLE BORDER="0" CELLPADDING="2" CELLSPACING="0">
<TR>
<TD WIDTH="5%"> </TD>
<TD><A HREF="layer3/index.html"><FONT SIZE="2" FACE="Arial, Helvetica"><B>MPEG Layer-3</B></FONT></A></TD>
<TD WIDTH="5%"> </TD>
</TR>
<TR>
<TD WIDTH="5%"> </TD>
<TD><A HREF="aac/index.html"><FONT SIZE="2" FACE="Arial, Helvetica"><B>MPEG-2 AAC</B></FONT></A></TD>
<TD WIDTH="5%"> </TD>
</TR>
<TR>
<TD WIDTH="5%"> </TD>
<TD><A HREF="mpeg4/index.html"><FONT SIZE="2" FACE="Arial, Helvetica"><B>MPEG-4</B></FONT></A></TD>
<TD WIDTH="5%"> </TD>
</TR>
<TR>
<TD WIDTH="5%"> </TD>
<TD><A HREF="ipmp/index.html"><FONT SIZE="2" FACE="Arial, Helvetica"><B>IPMP</B></FONT></A></TD>
<TD WIDTH="5%"> </TD>
</TR>
<TR>
<TD WIDTH="5%"> </TD>
<TD><A HREF="error_conceal/index.html"><FONT SIZE="2" FACE="Arial, Helvetica"><B>SHNI</B></FONT></A></TD>
<TD WIDTH="5%"> </TD>
</TR>
<TR>
<TD WIDTH="5%"> </TD>
<TD><A HREF="nmr/index.html"><FONT SIZE="2" FACE="Arial, Helvetica"><B>Measurements</B><BR></FONT></A></TD>
<TD WIDTH="5%"> </TD>
</TR>
<TR>
<TD WIDTH="5%"> </TD>
<TD><A HREF="video/index.html"><FONT SIZE="2" FACE="Arial, Helvetica"><B>Video</B><BR></FONT></A></TD>
<TD WIDTH="5%"> </TD>
</TR>
</TABLE><BR>
<A HREF="../services/index.html"><FONT FACE="Arial, Helvetica"><B>Services</B></FONT></A><BR><BR>
<A HREF="../application/index.html"><FONT FACE="Arial, Helvetica"><B>Applications</B></FONT></A><BR><BR>
<A HREF="../partners/index.html"><FONT FACE="Arial, Helvetica"><B>Partners</B></FONT></A><BR><BR>
<A HREF="../download/index.html"><FONT FACE="Arial, Helvetica"><B>Download</B></FONT></A><BR><BR>
<A HREF="../support/index.html"><FONT FACE="Arial, Helvetica"><B>Support</B></FONT></A><BR><BR>
<A HREF="../gallery/index.html"><FONT FACE="Arial, Helvetica"><B>Gallery</B></FONT></A><BR><BR>
<A HREF="../events/index.html"><FONT FACE="Arial, Helvetica"><B>Events & Press</B></FONT></A><BR><BR>
<A HREF="../contact/index.html"><FONT FACE="Arial, Helvetica"><B>Contact</B></FONT></A><BR><BR>
<A HREF="../legal/index.html"><FONT FACE="Arial, Helvetica"><B>Licensing</B></FONT></A>
</TD>
<TD COLSPAN="2">
<FONT SIZE="4" FACE="Arial, Helvetica"><B>Basics about MPEG Perceptual Audio Coding</B></FONT></TD>
<TD><IMG SRC="fhg_tran.gif" WIDTH="50" HEIGHT="1" ALIGN="BOTTOM" BORDER="0"></TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD><IMG SRC="fhg_tran.gif" WIDTH="30" HEIGHT="1" ALIGN="BOTTOM" BORDER="0"></TD>
<TD>
<UL>
<LI><FONT FACE="Arial, Helvetica"><B><A HREF="#1">Introduction</A></B></FONT>
<LI><FONT FACE="Arial, Helvetica"><B><A HREF="#2">The purpose of audio compression</A></B></FONT>
<LI><FONT FACE="Arial, Helvetica"><B><A HREF="#3">The two parts of audio compression</A></B></FONT>
<LI><FONT FACE="Arial, Helvetica"><B><A HREF="#4">How does it work?</A></B></FONT>
<LI><FONT FACE="Arial, Helvetica"><B><A HREF="#5">Compression ratios, bitrate and quality</A></B></FONT>
</UL>
</TD>
<TD> </TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD COLSPAN="2"><A NAME="1"></A><FONT FACE="Arial, Helvetica"><B>Introduction</B></FONT></TD>
<TD> </TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD> </TD>
<TD><FONT FACE="Arial, Helvetica">There is a lot of confusion surrounding the terms <B>audio compression</B>, <B>audio encoding</B>, and <B>audio decoding</B>.
This section gives you an overview of what <B>audio coding</B> (another one of these terms...) is all about.</FONT>
</TD>
<TD> </TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD COLSPAN="2"><A NAME="2"></A><FONT FACE="Arial, Helvetica"><B>The purpose of audio compression</B></FONT></TD>
<TD> </TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD> </TD>
<TD><P><FONT FACE="Arial, Helvetica">Before the advent of audio compression, high-quality digital audio data took a lot of hard disk space to store (or channel bandwidth to transmit).<BR>
Let us go through a short example. You want to sample your favorite 1-minute song and store it on your hard disk. Because you want CD quality, you
sample at 44.1 kHz, stereo, with 16 bits per sample.<BR>
44,100 Hz means that you have 44,100 values per second coming in from your sound card (or input file). Multiply that by two
because you have two channels. Multiply by another factor of two because you have two bytes per value (that's what 16 bits
means). The song will take up</FONT>
<P><FONT FACE="Arial, Helvetica"><CENTER>44,100 samples/s * 2 channels * 2 bytes/sample * 60 s/min = 10,584,000 bytes, or around 10 MBytes</CENTER></FONT>
<P><FONT FACE="Arial, Helvetica">of storage space on your hard disk. If you wanted to download that over the Internet with a typical 28.8 kbit/s modem, it would take you</FONT>
<P><CENTER><FONT FACE="Arial, Helvetica">10,584,000 bytes * 8 bits/byte / (28,800 bits/s * 60 s/min) = around 49 minutes.<BR>
<BR><B>Just to download one minute of stereo music!</B></FONT></CENTER>
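<P><FONT FACE="Arial, Helvetica">The arithmetic above can be checked with a few lines of Python. This is only a sketch: the constant and variable names are chosen for illustration, and the modem is assumed to deliver its full nominal 28,800 bits/s with no protocol overhead.</FONT>
<PRE>
```python
# Uncompressed CD-quality audio: storage size and modem download time.
SAMPLE_RATE_HZ = 44_100      # samples per second and channel
CHANNELS = 2                 # stereo
BYTES_PER_SAMPLE = 2         # 16 bits per sample
DURATION_S = 60              # one minute of music
MODEM_BITS_PER_S = 28_800    # nominal rate, overhead ignored (assumption)

size_bytes = SAMPLE_RATE_HZ * CHANNELS * BYTES_PER_SAMPLE * DURATION_S
download_s = size_bytes * 8 / MODEM_BITS_PER_S

print(size_bytes)        # 10584000
print(download_s / 60)   # 49.0
```
</PRE>
<P><FONT FACE="Arial, Helvetica">Note that the 49 minutes follow from the exact byte count; starting from a rounded 10,000,000 bytes would give about 46 minutes instead.</FONT>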
<P><FONT FACE="Arial, Helvetica">Digital audio coding, which in this context is also called digital audio compression, is the art of minimizing the storage
space (or channel bandwidth) required for audio data. Modern perceptual audio coding techniques (like MPEG Layer-3 or MPEG-2 AAC) exploit the properties
of the human ear (the perception of sound) to achieve a size reduction by a factor of 12 with little or no perceptible loss of quality.<BR>
Such schemes are therefore the key technology for high-quality, low-bitrate applications such as soundtracks for CD-ROM games, solid-state sound memories, Internet
audio, and digital audio broadcasting systems.</FONT>
</TD>
<TD> </TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD COLSPAN="2"><A NAME="3"></A><FONT FACE="Arial, Helvetica"><B>The two parts of audio compression</B></FONT></TD>
<TD> </TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD> </TD>
<TD><P><FONT FACE="Arial, Helvetica">Audio compression really consists of two parts. The first part, called <I>encoding</I>, transforms the digital audio data that resides,
say, in a WAVE file, into a highly compressed form called a <I>bitstream</I> (or coded audio data). To play the bitstream on your sound card, you need the
second part, called <I>decoding</I>. Decoding takes the bitstream and reconstructs a WAVE file from it.</FONT>
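<P><FONT FACE="Arial, Helvetica">The encode/decode split can be illustrated with a deliberately crude toy codec. This is not how an MPEG coder works: the hypothetical <TT>encode</TT> and <TT>decode</TT> functions below simply keep the upper 8 bits of each 16-bit sample. The sketch only shows the roles of the two parts and the fact that the reconstructed samples are close to, but not identical with, the originals.</FONT>
<PRE>
```python
# Toy codec for illustration only; real perceptual coders are far more elaborate.

def encode(samples):
    """Turn signed 16-bit PCM samples into a toy bitstream (one byte each)."""
    # Shift to the unsigned range 0..65535, then drop the lower 8 bits.
    return bytes((s + 32768) // 256 for s in samples)

def decode(bitstream):
    """Reconstruct signed 16-bit PCM samples from the toy bitstream."""
    # Restore the scale; the discarded bits are replaced by the midpoint (128).
    return [b * 256 + 128 - 32768 for b in bitstream]

pcm = [0, 1000, -1000, 32767, -32768]
bitstream = encode(pcm)      # 5 bytes instead of 10
restored = decode(bitstream)

print(len(bitstream))  # 5
print(restored)        # [128, 896, -896, 32640, -32640]
```
</PRE>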
</TD>
<TD> </TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD COLSPAN="2"><A NAME="4"></A><FONT FACE="Arial, Helvetica"><B>How does it work?</B></FONT></TD>
<TD> </TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD> </TD>
<TD><P><FONT FACE="Arial, Helvetica">The highest coding efficiency is achieved with algorithms that exploit signal redundancies and irrelevancies
in the frequency domain, based on a model of the human auditory system.<BR>
All coders use the same basic structure. The coding scheme can be described as "perceptual
noise shaping" or "perceptual subband / transform coding". The encoder analyzes the spectral
components of the audio signal by applying a filterbank (transform) and uses a psychoacoustic
model to estimate the just noticeable noise level. In its quantization and coding stage, the encoder
tries to allocate the available data bits so that both the bitrate and the masking
requirements are met.<BR>
The decoder is much less complex. Its only task is to synthesize an audio signal from the coded
spectral components.<BR>
All Layers use the same analysis filterbank (polyphase with 32 subbands). Layer-3 adds an MDCT
transform to increase the frequency resolution.</FONT>
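<P><FONT FACE="Arial, Helvetica">To give an idea of what the MDCT mentioned above computes, here is a minimal sketch of the textbook definition in Python. It is a direct, slow evaluation that omits the windowing and the preceding polyphase filterbank, so it should not be read as the Layer-3 implementation; the function name <TT>mdct</TT> and the test signal are chosen for illustration. A block of 2N time samples is turned into N spectral coefficients.</FONT>
<PRE>
```python
import math

def mdct(block):
    """Direct O(N^2) MDCT: 2N time samples in, N coefficients out."""
    n_coef = len(block) // 2
    n0 = 0.5 + n_coef / 2  # phase offset from the standard MDCT definition
    return [
        sum(x * math.cos(math.pi / n_coef * (n + n0) * (k + 0.5))
            for n, x in enumerate(block))
        for k in range(n_coef)
    ]

N = 8    # number of coefficients (block length is 2N)
K0 = 3   # index of the basis function used as test input

# Feed in the K0-th MDCT basis function itself.
block = [math.cos(math.pi / N * (n + 0.5 + N / 2) * (K0 + 0.5))
         for n in range(2 * N)]
coefs = mdct(block)
peak = max(range(N), key=lambda k: abs(coefs[k]))

print(peak)  # 3
```
</PRE>
<P><FONT FACE="Arial, Helvetica">Because the input is one of the transform's own basis functions, all of its energy lands in a single coefficient. This concentration of tonal signals into few spectral lines is what a fine frequency resolution buys the encoder.</FONT>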
</TD>
<TD> </TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD COLSPAN="2"><A NAME="5"></A><FONT FACE="Arial, Helvetica"><B>Compression ratios, bitrate and quality</B></FONT></TD>
<TD> </TD>
</TR>
<TR>
<TD COLSPAN="3" HEIGHT="30"> </TD>
</TR>
<TR>
<TD> </TD>
<TD>
<P><FONT FACE="Arial, Helvetica">One thing has not been stated explicitly so far: what you end up with after encoding and decoding is not the same sound file
anymore. All superfluous information, that is, the redundant and irrelevant parts of the sound signal, has been squeezed out. The reconstructed WAVE file differs from
the original WAVE file, but it will sound the same, more or less, depending on how much compression was applied to it.</FONT>
<P><FONT FACE="Arial, Helvetica">Because compression ratio is a somewhat unwieldy measure, experts use the term bitrate when speaking of the strength of
compression. Bitrate denotes the average number of bits that one second of audio data consumes. The usual unit is kbps, which stands for kbit per second, or 1000 bits/s.</FONT>
<P><FONT FACE="Arial, Helvetica">For a digital audio signal from a CD, the bit-rate is 1411.2 kbps.
With <A HREF="aac/index.html">MPEG-2 AAC</A>, CD-like sound quality is achieved at 96 kbps.</FONT>
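<P><FONT FACE="Arial, Helvetica">The two bitrates quoted above can be related with a short calculation. The sketch below assumes the CD parameters from the earlier example (44.1 kHz, 16 bits, stereo); the variable names are illustrative only.</FONT>
<PRE>
```python
# Bitrate of CD audio versus the 96 kbps quoted for MPEG-2 AAC.
SAMPLE_RATE_HZ = 44_100
BITS_PER_SAMPLE = 16
CHANNELS = 2
AAC_BITS_PER_S = 96_000   # 96 kbps, with 1 kbps = 1000 bits/s

cd_bits_per_s = SAMPLE_RATE_HZ * BITS_PER_SAMPLE * CHANNELS
compression_ratio = cd_bits_per_s / AAC_BITS_PER_S
aac_bytes_per_minute = AAC_BITS_PER_S * 60 // 8

print(cd_bits_per_s / 1000)          # 1411.2
print(round(compression_ratio, 1))   # 14.7
print(aac_bytes_per_minute)          # 720000
```
</PRE>
<P><FONT FACE="Arial, Helvetica">At 96 kbps, one minute of music therefore takes 720,000 bytes instead of 10,584,000 bytes.</FONT>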
<P><CENTER><A HREF="#top"><FONT SIZE="2">| TOP |</FONT></A></CENTER>
</TD>
<TD> </TD>
</TR>
</TABLE>
</P>
<P>
<TABLE BORDER="0">
<TR>
<TD><IMG SRC="fhg_tran.gif" WIDTH="150" HEIGHT="1"></TD>
<TD>
<FONT SIZE="2" FACE="arial, helvetica"><I><B>Copyright ©1998
<A HREF="http://www.fhg.de/contact.html">Fraunhofer-Gesellschaft</A></B></I></FONT>
</TD>
</TR>
</TABLE>
</P>
</BODY>
</HTML>