📄 chp9.htm

📁 数字图象处理入门,非常好的书！！！！推荐！
💻 HTM
📖 第 1 页 / 共 5 页
字号:
style='mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>第</span><span
style='font-family:"Times New Roman"'>9</span><span lang=ZH-CN
style='font-family:黑体;mso-hansi-font-family:"Times New Roman"'>章</span><span
lang=ZH-CN style='font-family:"Times New Roman"'> </span><span lang=ZH-CN
style='font-family:黑体;mso-hansi-font-family:"Times New Roman"'>图象的压缩编码，</span><span
style='font-family:"Times New Roman"'>JPEG</span><span lang=ZH-CN
style='font-family:黑体;mso-hansi-font-family:"Times New Roman"'>压缩编码标准</span><span
style='font-family:"Times New Roman"'><o:p></o:p></span></h1>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>在介绍图象的压缩编码之前，先考虑一个问题：为什么要压缩？其实这个问题不用我回答，你也能想得到。因为图象信息的数据量实在是太惊人了。举一个例子就明白：一张</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>A4(210mm×297mm) </span><span
lang=ZH-CN style='font-size:10.5pt'>幅面的照片，若用中等分辨率</span><span style='font-size:
10.5pt;font-family:"Times New Roman"'>(300dpi)</span><span lang=ZH-CN
style='font-size:10.5pt'>的扫描仪按真彩色扫描，其数据量为多少？让我们来计算一下：共有</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(300×210/25.4)
×(300×297/25.4)</span><span lang=ZH-CN style='font-size:10.5pt'>个象素，每个象素占</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>3</span><span
lang=ZH-CN style='font-size:10.5pt'>个字节，其数据量为</span><span style='font-size:
10.5pt;font-family:"Times New Roman"'>26M</span><span lang=ZH-CN
style='font-size:10.5pt'>字节，其数据量之大可见一斑了。</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>如今在</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>Internet</span><span
lang=ZH-CN style='font-size:10.5pt'>上，传统基于字符界面的应用逐渐被能够浏览图象信息的</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>WWW(World Wide Web)</span><span
lang=ZH-CN style='font-size:10.5pt'>方式所取代。</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>WWW</span><span lang=ZH-CN style='font-size:
10.5pt'>尽管漂亮，但是也带来了一个问题：图象信息的数据量太大了，本来就已经非常紧张的网络带宽变得更加不堪重负，使得</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>World Wide Web</span><span
lang=ZH-CN style='font-size:10.5pt'>变成了</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>World Wide Wait</span><span lang=ZH-CN
style='font-size:10.5pt'>。</span><span style='font-size:10.5pt;font-family:
"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>总之，大数据量的图象信息会给存储器的存储容量，通信干线信道的带宽，以及计算机的处理速度增加极大的压力。单纯靠增加存储器容量，提高信道带宽以及计算机的处理速度等方法来解决这个问题是不现实的，这时就要考虑压缩。</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>压缩的理论基础是信息论。从信息论的角度来看，压缩就是去掉信息中的冗余，即保留不确定的信息，去掉确定的信息</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(</span><span
lang=ZH-CN style='font-size:10.5pt'>可推知的</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>)</span><span lang=ZH-CN style='font-size:10.5pt'>，也就是用一种更接近信息本质的描述来代替原有冗余的描述。这个本质的东西就是信息量</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(</span><span
lang=ZH-CN style='font-size:10.5pt'>即不确定因素</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>)</span><span lang=ZH-CN style='font-size:10.5pt'>。</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>压缩可分为两大类：第一类压缩过程是可逆的，也就是说，从压缩后的图象能够完全恢复出原来的图象，信息没有任何丢失，称为无损压缩；第二类压缩过程是不可逆的，无法完全恢复出原图象，信息有一定的丢失，称为有损压缩。选择哪一类压缩，要折衷考虑，尽管我们希望能够无损压缩，但是通常有损压缩的压缩比</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(</span><span
lang=ZH-CN style='font-size:10.5pt'>即原图象占的字节数与压缩后图象占的字节数之比，压缩比越大，说明压缩效率越高</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>)</span><span
lang=ZH-CN style='font-size:10.5pt'>比无损压缩的高。</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>图象压缩一般通过改变图象的表示方式来达到，因此压缩和编码是分不开的。图象压缩的主要应用是图象信息的传输和存储，可广泛地应用于广播电视、电视会议、计算机通讯、传真、多媒体系统、医学图象、卫星图象等领域。</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>压缩编码的方法有很多，主要分成以下四大类：</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(1)</span><span
lang=ZH-CN style='font-size:10.5pt'>象素编码；</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(2)</span><span lang=ZH-CN style='font-size:
10.5pt'>预测编码；</span><span style='font-size:10.5pt;font-family:"Times New Roman"'>(3)</span><span
lang=ZH-CN style='font-size:10.5pt'>变换编码；</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(4)</span><span lang=ZH-CN style='font-size:
10.5pt'>其它方法。</span><span style='font-size:10.5pt;font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>所谓象素编码是指，编码时对每个象素单独处理，不考虑象素之间的相关性。在象素编码中常用的几种方法有：</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(1)</span><span
lang=ZH-CN style='font-size:10.5pt'>脉冲编码调制</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(Pulse Code Modulation</span><span lang=ZH-CN
style='font-size:10.5pt'>，简称</span><span style='font-size:10.5pt;font-family:
"Times New Roman"'>PCM)</span><span lang=ZH-CN style='font-size:10.5pt'>；</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(2)</span><span
lang=ZH-CN style='font-size:10.5pt'>熵编码</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(Entropy Coding)</span><span lang=ZH-CN
style='font-size:10.5pt'>；</span><span style='font-size:10.5pt;font-family:
"Times New Roman"'>(3)</span><span lang=ZH-CN style='font-size:10.5pt'>行程编码</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(Run Length Coding)</span><span
lang=ZH-CN style='font-size:10.5pt'>；</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(4)</span><span lang=ZH-CN style='font-size:
10.5pt'>位平面编码</span><span style='font-size:10.5pt;font-family:"Times New Roman"'>(Bit
Plane Coding)</span><span lang=ZH-CN style='font-size:10.5pt'>。其中我们要介绍的是熵编码中的哈夫曼</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(Huffman)</span><span
lang=ZH-CN style='font-size:10.5pt'>编码和行程编码</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(</span><span lang=ZH-CN style='font-size:10.5pt'>以读取</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>.PCX</span><span
lang=ZH-CN style='font-size:10.5pt'>文件为例</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>)</span><span lang=ZH-CN style='font-size:10.5pt'>。</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>所谓预测编码是指，去除相邻象素之间的相关性和冗余性，只对新的信息进行编码。举个简单的例子，因为象素的灰度是连续的，所以在一片区域中，相邻象素之间灰度值的差别可能很小。如果我们只记录第一个象素的灰度，其它象素的灰度都用它与前一个象素灰度之差来表示，就能起到压缩的目的。如</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>248</span><span
lang=ZH-CN style='font-size:10.5pt'>，</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>2</span><span lang=ZH-CN style='font-size:10.5pt'>，</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>1</span><span
lang=ZH-CN style='font-size:10.5pt'>，</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>0</span><span lang=ZH-CN style='font-size:10.5pt'>，</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>1</span><span
lang=ZH-CN style='font-size:10.5pt'>，</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>3</span><span lang=ZH-CN style='font-size:10.5pt'>，实际上这</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>6</span><span
lang=ZH-CN style='font-size:10.5pt'>个象素的灰度是</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>248</span><span lang=ZH-CN style='font-size:
10.5pt'>，</span><span style='font-size:10.5pt;font-family:"Times New Roman"'>250</span><span
lang=ZH-CN style='font-size:10.5pt'>，</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>251</span><span lang=ZH-CN style='font-size:
10.5pt'>，</span><span style='font-size:10.5pt;font-family:"Times New Roman"'>251</span><span
lang=ZH-CN style='font-size:10.5pt'>，</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>252</span><span lang=ZH-CN style='font-size:
10.5pt'>，</span><span style='font-size:10.5pt;font-family:"Times New Roman"'>255</span><span
lang=ZH-CN style='font-size:10.5pt'>。表示</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>250</span><span lang=ZH-CN style='font-size:
10.5pt'>需要</span><span style='font-size:10.5pt;font-family:"Times New Roman"'>8</span><span
lang=ZH-CN style='font-size:10.5pt'>个比特，而表示</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>2</span><span lang=ZH-CN style='font-size:10.5pt'>只需要两个比特，这样就实现了压缩。</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>常用的预测编码有Δ调制</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(Delta Modulation</span><span
lang=ZH-CN style='font-size:10.5pt'>，简称</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>DM)</span><span lang=ZH-CN style='font-size:
10.5pt'>；微分预测编码</span><span style='font-size:10.5pt;font-family:"Times New Roman"'>(Differential
Pulse Code Modulation</span><span lang=ZH-CN style='font-size:10.5pt'>，</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>DPCM)</span><span
lang=ZH-CN style='font-size:10.5pt'>，具体的细节在此就不详述了。</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>所谓变换编码是指，将给定的图象变换到另一个数据域</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(</span><span
lang=ZH-CN style='font-size:10.5pt'>如频域</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>)</span><span lang=ZH-CN style='font-size:10.5pt'>上，使得大量的信息能用较少的数据来表示，从而达到压缩的目的。变换编码有很多，如</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(1)</span><span
lang=ZH-CN style='font-size:10.5pt'>离散傅立叶变换</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(Discrete Fourier Transform</span><span
lang=ZH-CN style='font-size:10.5pt'>，简称</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>DFT)</span><span lang=ZH-CN style='font-size:
10.5pt'>；</span><span style='font-size:10.5pt;font-family:"Times New Roman"'>(2)</span><span
lang=ZH-CN style='font-size:10.5pt'>离散余弦变换</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(Discrete Cosine Transform</span><span
lang=ZH-CN style='font-size:10.5pt'>，简称</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>DCT)</span><span lang=ZH-CN style='font-size:
10.5pt'>；</span><span style='font-size:10.5pt;font-family:"Times New Roman"'>(3)</span><span
lang=ZH-CN style='font-size:10.5pt'>离散哈达玛变换</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(Discrete Hadamard Transform</span><span
lang=ZH-CN style='font-size:10.5pt'>，简称</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>DHT)</span><span lang=ZH-CN style='font-size:
10.5pt'>。</span><span style='font-size:10.5pt;font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>其它的编码方法也有很多，如混合编码</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(Hybird Coding)</span><span
lang=ZH-CN style='font-size:10.5pt'>、矢量量化</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(Vector Quantize</span><span lang=ZH-CN
style='font-size:10.5pt'>，</span><span style='font-size:10.5pt;font-family:
"Times New Roman"'>VQ) </span><span lang=ZH-CN style='font-size:10.5pt'>、</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>LZW</span><span
lang=ZH-CN style='font-size:10.5pt'>算法。在这里，我们只介绍</span><span style='font-size:
10.5pt;font-family:"Times New Roman"'>LZW</span><span lang=ZH-CN
style='font-size:10.5pt'>算法的大体思想。</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'><o:p></o:p></span></p>

<p style='margin:0cm;margin-bottom:.0001pt;text-align:justify;text-justify:
inter-ideograph;line-height:18.0pt'><span lang=ZH-CN style='font-size:10.5pt'>值得注意的是，近些年来出现了很多新的压缩编码方法，如使用人工神经元网络</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(Artificial Neural
Network</span><span lang=ZH-CN style='font-size:10.5pt'>，简称</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>ANN)</span><span
lang=ZH-CN style='font-size:10.5pt'>的压缩编码算法、分形</span><span style='font-size:
10.5pt;font-family:"Times New Roman"'>(Fractl)</span><span lang=ZH-CN
style='font-size:10.5pt'>、小波</span><span style='font-size:10.5pt;font-family:
"Times New Roman"'>(Wavelet) </span><span lang=ZH-CN style='font-size:10.5pt'>、基于对象</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>(Object Based)</span><span
lang=ZH-CN style='font-size:10.5pt'>的压缩编码算法、基于模型</span><span style='font-size:
10.5pt;font-family:"Times New Roman"'>(Model –Based)</span><span lang=ZH-CN
style='font-size:10.5pt'>的压缩编码算法</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'>(</span><span lang=ZH-CN style='font-size:10.5pt'>应用在</span><span
style='font-size:10.5pt;font-family:"Times New Roman"'>MPEG4</span><span
lang=ZH-CN style='font-size:10.5pt'>及未来的视频压缩编码标准中</span><span style='font-size:
10.5pt;font-family:"Times New Roman"'>)</span><span lang=ZH-CN
style='font-size:10.5pt'>。这些都超出了本书的范围。</span><span style='font-size:10.5pt;
font-family:"Times New Roman"'><o:p></o:p></span></p>
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -