📄 readme

📁 小波变换
💻
字号:
Baseline Wavelet Transform Coder Construction Kit-------------------------------------------------Version 0.3, 1/29/97-------------------------------------------------Geoff Davisgdavis@cs.dartmouth.eduhttp://www.cs.dartmouth.edu/~gdavis-------------------------------------------------Arithmetic coding library courtesy ofJohn Danskinjmd@cs.dartmouth.eduhttp://www.cs.dartmouth.edu/~jmdPGM file load/save courtesy ofRay Heasmanray@rucus.ru.ac.za-------------------------------------------------Source is available fromhttp://www.cs.dartmouth.edu/~gdavis/wavelet/wavelet.htmlNOTE: This is an alpha version of the code!  It isn't yet fullydocumented.  Eventually I hope to have a tutorial to accompany this.There are one or two known (minor) bugs.  Upgrades will be supplied onthe web at irregular intervals.  Caveat emptor.-------------------------------------------------Revision logVersion 0.3 1/29/97 --  fixed a bug in the allocator so that actual rates				are much closer to target rates			switched to binary files for i/o -- this hopefully				will fix problems people using DOS/Windows95				were having (thanks to Daniel Weng)			changed default for Real to float from double			fixed the copy constructor for WaveletTransform			plugged some memory leaks in the WaveletTransform 				constructors (thanks to Derek Ho)			added new filter set from J. Odegard and S. BurrusVersion 0.2 11/22/96 -- plugged lots of memory leaks, fixed minor				bugs in encode.cc and decode.cc			replaced all references to doubles with Reals			added support for PGM files				(thanks to Ray Heasman)			added double precision coefficients				from EPIC (thanks to Eero Simoncelli)Version 0.1 10/13/96 -- fixed a few compiler bugs			plus a minor bug in			quantizer.ccVersion 0.0  9/12/96 -- initial version-------------------------------------------------This code implements a reasonably good wavelet transform-based imagecoder for grayscale images.  The coder is not the most sophisticated-- it's a simple transform coder -- but each individual piece of thetransform coder has been chosen for high performance.  Performance Data for 512x512 Lena image (default settings)Target compression ratio        Actual ratio	PSNR (dB)	RMS error------------------------        ------------    ---------       ---------4:1				4.02:1 		43.71		1.668:1				8.00:1		39.42		2.7316:1				16.01:1		36.18		3.9632:1				32.01:1		33.17		5.6064:1				64.08:1		30.22		7.86128:1				129.26:1	27.73		10.48The code has been designed for experimentation.  It's very modular andshould allow for simple replacements of individual components.  Onecan easily replace the quantizer, the entropy coder, and the waveletfilters.  If you do modify/upgrade/replace sections of this code, I would verymuch appreciate hearing about it.  I hope to make this constructionkit a collaborative effort with a whole range of modules supplied bydifferent researchers.  A wish list of future improvements is includedat the end of this file.  I will provide WWW links to any extensionspeople provide.A transform coder consists of 3 basic steps.1) an invertible transform is performed on an image2) the transform coefficients are quantized (discretized)3) the quantized coefficients are entropy codedThe entropy coding, quantization routines, and bit allocation are verygeneral-purpose.  They will work with a whole variety of transforms,including DCT's, wavelet packets, local trig bases, etc.  Moreover,they have been designed with the expectation that other features suchas zerotrees or perceptual weighting will be added later.Implementing more sophisticated coders such as those described inZ. Xiong, K. Ramchandran and M. T. Orchard, ``Wavelet Packets ImageCoding Using Space-frequency Quantization", Preprint, 1996 andZ. Xiong, K. Ramchandran and M. T. Orchard, "Space-frequencyQuantization for Wavelet Image Coding", to appear in IEEE Trans. ImageProcessing, 1997 (see http://www.ee.princeton.edu/~zx/articles.html)should be relatively easy to do given this code.  The currenttransform routine should be fairly straightforward to extend to perfomwavelet packet decompositions.The wavelet transform implements symmetrized boundaries and works forimages of (more or less) arbitrary sizes, as long as the aspect ratiois less than 2:1 (the aspect ratio limitation should bestraightforward to eliminate, but I haven't gotten around to it).  Forthe full details on how to perform such a transform, seeftp://ftp.c3.lanl.gov/pub/WSQ/documents/classify.ps.Z, "Classificationof nonexpansive symmetric extension transforms for multirate filterbanks," Chris Brislawn, Los Alamos Tech Report LA-UR-94-1747.  Alsosee ftp://ftp.c3.lanl.gov/pub/WSQ/tutorial/tutorial.cThe filters included with the wavelet transform include some of thebest known for image coding.  It includes the set from J. Villasenor,B. Belzer, J. Liao, "Wavelet Filter Evaluation for Image Compression."IEEE Transactions on Image Processing, Vol. 2, pp. 1053-1060, August1995 (http://synergy.icsl.ucla.edu/~ipl/papers.html).  There are a fewextra filters from Brislawn's code, a few Daubechies filters, and anew (unpublished) 18/10 filter that Villasenor's group has foundeffective.  I've also just added a 7/9 pair from J. E. Odegard andC. S. Burrus, "Smooth biorthogonal wavelets for applications in imagecompression," in Proceedings of DSP Workshop, Loen, Norway, September1996 (http://www-dsp.rice.edu/publications).  This pair yieldssuperior results to the standard Antonini pair for EZW on Barbara.<p>Two sets of quantizers are included.  The first set performs a uniformquantization and is fairly straightforward.  The second is an embeddedfamily of quantizers fully described in D. Taubman and A. Zakhor,"Multirate 3-D subband coding of video", IEEE Transactions on ImageProcessing, Vol 3, No. 5, Sept, 1994.  The quantizers are equivalentto those used in J. Shapiro, "Embedded image coding using zerotrees ofwavelet coefficients," IEEE Transactions on Signal Processing,Vol. 41, No. 12, pp. 3445--3462, Dec. 1993, but are coupled with amore effective entropy coding scheme.Two sets of adaptive entropy coding schemes are also included.  Thefirst performs histogram adaptation with escape codes.  The escapecodes keeps rare symbols from adding too much to the overall symbolcost during early stages of histogram adaptation (see _TextCompression_ by Bell, Cleary, and Witten for details).  The secondcoder is an embedded coder designed for use with the embeddedquantizer above (See Taubman and Zakhor for full details).  It adaptsvery quickly and is very effective.The arithmetic coder is based on an implementation of AlistairMoffat's linear time coding histogram (seehttp://www.cs.mu.oz.au/~alistair/papers.html).  The implementation iscourtesy of John Danskin, and the full distribution (most of which isincluded here) may be obtained from http://www.cs.dartmouth.edu/~jmd.The bit allocation routines are based on integer programmingalgorithms described in Y. Shoham and A. Gersho, "Efficient bitallocation for an arbitrary set of quantizers," IEEE Transactions onAcoustics, Speech, and Signal Processing, Vol. 36, No. 9,pp. 1445-1453, Sept 1988.  They provide optimal or near-optimalallocations for the quantizers included here.---------------------------------------------------------------------------Executables-----------encode		Code an image	     	Usage: encode [image][width][height][output][ratio]		        image: image to be compressed	     		width, height: width and height of image to be					compressed			output: name of compressed image	     		ratio:  target compression ratiodecode		Decode an image		Usage: decode [encoded image][decoded image]compare		Compare two pbm/pgm images.  Returns MSE, RMS error, and PSNR		Usage: compare [image 1][image 2][width][height]raw2pgm		Convert an image in raw/raster format to PGM		Usage: raw2pgm [raw image name][height][width][pgm image name]pgm2raw		Convert an image in pgm/pbm format to a raw/raster format		Usage: pgm2raw [pgm image name][raw image name]The source files in this directory can be broken up into several mainclasses, each pertaining to one of the above steps plus some globalstuff.Global Stuff------------encode.cc			Main encoding program -- puts together				all steps in the coding processdecode.cc			Main decoding program -- puts together				all steps in the decoding processcompare.cc	 		Useful utility for comparing imagespgm2raw.cc			Format conversion: pgm->rawraw2pgm.cc			Format conversion: raw->pgmglobal.cc, global.hh		Location of global functions and				definitionsTransform Step--------------The routines below take an image (in pbm/pgm format) and perform a 2-Dwavelet transform.image.cc, image.hh		Handles loading/saving raw, pbm imageswavelet.cc, wavelet.hh		Performs a wavelet transform on an				image.  Handles non-square images				(with aspect ratio < 2:1).  Uses				symmetric extension of boundaries for				symmetric filters and periodic				extension for asymmetric ones.filter.cc			Contains filter coefficients for				various wavelets.  Contains all				filters from J. Villasenor, B. Belzer,				J. Liao, "Wavelet Filter Evaluation				for Image Compression." IEEE				Transactions on Image Processing,				Vol. 2, pp. 1053-1060, August 1995				transform.cc, transform.hh	Breaks wavelet transformed images up				into subbands.  This makes				postprocessing more convenient and				also independent of the method of				transform (e.g. iterated filtering,				lifting, etc.) Quantization Step-----------------These routines take subsets of the transform coefficients (typically asubset corresponds to all coefficients in a given subband) and determineappropriate quantizer precisions for each subset.  Quantizerresolutions are chosen to minimize total quantization error subject toa constraint on the total number of bits required to store thequantized coefficients. coeffset.cc, coeffset.hh	Storage for different subsets of				coefficients.  Also stores total bit				cost (rate) and total distortion for				each quantizer resolution.metric.hh			Functions for determining total				quantization error.  The most common				error measure is squared distortion.quantizer.cc, quantizer.hh	Quantizes coefficients at various				resolutions.allocator.cc, allocator.hh	Uses a constrained optimization				procedure to determine quantizer				resolutions for each set of				coefficients.Entropy Coding Step-------------------entropy.cc, entropy.hh		High-level entropy coding routines.				Writes/reads coefficients.  Allows				use of adaptive histograms and				context-based coding.coder.cc, coder.hh		High-level I/O interface for entropy				coding routines above.  Also allows				efficient coding of individual bits				and arbitrarily sized integers. Arith.cc, Arith.h		Low-level arithmetic coding routines.				Based on an implementation of Alistair				Moffat's linear time coding histogram				by John Danskin.  The full				distribution may be obtained from 				(http://www.cs.dartmouth.edu/~jmd)iHisto.cc, iHisto.h		More low-level arithmetic coding				routines from the above package.IntCoding.cc, IntCoding.hh	Integer coding routines for coder.cc				courtesy of John Danskin.BitIO.cc, BitIO.h		Bit-level I/O routines from John's				package.Wish list for future improvements (send me your code!)------------------------------------------------------* Modify wavelet.cc and transform.cc to handle images with different  aspect ratios (shouldn't be too hard).* Add support for color!  The easiest way to do this would be to take  an RGB image and to transform it to something like YIQ or HUV and  then code each layer separately.* Upgrade image.cc to support more image formats.  Read/write .tiff's,   .gif's, etc.* Add Lloyd-Max scalar quantizers.* Add trellis coding to reduce quantization errors.* Add zerotrees.* Add an 8x8 block DCT so that the code can be modified to do JPEG.* Add an 8x8 block DCT with folding. * Implement wavelet transform via lifting to improve speed.* Fix arithmetic coder so can switch back & forth between coding and  writing ints/bits.* Upgrade the dequantize routines, the entropy coder, and the  arithmetic coder so that the code can handle truncated bitstreams.
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -