Baseline Wavelet Transform Coder Construction Kit
-------------------------------------------------
Version 0.3, 1/29/97
-------------------------------------------------
Geoff Davis
gdavis@cs.dartmouth.edu
http://www.cs.dartmouth.edu/~gdavis
-------------------------------------------------
Arithmetic coding library courtesy of
John Danskin
jmd@cs.dartmouth.edu
http://www.cs.dartmouth.edu/~jmd

PGM file load/save courtesy of
Ray Heasman
ray@rucus.ru.ac.za
-------------------------------------------------
Source is available from
http://www.cs.dartmouth.edu/~gdavis/wavelet/wavelet.html

NOTE: This is an alpha version of the code!  It isn't yet fully
documented.  Eventually I hope to have a tutorial to accompany this.
There are one or two known (minor) bugs.  Upgrades will be supplied on
the web at irregular intervals.  Caveat emptor.
-------------------------------------------------
Revision log

Version 0.3  1/29/97 -- fixed a bug in the allocator so that actual
                          rates are much closer to target rates
                        switched to binary files for i/o -- this
                          hopefully will fix problems people using
                          DOS/Windows95 were having (thanks to
                          Daniel Weng)
                        changed default for Real to float from double
                        fixed the copy constructor for WaveletTransform
                        plugged some memory leaks in the
                          WaveletTransform constructors (thanks to
                          Derek Ho)
                        added new filter set from J. Odegard and
                          S. Burrus
Version 0.2 11/22/96 -- plugged lots of memory leaks, fixed minor bugs
                          in encode.cc and decode.cc
                        replaced all references to doubles with Reals
                        added support for PGM files (thanks to
                          Ray Heasman)
                        added double precision coefficients from EPIC
                          (thanks to Eero Simoncelli)
Version 0.1 10/13/96 -- fixed a few compiler bugs plus a minor bug in
                          quantizer.cc
Version 0.0  9/12/96 -- initial version
-------------------------------------------------

This code implements a reasonably good wavelet transform-based image
coder for grayscale images.
The coder is not the most sophisticated -- it's a simple transform
coder -- but each individual piece of the transform coder has been
chosen for high performance.

Performance Data for 512x512 Lena image (default settings)

Target compression ratio   Actual ratio   PSNR (dB)   RMS error
------------------------   ------------   ---------   ---------
4:1                        4.02:1         43.71        1.66
8:1                        8.00:1         39.42        2.73
16:1                       16.01:1        36.18        3.96
32:1                       32.01:1        33.17        5.60
64:1                       64.08:1        30.22        7.86
128:1                      129.26:1       27.73       10.48

The code has been designed for experimentation.  It's very modular and
should allow for simple replacements of individual components.  One
can easily replace the quantizer, the entropy coder, and the wavelet
filters.

If you do modify/upgrade/replace sections of this code, I would very
much appreciate hearing about it.  I hope to make this construction
kit a collaborative effort with a whole range of modules supplied by
different researchers.  A wish list of future improvements is included
at the end of this file.  I will provide WWW links to any extensions
people provide.

A transform coder consists of 3 basic steps.

1) an invertible transform is performed on an image
2) the transform coefficients are quantized (discretized)
3) the quantized coefficients are entropy coded

The entropy coding, quantization routines, and bit allocation are very
general-purpose.  They will work with a whole variety of transforms,
including DCT's, wavelet packets, local trig bases, etc.  Moreover,
they have been designed with the expectation that other features such
as zerotrees or perceptual weighting will be added later.
Implementing more sophisticated coders such as those described in
Z. Xiong, K. Ramchandran and M. T. Orchard, "Wavelet Packets Image
Coding Using Space-frequency Quantization", Preprint, 1996 and
Z. Xiong, K. Ramchandran and M. T. Orchard, "Space-frequency
Quantization for Wavelet Image Coding", to appear in IEEE Trans.
Image Processing, 1997 (see http://www.ee.princeton.edu/~zx/articles.html)
should be relatively easy to do given this code.  The current
transform routine should be fairly straightforward to extend to perform
wavelet packet decompositions.

The wavelet transform implements symmetrized boundaries and works for
images of (more or less) arbitrary sizes, as long as the aspect ratio
is less than 2:1 (the aspect ratio limitation should be
straightforward to eliminate, but I haven't gotten around to it).  For
the full details on how to perform such a transform, see
ftp://ftp.c3.lanl.gov/pub/WSQ/documents/classify.ps.Z, "Classification
of nonexpansive symmetric extension transforms for multirate filter
banks," Chris Brislawn, Los Alamos Tech Report LA-UR-94-1747.  Also
see ftp://ftp.c3.lanl.gov/pub/WSQ/tutorial/tutorial.c

The filters included with the wavelet transform include some of the
best known for image coding.  They include the set from J. Villasenor,
B. Belzer, J. Liao, "Wavelet Filter Evaluation for Image Compression,"
IEEE Transactions on Image Processing, Vol. 4, pp. 1053-1060, August
1995 (http://synergy.icsl.ucla.edu/~ipl/papers.html).  There are a few
extra filters from Brislawn's code, a few Daubechies filters, and a
new (unpublished) 18/10 filter that Villasenor's group has found
effective.  I've also just added a 7/9 pair from J. E. Odegard and
C. S. Burrus, "Smooth biorthogonal wavelets for applications in image
compression," in Proceedings of DSP Workshop, Loen, Norway, September
1996 (http://www-dsp.rice.edu/publications).  This pair yields
better results than the standard Antonini pair for EZW on Barbara.

Two sets of quantizers are included.  The first set performs a uniform
quantization and is fairly straightforward.  The second is an embedded
family of quantizers fully described in D. Taubman and A. Zakhor,
"Multirate 3-D subband coding of video", IEEE Transactions on Image
Processing, Vol. 3, No. 5, Sept. 1994.  The quantizers are equivalent
to those used in J.
Shapiro, "Embedded image coding using zerotrees of
wavelet coefficients," IEEE Transactions on Signal Processing,
Vol. 41, No. 12, pp. 3445-3462, Dec. 1993, but are coupled with a
more effective entropy coding scheme.

Two sets of adaptive entropy coding schemes are also included.  The
first performs histogram adaptation with escape codes.  The escape
codes keep rare symbols from adding too much to the overall symbol
cost during early stages of histogram adaptation (see _Text
Compression_ by Bell, Cleary, and Witten for details).  The second
coder is an embedded coder designed for use with the embedded
quantizer above (see Taubman and Zakhor for full details).  It adapts
very quickly and is very effective.

The arithmetic coder is based on an implementation of Alistair
Moffat's linear time coding histogram (see
http://www.cs.mu.oz.au/~alistair/papers.html).  The implementation is
courtesy of John Danskin, and the full distribution (most of which is
included here) may be obtained from http://www.cs.dartmouth.edu/~jmd.

The bit allocation routines are based on integer programming
algorithms described in Y. Shoham and A. Gersho, "Efficient bit
allocation for an arbitrary set of quantizers," IEEE Transactions on
Acoustics, Speech, and Signal Processing, Vol. 36, No. 9,
pp. 1445-1453, Sept. 1988.  They provide optimal or near-optimal
allocations for the quantizers included here.

---------------------------------------------------------------------------

Executables
-----------

encode     Code an image
           Usage: encode [image][width][height][output][ratio]
             image: image to be compressed
             width, height: width and height of image to be compressed
             output: name of compressed image
             ratio: target compression ratio

decode     Decode an image
           Usage: decode [encoded image][decoded image]

compare    Compare two pbm/pgm images.
           Returns MSE, RMS error, and PSNR
           Usage: compare [image 1][image 2][width][height]

raw2pgm    Convert an image in raw/raster format to PGM
           Usage: raw2pgm [raw image name][height][width][pgm image name]

pgm2raw    Convert an image in pgm/pbm format to a raw/raster format
           Usage: pgm2raw [pgm image name][raw image name]

The source files in this directory can be broken up into several main
classes, each pertaining to one of the above steps plus some global
stuff.

Global Stuff
------------

encode.cc
  Main encoding program -- puts together all steps in the coding
  process

decode.cc
  Main decoding program -- puts together all steps in the decoding
  process

compare.cc
  Useful utility for comparing images

pgm2raw.cc
  Format conversion: pgm->raw

raw2pgm.cc
  Format conversion: raw->pgm

global.cc, global.hh
  Location of global functions and definitions

Transform Step
--------------

The routines below take an image (in pbm/pgm format) and perform a 2-D
wavelet transform.

image.cc, image.hh
  Handles loading/saving raw, pbm images

wavelet.cc, wavelet.hh
  Performs a wavelet transform on an image.  Handles non-square images
  (with aspect ratio < 2:1).  Uses symmetric extension of boundaries
  for symmetric filters and periodic extension for asymmetric ones.

filter.cc
  Contains filter coefficients for various wavelets.  Contains all
  filters from J. Villasenor, B. Belzer, J. Liao, "Wavelet Filter
  Evaluation for Image Compression," IEEE Transactions on Image
  Processing, Vol. 4, pp. 1053-1060, August 1995

transform.cc, transform.hh
  Breaks wavelet transformed images up into subbands.  This makes
  postprocessing more convenient and also independent of the method of
  transform (e.g. iterated filtering, lifting, etc.)

Quantization Step
-----------------

These routines take subsets of the transform coefficients (typically a
subset corresponds to all coefficients in a given subband) and determine
appropriate quantizer precisions for each subset.
Quantizer resolutions are chosen to minimize total quantization error
subject to a constraint on the total number of bits required to store
the quantized coefficients.

coeffset.cc, coeffset.hh
  Storage for different subsets of coefficients.  Also stores total
  bit cost (rate) and total distortion for each quantizer resolution.

metric.hh
  Functions for determining total quantization error.  The most common
  error measure is squared distortion.

quantizer.cc, quantizer.hh
  Quantizes coefficients at various resolutions.

allocator.cc, allocator.hh
  Uses a constrained optimization procedure to determine quantizer
  resolutions for each set of coefficients.

Entropy Coding Step
-------------------

entropy.cc, entropy.hh
  High-level entropy coding routines.  Writes/reads coefficients.
  Allows use of adaptive histograms and context-based coding.

coder.cc, coder.hh
  High-level I/O interface for entropy coding routines above.  Also
  allows efficient coding of individual bits and arbitrarily sized
  integers.

Arith.cc, Arith.h
  Low-level arithmetic coding routines.  Based on an implementation of
  Alistair Moffat's linear time coding histogram by John Danskin.  The
  full distribution may be obtained from
  http://www.cs.dartmouth.edu/~jmd

iHisto.cc, iHisto.h
  More low-level arithmetic coding routines from the above package.

IntCoding.cc, IntCoding.hh
  Integer coding routines for coder.cc courtesy of John Danskin.

BitIO.cc, BitIO.h
  Bit-level I/O routines from John's package.

Wish list for future improvements (send me your code!)
------------------------------------------------------

* Modify wavelet.cc and transform.cc to handle images with different
  aspect ratios (shouldn't be too hard).
* Add support for color!  The easiest way to do this would be to take
  an RGB image and to transform it to something like YIQ or HUV and
  then code each layer separately.
* Upgrade image.cc to support more image formats.
  Read/write .tiff's, .gif's, etc.
* Add Lloyd-Max scalar quantizers.
* Add trellis coding to reduce quantization errors.
* Add zerotrees.
* Add an 8x8 block DCT so that the code can be modified to do JPEG.
* Add an 8x8 block DCT with folding.
* Implement wavelet transform via lifting to improve speed.
* Fix arithmetic coder so that it can switch back & forth between
  coding and writing ints/bits.
* Upgrade the dequantize routines, the entropy coder, and the
  arithmetic coder so that the code can handle truncated bitstreams.
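
For reference, the MSE, RMS error, and PSNR figures that compare
reports (and that appear in the Lena performance table) are usually
defined as in the sketch below.  This is not taken from compare.cc:
the names ErrorStats and compareImages are hypothetical, and the peak
value of 255 assumes 8-bit grayscale samples.  The table entries are
consistent with that assumption, e.g. an RMS error of 2.73 gives
20*log10(255/2.73), or about 39.4 dB.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical container for the three error measures; not part of the kit.
struct ErrorStats {
    double mse;   // mean squared error per pixel
    double rms;   // square root of the MSE
    double psnr;  // peak signal-to-noise ratio in dB
};

// Compare two images stored as flat arrays of 8-bit samples.
// Assumption: both images have the same dimensions and 'peak' is the
// largest representable sample value (255 for 8-bit data).
ErrorStats compareImages(const std::vector<unsigned char>& a,
                         const std::vector<unsigned char>& b,
                         double peak = 255.0)
{
    assert(!a.empty() && a.size() == b.size());

    // Accumulate the squared pixel differences in double precision.
    double sum = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        const double d = static_cast<double>(a[i]) - static_cast<double>(b[i]);
        sum += d * d;
    }

    ErrorStats s;
    s.mse = sum / static_cast<double>(a.size());
    s.rms = std::sqrt(s.mse);
    // PSNR is unbounded for identical images, so report infinity there.
    s.psnr = (s.mse > 0.0)
                 ? 20.0 * std::log10(peak / s.rms)
                 : std::numeric_limits<double>::infinity();
    return s;
}
```

A caller would load the original and decoded images into two vectors
of equal length and read the three fields of the returned struct.
Accumulating in double keeps the sum exact for images of this size
regardless of whether Real is defined as float or double.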