📄 arith_encoder_mpeg4.asm

📁 ADI出品的blackfin533芯片有关MPEG4的编码器的C语言程序
💻 ASM
📖 第 1 页 / 共 2 页
字号:
12 下一页
/*******************************************************************************
Copyright(c) 2000 - 2002 Analog Devices. All Rights Reserved.
Developed by Joint Development Software Application Team, IPDC, Bangalore, India
for Blackfin DSPs  ( Micro Signal Architecture 1.0 specification).

By using this module you agree to the terms of the Analog Devices License
Agreement for DSP Software. 
********************************************************************************
Module Name     : arith_encoder_mpeg4.asm
Label Name      : __arith_encoder_mpeg4
Version         :   1.3
Change History  :

                Version     Date          Author        Comments
                1.3         11/18/2002    Swarnalatha   Tested with VDSP++ 3.0
                                                        compiler 6.2.2 on 
                                                        ADSP-21535 Rev.0.2
                1.2         11/13/2002    Swarnalatha   Tested with VDSP++ 3.0
                                                        on ADSP-21535 Rev.0.2                                             
                1.1         03/20/2002    Raghavendra   Modified to match
                                                        silicon cycle count
                1.0         09/11/2001    Raghavendra   Original 


Description     : The  arithmetic coding works on the basis of recursive 
                  probability interval sub division. With each binary decision 
                  the current probability interval is subdivided into two 
                  subintervals and  code string is modified so that it points to
                  base of the probability subinterval. Arithmetic encoder uses 
                  different  probability tables for INTRA and NON-INTRA 
                  contexts. These tables contain the probabilities for a binary 
                  alpha pixel being equal to 0 for intra and inter shape coding.
                  The least probable symbol LPS is defined as the symbol with 
                  least probability. If both probabilities are equal to 
                  half (i.e 0x8000), then '0' symbol is considered as least 
                  probable.
                  In initialization lower bound L is set to zero and range 
                  register R is set to 0x7fffffff. In order to avoid start code 
                  emulation, 1's are stuffed into the bitstream whenever there 
                  are too many successive 0's. If first 3(MAX-HEADING) bits are 
                  0's then 1 is transmitted and after MAX_HEADING th 0. If 
                  10(MAX_MIDDLE) or more 0's are sent successively a 1 is 
                  inserted after the MAX_MIDDLE th 0. If the number of trailing 
                  0's is larger than 2(MAX_TRAILING) then a 1 is appended.
                  The range associated with least probable symbol(LPS) is simply
                  computed as R*pLPS. 
                  where R ->16 most significant bits of Range register 
                     pLPS -> probability of LPS symbol.
                  If R value is less than QUATER (i.e 0x40000000)then 
                  re-normalisation is performed. In this procedure both lower 
                  value L and range R is doubled till R is greater than QUATER.
          

                  The following structure is used : 
                  struct arcodec {
                  UInt L;              // lower bound
                  UInt R;              // code range
                  UInt V;              // current code value
                  UInt arpipe;
                  Int bits_to_follow;  // follow bit count
                  Int first_bit;       // flag to check first bit
                  Int nzeros;          // counter to count consecutive zeros
                  Int nonzero;
                  Int nzerosf;
                  Int extrabits;
                  Int mh;              // MAX_HEAD
                  Int mm;              // MAX_MIDDLE
                  Int mt;              // MAX_TRAIL
                  unsigned char *out;  // output array
                                 };typedef struct arcodec ArCoder;   

                  This structure is common for both encoder and decoder.
                                                                            
Assumption      : Input bits are stored in an unsigned character array and each 
                  bit is stored in one location. Similarly  each output bit is 
                  written to an unsigned character location.

Prototype       : void StartArCoder(ArCoder * coder, unsigned char *);
                  void arith_encoder_mpeg4(int bit,int C0, ArCcoder * coder);
                  void arith_encode_renormalise(ArCoder  *coder);
                  void arith_bitplusfollow(int bit, ArCoder *coder);
                  void arith_StopArCoder(ArCoder *coder);

Calling sequence: Arithetic encoder is initialised by calling _StartArCoder 
                  function. _arith_encoder_mpeg4 function is called for each 
                  context with a decision bit.
                  Finally _arith_StopArcoder function is called. Following C 
                  code explains the calling sequence.
                     
                  main()
                   { 
                      int i,j,C,D;
                      struct ArCoder coder;
                        :                                             
                        :
                        :
                        _StartArCoder(coder,&bit_output[0]) ;
                            // bit_output is address of output array 
                        for(i=0;i<NO_OF_PAIR;i++)
                         { 
                           j=arrayIn0_C[i];
                           C=intra_prob[j];
                            // probability of '0' fetched from table 
                           D=arrayIn0_D[i];
                            // input bit 
                           _arith_encoder_mpeg4(D,C,coder);
                         }
                        _arith_StopArCoder(coder);
                      :
                      :
                  }

Performance     :
            Code Size   :   
                            StartArCoder             :  46 bytes
                            arith_encoder_mpeg4      :  68 bytes
                            arith_encode_renormalise : 126 bytes 
                            arith_bitplusfollow      : 106 bytes
                            arith_StopArCoder        : 128 bytes

            Cycle count : 
                                          Best case           Worst case
                 StartArCoder          :     22                 22  cycles
                 arith_encoder_mpeg4   :     59                525  cycles 
                 arith_StopArCoder     :    245                245  cycles

As the cycle count depends on the data, the above mentioned cycle counts are for
Test case 1 in the C file arith_decoder_mpeg4.c

Cycle count for 512 input symbol which gives   2 bits output is 
34329 cycles(67.04 cycles per input symbol).
Cycle count for 512 input symbol which gives 471 bits output is 
65953 cycles(128.81 cycles per input symbol).
  
Reference       : Appendix G and Appendix F of MPEG4 Video Verification 
                  model(V16.0)
*******************************************************************************/
#define MAXHEADING_ER 3
#define MAXMIDDLE_ER 10
#define MAXTRAILING_ER 2

/******************************************************************************
Prototype  : void StartArCoder(ArCoder * coder, unsigned char *);

             In this procedure Lower register is set to zero. Range is set to 
             0x7fffffff and output array is initialized. Register mh, mm, mt are
             initialized to keep track of bit stuffing  in inital, middle and 
             trailing part of bitstream.

Registers used : R0-R3, P0.
*******************************************************************************/
.section           L1_code;
.global     __StartArCoder;
.align                   8;

__StartArCoder: 

    P0 = R0;                // Address of structure coder
    R0 = 0; 
    R2 = 0;                 // clear R0 and set R2 to 0x80000000
    BITSET(R2,31);
    R3 = 1;                  
    R2 = R2-R3(NS)||[P0] = R0;
                            // load zero to low range register 
    [P0+4] = R2;            // Set Range value to 0x7fffffff
    [P0+16] = R0;           // Clear number of  BITS-TO-FOLLOW
    [P0+20] = R3;           // set first_bit  flag
    [P0+28] = R0;           // clear non-zero register
    R3 = MAXHEADING_ER;
    R2 = MAXMIDDLE_ER;
    [P0+24] = R3;           // to keep track of number of zeros at beginning
    [P0+40] = R3;           // store coder->mm to MAX_HEADER value i.e 3
    [P0+44] = R2;           // to keep track of maximum zeros in middle of bit 
                            // stream
    R2 = MAXTRAILING_ER;
    [P0+48] = R2;           // store coder->mt as MAX_TRAILING value i.e 2
    [P0+52] = R1;           // address of output array
    RTS;
    NOP;
    
__StartArCoder.end: 
    

/******************************************************************************
Prototype : void arith_encoder_mpeg4(int bit,int C0, ArCcoder * coder);

            bit -> decision bit which is to be coded
            C0  -> probability of symbol '0'.
            coder -> address of structure variable ArCcoder.

In this procedure, probability of symbol '1' is calculated using probability of 
symbol '0' If probability of symbol '1' is greater then probability of symbol 
'0' then '0' is treated as LPS else '1' is treated as LPS. Range of LPS 
symbol(rLPS) is calculated by multiplying higher 16 bit of range register(R) and
probability of LPS (CLPS). If bit to be coded equals to LPS then value of lower 
register coder->L += coder->R-rLPS and new range value(coder->R) equals to range
of LPS(rLPS). Otherwise range is decremented by rLPS. If R value is less than 
QUATER (i.e 0x40000000)then re-normalisation is performed. In this  procedure 
both lower value L and range R is doubled till R is greater than QUATER.

Registers used : R0-R3, R5-R7, P0, P1.
*******************************************************************************/
.section               program;
.global        __arith_encoder_mpeg4;
.align                      8;
    
__arith_encoder_mpeg4:  

    P0 = R2;                // Address of structure coder
    [--SP] = (R7:5);        // Push R7:5
    P1 = R2;                // Duplicate the address of coder
    R2 = R0-R0(NS)||R6 = [P0++];
                            // fetch value of lower range register 
    BITSET(R2,16);          // set r2 = 0x8000
    R2 = R2-R1(NS)||R7 = [P0];
                            // R1 = probability of '0'.R2 = probability of'1' 
                            // and get Range value 
    CC = R2<R1;             // check if  probability of '0' is greater than  
                            // probability of '1'
    IF CC R1 = R2;          // probability of LPS (CLPS)
    R5 = R7>>16;            // Higher 16 bits of range register
    R3 = CC;                // R3 contains Least probable symbol
    R5 = R5.L*R1.L(FU);     // Range LPS(rLPS) =  R * CLPS
    R7 = R7-R5;             // Range = Range -range of LPS
    R2 = R6+R7;             // Lower value  +=  Range -range of LPS
    CC = R0 == R3;          // check if bit to be coded  == LPS
    IF CC R7 = R5;          // if true Range ==  range of LPS
    IF !CC R2 = R6;         // if false lower range  ==  previous lower range
    [P0--] = R7;            // store new Range value in R register
    [P0] = R2;              // store L register
    [--SP] = RETS;          // Push RETS register before calling a function
    R0 = P1;                // argument to function

    CALL __arith_encode_renormalise;
    RETS = [SP++];          // Pop RETS register
    (R7:5) = [SP++];        // Pop R7:5
    
    RTS;
    NOP;
 
__arith_encoder_mpeg4.end:  
       
/******************************************************************************
Prototype   :   void  arith_encode_renormalise(ArCoder  *coder);

Renormalization is done only if Range value (coder->R) is less than 0x40000000.
If lower value(coder->L) is greater than or equals to  HALF(0x80000000) then '1'
is written to bitstream and lower value is decremented by HALF(0x80000000). If 
sum of lower value and range value (coder->L +coder->R) is greater than HALF 
then '0' is written to bitstream, otherwise lower value (coder->L )is
12 下一页
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -