⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 tree.html

📁 A garbage collector for C and C
💻 HTML
字号:
<HTML><HEAD>    <TITLE>  Two-Level Tree Structure for Fast Pointer Lookup</TITLE>    <AUTHOR> Hans-J. Boehm, Silicon Graphics (now at HP)</author></HEAD><BODY><H1>Two-Level Tree Structure for Fast Pointer Lookup</h1><P>The conservative garbage collector described<A HREF="http://www.hpl.hp.com/personal/Hans_Boehm/gc/">here</a>uses a 2-level treedata structure to aid in fast pointer identification.This data structure is described in a bit more detail here, since<OL><LI> Variations of the data structure are more generally useful.<LI> It appears to be hard to understand by reading the code.<LI> Some other collectors appear to use inferior data structures tosolve the same problem.<LI> It is central to fast collector operation.</ol>A candidate pointer is divided into three sections, the <I>high</i>,<I>middle</i>, and <I>low</i> bits.  The exact division between thesethree groups of bits is dependent on the detailed collector configuration.<P>The high and middle bits are used to look up an entry in the table describedhere.  The resulting table entry consists of either a block descriptor(<TT>struct hblkhdr *</tt> or <TT>hdr *</tt>)identifying the layout of objects in the block, or an indication that thisaddress range corresponds to the middle of a large block, together with ahint for locating the actual block descriptor.  Such a hint consistof a displacement that can be subtracted from the middle bits of the candidatepointer without leaving the object.<P>In either case, the block descriptor (<TT>struct hblkhdr</tt>)refers to a table of object starting addresses (the <TT>hb_map</tt> field).The starting address table is indexed by the low bits if the candidate pointer.The resulting entry contains a displacement to the beginning of the object,or an indication that this cannot be a valid object pointer.(If all interior pointer are recognized, pointers into large objectsare handled specially, as appropriate.)<H2>The Tree</h2><P>The rest of this discussion focuses on the two level data structureused to map the high and middle bits to the block descriptor.<P>The high bits are used as an index into the <TT>GC_top_index</tt> (really<TT>GC_arrays._top_index</tt>) array.  Each entry points to a<TT>bottom_index</tt> data structure.  This structure in turn consistsmostly of an array <TT>index</tt> indexed by the middle bits ofthe candidate pointer.  The <TT>index</tt> array contains the actual<TT>hdr</tt> pointers. <P>Thus a pointer lookup consists primarily of a handful of memory references,and can be quite fast:<OL><LI> The appropriate <TT>bottom_index</tt> pointer is looked up in<TT>GC_top_index</tt>, based on the high bits of the candidate pointer.<LI> The appropriate <TT>hdr</tt> pointer is looked up in the<TT>bottom_index</tt> structure, based on the middle bits.<LI> The block layout map pointer is retrieved from the <TT>hdr</tt>structure.  (This memory reference is necessary since we try to shareblock layout maps.)<LI> The displacement to the beginning of the object is retrieved from theabove map.</ol><P>In order to conserve space, not all <TT>GC_top_index</tt> entries in factpoint to distinct <TT>bottom_index</tt> structures.  If no address withthe corresponding high bits is part of the heap, then the entry pointsto <TT>GC_all_nils</tt>, a single <TT>bottom_index</tt> structure consistingonly of NULL <TT>hdr</tt> pointers.<P><TT>Bottom_index</tt> structures contain slightly more information thanjust <TT>hdr</tt> pointers.  The <TT>asc_link</tt> field is used to linkall <TT>bottom_index</tt> structures in ascending order for fast traversal.This list is pointed to be <TT>GC_all_bottom_indices</tt>.It is maintained with the aid of <TT>key</tt> field that contains thehigh bits corresponding to the <TT>bottom_index</tt>.<H2>64 bit addresses</h2><P>In the case of 64 bit addresses, this picture is complicated slightlyby the fact that one of the index structures would have to be huge tocover the entire address space with a two level tree.  We deal with thisby turning <TT>GC_top_index</tt> into a chained hash table, instead ofa simple array.  This adds a <TT>hash_link</tt> field to the<TT>bottom_index</tt> structure.<P>The "hash function" consists of dropping the high bits.  This is cheap tocompute, and guarantees that there will be no collisions if the heapis contiguous and not excessively large.<H2>A picture</h2><P>The following is an ASCII diagram of the data structure.This was contributed by Dave Barrett several years ago.<PRE>		Data Structure used by GC_base in gc3.7:			      21-Apr-94			 			    63                  LOG_TOP_SZ[11]  LOG_BOTTOM_SZ[10]   LOG_HBLKSIZE[13]   +------------------+----------------+------------------+------------------+ p:|                  |   TL_HASH(hi)  |                  |   HBLKDISPL(p)   |   +------------------+----------------+------------------+------------------+    \-----------------------HBLKPTR(p)-------------------/    \------------hi-------------------/                       \______ ________/ \________ _______/ \________ _______/                             V                   V                  V                             |                   |                  |           GC_top_index[]    |                   |                  |  ---      +--------------+   |                   |                  |    ^       |              |   |                   |                  |     |       |              |   |                   |                  |    TOP      +--------------+<--+                   |                  |       _SZ   +-<|      []      | *                     |                  |     (items)|  +--------------+  if 0 < bi< HBLKSIZE  |                  |      |    |  |              | then large object     |                  |      |    |  |              | starts at the bi'th   |                  |      v    |  |              | HBLK before p.        |             i    |     ---   |  +--------------+                       |          (word-  |           v                                         |         aligned) |       bi= |GET_BI(p){->hash_link}->key==hi          |                  |          v                                         |                  |           |   (bottom_index)  \ scratch_alloc'd     |                  |           |   ( struct  bi )  / by get_index()      |                  |     ---   +->+--------------+                       |                  |      ^       |              |                       |                  |  ^       |              |                       |                  | BOTTOM   |              |   ha=GET_HDR_ADDR(p)  |                  |_SZ(items)+--------------+<----------------------+          +-------+  |   +--<|   index[]    |                                  |           |   |   +--------------+                      GC_obj_map: v                |   |   |              |              from      / +-+-+-----+-+-+-+-+  ---   v   |   |              |              GC_add   < 0| | |     | | | | |   ^   ---  |   +--------------+             _map_entry \ +-+-+-----+-+-+-+-+   |        |   |   asc_link   |                          +-+-+-----+-+-+-+-+ MAXOBJSZ      |   +--------------+                      +-->| | |  j  | | | | |  +1         |   |     key      |                      |   +-+-+-----+-+-+-+-+   |        |   +--------------+                      |   +-+-+-----+-+-+-+-+   |       |   |  hash_link   |                      |   | | |     | | | | |   v       |   +--------------+                      |   +-+-+-----+-+-+-+-+  ---      |                                         |   |<--MAX_OFFSET--->|         |                                         |         (bytes)HDR(p)| GC_find_header(p)                       |   |<--MAP_ENTRIES-->|       |                           \ from        |    =HBLKSIZE/WORDSZ         |    (hdr) (struct hblkhdr) / alloc_hdr() |    (1024 on Alpha)      +-->+----------------------+              |    (8/16 bits each)GET_HDR(p)| word   hb_sz (words) |              |                    +----------------------+              |               | struct hblk *hb_next |              |          +----------------------+              |                 |mark_proc hb_mark_proc|              |          +----------------------+              |          | char * hb_map        |>-------------+          +----------------------+                     | ushort hb_obj_kind   |                     +----------------------+                     |   hb_last_reclaimed  |            ---      +----------------------+                  ^       |                      | MARK_BITS|       hb_marks[]     | *if hdr is free, hb_sz + DISCARD_WORDS_SZ(words)|                      |  is the size of a heap chunk (struct hblk)  v       |                      |  of at least MININCR*HBLKSIZE bytes (below), ---      +----------------------+  otherwise, size of each object in chunk.Dynamic data structures above are interleaved throughout the heap in blocks of size MININCR * HBLKSIZE bytes as done by gc_scratch_alloc which cannot befreed; free lists are used (e.g. alloc_hdr).  HBLK's below are collected.	      (struct hblk)       ---      +----------------------+ < HBLKSIZE ---         ---          DISCARD_  ^       |garbage[DISCARD_WORDS]|   aligned   ^           ^ HDR_BYTES WORDS  |       |                      |             |           v (bytes)   (words)  |       +-----hb_body----------+ < WORDSZ    |          ---   ---     |       |                      |   aligned   |           ^     ^  |       |      Object 0        |             |           hb_sz |  |       |                      |           i |(word-    (words)|  |       |                      |      (bytes)|aligned)   v     |  |       + - - - - - - - - - - -+ ---         |          ---    |  |       |                      |  ^          |           ^     |  n *     |                      |  j (words)  |          hb_sz BODY_SZ  HBLKSIZE |      Object 1        |  v          v           |   (words) (bytes)  |                      |---------------          v   MAX_OFFSET  |       + - - - - - - - - - - -+                        ---  (bytes)  |       |                      | !All_INTERIOR_PTRS      ^     |  |       |                      | sets j only for       hb_sz   |  |       |      Object N        | valid object offsets.   |     |  v       |                      | All objects WORDSZ      v     v ---      +----------------------+ aligned.               ---   ---DISCARD_WORDS is normally zero.  Indeed the collector has not been testedwith another value in ages.</pre></body>

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -