ics 180, april 22, 1997.htm

来自「介绍各种经典算法的代码。说明详细」· HTM 代码 · 共 140 行
HTM
140 行
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<!-- saved from url=(0049)http://www.ics.uci.edu/~eppstein/180a/970422.html -->
<HTML><HEAD><TITLE>ICS 180, April 22, 1997</TITLE>
<META content="text/html; charset=gb2312" http-equiv=Content-Type>
<META name=Owner value="eppstein">
<META name=Reply-To value="eppstein@ics.uci.edu">
<META content="MSHTML 5.00.2614.3500" name=GENERATOR></HEAD>
<BODY><IMG alt="" height=72 src="ICS 180, April 22, 1997.files/icslogo2.gif" 
width=472>
<P><A href="http://www.ics.uci.edu/~eppstein/180a/index.html">
<H1>ICS 180A, Spring 1997:<BR>Strategy and board game programming</H1></A>
<H2>Lecture notes for April 22, 1997<BR>Alpha-Beta Search</H2>
<H3>Shallow Pruning</H3>Suppose you're doing a minimax search (as we described 
last time) of the following tree: 
<P>
<CENTER><IMG src="ICS 180, April 22, 1997.files/shallow-prune.gif"></CENTER>
<P>You've searched F, and found its children's evaluations to be 11, 12, 7, and 
9; at this level of search, the first player is to move, and we expect him to 
choose the best of these values, 12. So, the minimax value of F is 12. 
<P>Now, you start searching G, and its first child returns a value of 15. When 
this happens, you know because of it that G's value will be at least 15, 
possibly even higher (if another of the children of G is even better). What this 
implies is that we don't expect the second player to move to G; from the second 
player's point of view, F's value of 12 is always better than G's value of 15 or 
higher. So, we know that G is not on the principal variation. We can 
<I>prune</I> the remaining children of G, not evaluate them, and return 
immediately from searching G, since any further work evaluating descendants of G 
would just be wasted. 
<P>In general, we can prune like we did in node G when one of its children 
returns a value better (from the point of view of the player whose turn it is at 
node G) than the previously computed evaluation of one of G's siblings. 
<H3>Deep Pruning</H3>
<P>Other more complicated forms of pruning are possible as well. For example, 
suppose in the same search tree that we've evaluated all of G, H, and I to be 
better than 12, so that 12 is the total evaluation at node B. Now, we search 
node C, and two levels down, see an evaluation of 10 at node N: 
<P>
<CENTER><IMG src="ICS 180, April 22, 1997.files/deep-prune.gif"></CENTER>
<P>We can use a more complicated line of reasoning to prune again. We know that 
N will return a 10 or smaller (the second player is to move, and wants to choose 
small numbers). We don't know whether this value of 10 or smaller will be 
returned at J as well, or whether one of the other children of J will be better. 
If a value 10 or smaller is returned from J to C, we can prune at C because it 
has a better sibling (B). So, in this case, further exploration of the children 
of N is pointless. The other case is that some other child of J returns a better 
value than 10; but again, in this case, further exploration of N is pointless. 
So as soon as we see 10, we can safely return from N. 
<H3>Alpha-Beta Pseudocode</H3>
<P>In general, when a returned value is better than the value of a sibling an 
even number of levels up in the tree, we can return immediately. If we pass the 
minimum value of any of these siblings in as a parameter <I>beta</I> to the 
search, we can do this pruning very efficiently. We also use another parameter 
<I>alpha</I> to keep track of the siblings at odd levels of the tree. Pruning 
using these two values is very simple; code to do so is listed below. Like last 
time, we use the <I>negamax</I> formulation, in which evaluations at alternate 
levels of the trees are negated. <PRE>double alphabeta(int depth, double alpha, double beta)
{
    if (depth &lt;= 0 || game is over) return evaluation();
    generate and sort list of moves available in the position
    for (each move m) {
        make move m;
        double val = -alphabeta(depth - 1, -beta, -alpha);
        unmake move m;
        if (val &gt;= beta) return val;
        if (val &gt; alpha) alpha = val;
    }
    return alpha;
}
</PRE>We'll explain Thursday what the sorting step is and why it's important. 
<H3>Aspiration Search</H3>What should we supply as the initial values of alpha 
and beta at the root of the tree? 
<P>Alpha and beta define an interval of the real number line (alpha,beta) of the 
evaluations we consider <I>interesting</I>. If a value is greater than beta we 
prune and immediately return, because we know it's not part of the principal 
variation; we don't really care about the exact value, only that it's greater 
than beta. If a value is less than alpha, we don't prune, but we still don't 
consider it interesting, because we know there's a better move somewhere else in 
the tree. 
<P>But at the root of the tree, we don't know what range of evaluation values 
are likely to be interesting, and if we want to be sure of not accidentally 
pruning something important, we have to just set alpha=-infinity and 
beta=infinity. 
<P>However, especially if we are using iterated deepening, it is likely that we 
have a pretty good idea what the principal variation is going to look like. 
Suppose we guess that its value is going to be x (e.g., just let x be the value 
found when you previously searched to depth D-1), and let epsilon be a small 
number representing the amount by which we expect a depth-D search to vary from 
a depth-(D-1) search. We can then try calling alphabeta(D, x-epsilon, 
x+epsilon). Three different things can happen as a result: 
<OL>
  <LI>The search might return a value within the interval (x-epsilon, 
  x+epsilon). In this case, we know that it returned the correct value, and we 
  can safely choose the move in the search tree leading to a node with that 
  value. 
  <LI>The search might return a value v&nbsp;<U>&gt;</U>&nbsp;x+epsilon. In this 
  case, we know that the true search value is also <U>&gt;</U>&nbsp;x+epsilon, 
  but we don't know what it is (the correct principal variation might have been 
  pruned as soon as we saw some other move having a value greater than beta). We 
  have to adjust our guess x to be a higher value and try again (probably also 
  with a larger value of epsilon). This condition is known as a <I>fail 
  high</I>. 
  <LI>The search might return a value v&nbsp;<U>&lt;</U>&nbsp;x-epsilon. In this 
  case, we know that the true search value is also <U>&lt;</U>&nbsp;x-epsilon, 
  but we don't know what it is. We have to adjust our guess x to be a smaller 
  value and try again (probably also with a larger value of epsilon). This 
  condition is known as a <I>fail low</I>. </LI></OL>Even though it can fail in 
these two ways, using the aspiration search (having an initial interval 
(alpha,beta) smaller than (-infinity,infinity)) is usually an improvement 
overall, because it does so much more pruning. 
<H3>Analysis</H3>Let's do an analysis of alpha-beta search to see why it's such 
a useful idea. Unlike the usual kind of analysis of algorithms, we'll do a 
<I>best-case analysis</I>, in which we assume that alpha-beta prunes as often as 
it possibly can. We'll see next time what we need to do to make alpha-beta 
search behave in a way consistent with this analysis. In class I did this 
analysis only considering shallow cuts, but in these lecture notes I'll include 
deep cuts as well, because it makes the analysis much simpler. 
<P>In the best case, each node at depth D-1 will only examine one child at depth 
D before pruning, except that one node on the principal variation will not prune 
(if it did, the overall algorithm will end up failing high or failing low, which 
would certainly not be the best case). 
<P>At depth D-2, however, nobody can prune, because all the children returned 
values greater than or equal to the values of beta they were passed, which at 
depth D-2 are negated and become less than or equal to alpha. 
<P>Continuing up the tree, at depth D-3 everyone (except on the principal 
variation) prunes, and at depth D-4 nobody prunes, etc. 
<P>So, if the branching factor of the tree is B, the number of nodes increases 
by a factor of B at half the levels of tree, and stays pretty much constant 
(ignoring the principal variation) at the other half of the levels. So the total 
size of the part of the tree that gets searched ends up being roughly 
B<SUP>D/2</SUP>&nbsp;=&nbsp;sqrt(B)<SUP>D</SUP>. Effectively, alpha-beta search 
ends up reducing the branching factor to the square root of its original value, 
and lets one search twice as deeply. For this reason it is an essential part of 
any minimax-based game-playing program. 
<P>
<HR>
<A href="http://www.ics.uci.edu/~eppstein/">David Eppstein, <A 
href="http://www.ics.uci.edu/">Dept. Information &amp; Computer Science</A>, <A 
href="http://www.uci.edu/">UC Irvine</A>, Wednesday, 07-May-1997 14:57:17 PDT. 
</BODY></HTML>
ics 180, april 22, 1997.htm - 源码说明

本页面展示了「介绍各种经典算法的代码。说明详细」中的 ics 180, april 22, 1997.htm 源码文件，采用 HTM 编程语言编写，共 140 行代码。您可以在线阅读完整代码内容，也可以返回资源详情页下载完整源码包进行本地学习和开发。
虫虫下载站收录了大量与算法相关的技术资源，包括源代码、技术文档、电路图等，是电子工程师和嵌入式开发者的专业学习平台。
⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?