demo_ep_usps.m

From "Applications of Gaussian Processes to Regression and Classification" · MATLAB (.m) code · 176 lines


% Demo script to illustrate use of binaryEPGP on a binary digit classification
% task. 2006-03-29.

if isempty(regexp(path,['gpml' pathsep]))
  cd ..; w = pwd; addpath([w, '/gpml']); cd gpml-demo    % add code dir to path
end

hold off
clear
clf
clc

disp('This demonstration illustrates the use of the Expectation Propagation')
disp('(EP) approximation for binary Gaussian process classification applied')
disp('to a digit task.')
disp(' ')

% stop early if the data loader from the usps_resampled archive is missing
if exist('loadBinaryUSPS') ~= 2
  disp('Error: can''t find the loadBinaryUSPS.m file. For this example, you')
  disp('need to download the usps_resampled archive. It is available at')
  disp('http://www.GaussianProcess.org/gpml/data')
  return
end

disp('The data consists of 16 by 16 grey scale images of handwritten digits')
disp('derived from the USPS data set. We will consider the binary')
disp('classification task of separating 3''s from 5''s. The training set')
disp('contains 767 cases and the test set 773 cases. Here is an example')
disp('image of a digit 3.');
disp(' ');

% x, y: training inputs and targets; xx, yy: test inputs and targets
disp('  [x y xx yy] = loadBinaryUSPS(3, 5);')
[x y xx yy] = loadBinaryUSPS(3, 5);
disp('  imagesc(reshape(x(3,:),16,16)''), colormap(''gray'')')
imagesc(reshape(x(3,:),16,16)'), colormap('gray')
disp(' ')
disp('Press any key to continue.')
disp(' ')
pause

disp('We must specify a covariance function. The demonstration uses the')
disp('squared exponential (SE) covariance function but many other covariance')
disp('functions are supported as well. The SE covariance function has two')
disp('parameters: a log length-scale parameter and a log magnitude parameter.')
disp('As an initial guess for the parameters, we set the log length-scale to')
disp('the log of the average pairwise distance between training points,')
disp('roughly log(22)=3, and the magnitude is set to unity, i.e. its log to 0.')
disp('Other initial choices could be reasonable too.')
disp(' ');
disp('We then call the binaryEPGP function, which constructs the EP')
disp('approximation of the posterior over functions based on the training set')
disp('and produces probabilistic predictions for the test cases. This may')
disp('take a few minutes or so... depending on whether you compiled the mex')
disp('files... ')
disp(' ')

disp('  loghyper = [3.0; 0.0];   % set the log hyperparameters')
loghyper = [3.0; 0.0];   % set the log hyperparameters
disp('  p = binaryEPGP(loghyper, ''covSEiso'', x, y, xx);')
p = binaryEPGP(loghyper, 'covSEiso', x, y, xx);
disp(' ')

% plot the predictive probabilities together with the 0.5 decision threshold
disp('  plot(p,''.'')');
plot(p,'.')
disp('  hold on');
hold on
disp('  plot([1 length(p)],[0.5 0.5],''r'')');
plot([1 length(p)],[0.5 0.5],'r')
xlabel('test case number')
ylabel('predictive probability')
axis([0 length(p) 0 1])
disp(' ')
disp('Press any key to continue.')
disp(' ')
pause

disp('Keep in mind that the test cases are ordered according to their')
disp('target class. Notice that there are misclassifications, but there are')
disp('no very confident misclassifications. The number of test set errors')
disp('(out of 773 test cases) when thresholding the predictive probability at')
disp('0.5 and the average amount of information about the test set labels in')
disp('excess of a 50/50 model in bits are given by:')
disp(' ')
disp('  sum((p>0.5)~=(yy>0))')
sum((p>0.5)~=(yy>0))
disp('  mean((yy==1).*log2(p)+(yy==-1).*log2(1-p))+1')
mean((yy==1).*log2(p)+(yy==-1).*log2(1-p))+1
disp(' ')
disp('Press any key to continue.')
disp(' ')
pause

disp('These results were obtained by simply guessing some values for the')
disp('hyperparameters. We can instead optimize the marginal likelihood on')
disp('the training set w.r.t. the hyperparameters. The current values');
disp('of the log hyperparameters (2 numbers), and the initial value')
disp('of the negative log marginal likelihood are:')
disp(' ')
disp('  [loghyper'' binaryEPGP(loghyper, ''covSEiso'', x, y)]')
[loghyper' binaryEPGP(loghyper, 'covSEiso', x, y)]
disp(' ')
disp('Press any key to continue.')
disp(' ')
pause

disp('Now minimize the negative log marginal likelihood w.r.t. the')
disp('hyperparameters, starting at the current values of loghyper. The third')
disp('argument, -20, tells minimize to evaluate the function a maximum of 20')
disp('times... WARNING: this may take 30 minutes or so... depending on your')
disp('machine and whether you compiled the mex files... press ''ctrl-C'' to')
disp('abort now, otherwise...')
disp(' ')
disp('Press any key to continue.')
disp(' ')
pause

disp('  [newloghyper logmarglik] = minimize(loghyper, ''binaryEPGP'', -20, ''covSEiso'', x, y);')
[newloghyper logmarglik] = minimize(loghyper, 'binaryEPGP', -20, 'covSEiso', x, y);
disp('  [newloghyper'' logmarglik(end)]')
[newloghyper' logmarglik(end)]
disp(' ')
disp('This shows that the log marginal likelihood was increased from -222 to')
disp('-90 by optimizing the hyperparameters. This means that the marginal')
disp('likelihood has increased by a factor of exp(222-90) = 2e+57.')
disp(' ')
disp('Press any key to continue.')
disp(' ')
pause

disp('Finally, we can make test set predictions with the new hyperparameters:')
disp(' ')
disp('  pp = binaryEPGP(newloghyper, ''covSEiso'', x, y, xx);')
pp = binaryEPGP(newloghyper, 'covSEiso', x, y, xx);
disp('  plot(pp,''g.'')');
plot(pp,'g.')
disp(' ')
disp('We note that the new predictions (in green) take much more extreme')
disp('values than the old ones (in blue).')
disp(' ')
disp('Press any key to continue.')
disp(' ')
pause

disp('The number of test set errors (out of 773 test cases) when')
disp('thresholding the predictive probability at 0.5 and the average amount')
disp('of information about the test set labels in excess of a 50/50 model')
disp('in bits are given by:')
disp(' ')
disp('  sum((pp>0.5)~=(yy>0))')
sum((pp>0.5)~=(yy>0))
disp('  mean((yy==1).*log2(pp)+(yy==-1).*log2(1-pp))+1')
mean((yy==1).*log2(pp)+(yy==-1).*log2(1-pp))+1
disp(' ')
disp('showing that the misclassification rate has dropped and the information')
disp('about the test target labels has increased compared to using the old')
disp('initially guessed values for the hyperparameters.')
disp(' ')
disp('Press any key to exit.')
disp(' ')
pause
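
% ---------------------------------------------------------------------------
% Supplementary sketch, not part of the original demo. It evaluates the two
% test-set scores used above on a tiny made-up example so the formulas can be
% checked by hand. The names p_toy, y_toy, p_half and the values in them are
% hypothetical illustrations, not USPS results, and nothing here calls the
% gpml code.
p_toy = [0.9; 0.2; 0.6; 0.4];     % assumed predictive probabilities of class +1
y_toy = [1; -1; -1; 1];           % assumed true labels in {-1,+1}

% number of errors when thresholding at 0.5; cases 3 and 4 are on the wrong
% side of the threshold, so this gives 2
n_err_toy = sum((p_toy>0.5)~=(y_toy>0))

% average information in bits in excess of a 50/50 model: the mean log2
% probability assigned to the true label, plus 1 bit
info_toy = mean((y_toy==1).*log2(p_toy)+(y_toy==-1).*log2(1-p_toy))+1

% a model that always predicts 0.5 assigns log2(0.5) = -1 to every label, so
% it scores exactly 0 bits, which is why the "+1" offset is used
p_half = 0.5*ones(size(y_toy));
info_half = mean((y_toy==1).*log2(p_half)+(y_toy==-1).*log2(1-p_half))+1

% ratio of marginal likelihoods implied by log values of -222 and -90
lik_ratio = exp(222-90)

% Note: a predictive probability of exactly 0 or 1 would make the information
% score NaN (0 times -Inf), so clip such values first if they can occur.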
