function w = auclpm(x, C, rtype, par, unitnorm, usematlab)
%AUCLPM Find linear mapping with optimized AUC
%
%   W = AUCLPM(X, C, RTYPE, PAR)
%
% Optimize the AUC on dataset X with regularization parameter C. This is
% done by finding the weights W for which the ordering of the objects
% mapped onto the line defined by W is optimal. That means that objects
% from class +1 are always mapped above objects from the -1 class. This
% results in a constraint for each (+1,-1) pair of objects, so the number
% of constraints can become very large. The AUC constraints can be
% subsampled in different ways:
%
%   RTYPE    PAR
%   'full'   -    use all constraints
%   'subs'   N    subsample just N constraints
%   'subk'   k    subsample just k*#trainobj. constraints
%   'knn'    k    use only the k nearest neighbors
%   'xval'   N    subsample just N constraints and use the rest to
%                 optimize C (this version can be very slow)
%
%   W = AUCLPM(X, C, RTYPE, PAR, UNITNORM)
%
% By default the difference vectors are normalized to unit length. If you
% don't want that, set UNITNORM to 0.
%
% Default: RTYPE = 'subk'
%          PAR   = 1.0
%
% See also: createA

% Copyright: D.M.J. Tax, D.M.J.Tax@prtools.org
% Faculty EWI, Delft University of Technology
% P.O. Box 5031, 2600 GA Delft, The Netherlands

prtrace(mfilename);

if (nargin < 6)
	usematlab = 0;
end
if (nargin < 5)
	unitnorm = 1;
end
if (nargin < 4)
	par = 1.00;
end
if (nargin < 3)
	rtype = 'subk';
end
if (nargin < 2)
	prwarning(3,'Regularization parameter C set to 10');
	C = 10;
end

% define the correct name:
if unitnorm
	cl_name = sprintf('AUC-LP %s 1norm',rtype);
else
	cl_name = sprintf('AUC-LP %s',rtype);
end

if (nargin < 1) | (isempty(x))
	w = mapping(mfilename,{C,rtype,par,unitnorm,usematlab});
	w = setname(w,cl_name);
	return
end

if ~ismapping(C)   % train the mapping

	% Unpack the dataset.
	islabtype(x,'crisp');
	isvaldset(x,1,2);  % at least 1 object per class, 2 classes
	[n,k,c] = getsize(x);

	% Check some values:
	if par<=0
		error('Parameter ''par'' should be larger than zero');
	end
	if c > 2  % only two-class mapping:
		error('Only a two-class mapping is implemented.');
	end

	% first create the target values (+1 and -1):
	y = 2*getnlab(x)-3;
	% by this construction, the second class becomes the +1/target class:
	labl = getlablist(x);
	dim = size(x,2);
	tlab = labl(2,:);
	% makes the mapping much faster:
	X = +x; clear x;

	%---create A for optauc
	rstate = rand('state'); seed = 0;
	[A,Nxi,Aval] = createA(X,y,rtype,par,seed);
	rand('state',rstate);
	if unitnorm  % normalize the length of A:
		lA = sqrt(sum(A.*A,2));
		lenn0 = find(lA~=0);  % when labels are flipped, terrible
		                      % things can happen
		A(lenn0,:) = A(lenn0,:)./repmat(lA(lenn0,:),1,size(A,2));
	end
	% negative should be present for the constraints:
	A = [A -A];
	% also take care of the xi:
	A = [A -speye(Nxi)];

	%---create f
	% NO, do this later, maybe we want to optimize it!
	%f = [ones(2*k,1); repmat(C,Nxi,1)];

	%---generate b
	b = -ones(Nxi,1);  % no zeros, otherwise we get w=0;
	                   % the constraint is changed here to <=-1

	%---lower bound constraints
	lb = zeros(2*k+Nxi,1);

	% should we run over a range of Cs?
	if ~isempty(Aval)
		M = 25;
		xval = repmat(inf,M,1);
		C = logspace(-3,3,M);
		% run over all the Cs
		for i=1:length(C)
			%---create f again:
			f = [ones(2*k,1); repmat(C(i),Nxi,1)];
			%---solve linear program
			if (exist('glpkmex')>0) & ~usematlab
				[z,fmin,status]=glpkmex(1,f,A,b,repmat('U',Nxi,1),...
					lb,[],repmat('C',size(f,1),1));
			else
				opts = optimset('Display','off','LargeScale','on','Diagnostics','off');
				z = linprog(f,A,b,[],[],lb,[],[],opts);
			end
			constr = Aval*(z(1:k)-z(k+1:2*k));
			I = find(constr<-0);  % the number of violations
			if isempty(I)
				xval(i) = inf;
			else
				xval(i) = length(I)/size(constr,1);
			end
		end
		[minxval,mini] = max(xval);
		C = C(mini);
	end

	%---create f
	f = [ones(2*k,1); repmat(C,Nxi,1)];

	%---solve linear program
	if (exist('glpkmex')>0) & ~usematlab
		prwarning(7,'Use glpkmex');
		[z,fmin,status,xtra]=glpkmex(1,f,A,b,repmat('U',Nxi,1),...
			lb,[],repmat('C',size(f,1),1));
		alpha = xtra.lambda;
	else
		[z,fmin,exitflag,outp,alpha] = linprog(f,A,b,[],[],lb);
	end

	%---extract parameters
	u = z(1:k);             u = u(:);
	v = z(k+1:2*k);         v = v(:);
	zeta = z(2*k+1:2*k+Nxi); zeta = zeta(:);

	% now find out how sparse the result is:
	%nr = sum(beta>1e-6);
	rel = (abs(u-v)>0);
	nr = sum(rel);

	% and store the results:
	%W.wsc = wsc;
	W.u = u;  % the ultimate weights
	W.v = v;
	W.alpha = alpha;
	W.zeta = zeta;
	W.nr = nr;
	W.rel = rel;
	W.C = C;
	w = mapping(mfilename,'trained',W,tlab,dim,1);
	w = setname(w,sprintf('%s %s',cl_name,rtype));

else
	% Evaluate the classifier on new data:
	W = getdata(C);
	n = size(x,1);

	% linear classifier:
	out = x*(W.u-W.v);

	% and put it nicely in a prtools dataset:
	% (I am not really sure what I should output; I decided to give a 1D
	% output:)
	w = setdat(x,out,C);
end
return
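For readers without PRTools or glpkmex, the core linear program above can be sketched outside MATLAB. The variables are z = [u; v; xi] with w = u - v, the objective is sum(u) + sum(v) + C*sum(xi), and there is one constraint per (+1,-1) pair: (x_p - x_n)·w >= 1 - xi. Below is a minimal Python sketch using SciPy's `linprog`; the function name `auc_lp` and the dense 'full' pairing are illustrative choices here, not part of the toolbox, and the sketch omits the constraint subsampling and C cross-validation of the real code.

```python
import numpy as np
from scipy.optimize import linprog

def auc_lp(X, y, C=10.0, unitnorm=True):
    """Sketch of the AUC-LP mapping: find w so that every +1 object is
    ranked above every -1 object along the line X @ w.
    LP variables z = [u; v; xi] (all >= 0), with w = u - v."""
    pos = X[y == 1]
    neg = X[y == -1]
    # one difference vector per (positive, negative) pair ('full' case)
    D = (pos[:, None, :] - neg[None, :, :]).reshape(-1, X.shape[1])
    if unitnorm:
        # normalize difference vectors to unit length, as in the toolbox
        lens = np.linalg.norm(D, axis=1)
        D[lens > 0] /= lens[lens > 0, None]
    n_pairs, k = D.shape
    # constraints D(u - v) + xi >= 1, written as A_ub @ z <= -1;
    # z >= 0 is linprog's default bound
    A_ub = np.hstack([-D, D, -np.eye(n_pairs)])
    b_ub = -np.ones(n_pairs)
    # objective: L1 norm of w plus C times the slacks
    f = np.concatenate([np.ones(2 * k), np.full(n_pairs, C)])
    res = linprog(f, A_ub=A_ub, b_ub=b_ub, method="highs")
    z = res.x
    return z[:k] - z[k:2 * k]   # w = u - v
```

On linearly separable data the slacks go to zero and the resulting w ranks every positive object strictly above every negative one (AUC = 1); the L1 objective additionally pushes w toward a sparse weight vector, which is what the `nr`/`rel` bookkeeping in the MATLAB code measures.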