📄 lda.m
字号:
function [alpha,beta] = lda(d,k,emmax,demmax)% Latent Dirichlet Allocation, standard model.% Copyright (c) 2004 Daichi Mochihashi, all rights reserved.% $Id: lda.m,v 1.7 2004/11/08 12:31:46 dmochiha Exp $% [alpha,beta] = lda(d,k,[emmax,demmax])% d : data of documents% k : # of classes to assume% emmax : # of maximum VB-EM iteration (default 100)% demmax : # of maximum VB-EM iteration for a document (default 20)if nargin < 4 demmax = 20; if nargin < 3 emmax = 100; endendn = length(d);l = features(d);beta = ones(l,k) / l;alpha = normalize(fliplr(sort(rand(1,k))));gammas = zeros(n,k);lik = 0;plik = lik;tic;fprintf(1,'number of documents = %d\n', n);fprintf(1,'number of words = %d\n', l);fprintf(1,'number of latent classes = %d\n', k);for j = 1:emmax fprintf(1,'iteration %d/%d..\t',j,emmax); % vb-estep betas = zeros(l,k); for i = 1:n [gamma,q] = vbem(d{i},beta,alpha,demmax); gammas(i,:) = gamma; betas = accum_beta(betas,q,d{i}); end % vb-mstep alpha = newton_alpha(gammas); beta = mnormalize(betas,1); % converge? lik = lda_lik(d,beta,gammas); fprintf(1,'likelihood = %g\t',lik); if (j > 1) && converged(lik,plik,1.0e-4) if (j < 5) fprintf(1,'\n'); [alpha,beta] = lda(d,k,emmax,demmax); % try again! return; end fprintf(1,'\nconverged.\n'); return; end plik = lik; % ETA elapsed = toc; fprintf(1,'ETA:%s (%d sec/step)\r', ... rtime(elapsed * (emmax / j - 1)),round(elapsed / j));endfprintf(1,'\n');% $Id: lda.m,v 1.7 2004/11/08 12:31:46 dmochiha Exp $
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -