⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 lda.m

📁 使用变分EM方法实现的LDA文本模型 输入为文本即可 分解实现
💻 M
字号:
function [alpha,beta] = lda(d,k,emmax,demmax)% Latent Dirichlet Allocation, standard model.% Copyright (c) 2004 Daichi Mochihashi, all rights reserved.% $Id: lda.m,v 1.7 2004/11/08 12:31:46 dmochiha Exp $% [alpha,beta] = lda(d,k,[emmax,demmax])% d      : data of documents% k      : # of classes to assume% emmax  : # of maximum VB-EM iteration (default 100)% demmax : # of maximum VB-EM iteration for a document (default 20)if nargin < 4  demmax = 20;  if nargin < 3    emmax = 100;  endendn = length(d);l = features(d);beta = ones(l,k) / l;alpha = normalize(fliplr(sort(rand(1,k))));gammas = zeros(n,k);lik = 0;plik = lik;tic;fprintf(1,'number of documents      = %d\n', n);fprintf(1,'number of words          = %d\n', l);fprintf(1,'number of latent classes = %d\n', k);for j = 1:emmax  fprintf(1,'iteration %d/%d..\t',j,emmax);  % vb-estep  betas = zeros(l,k);  for i = 1:n    [gamma,q] = vbem(d{i},beta,alpha,demmax);    gammas(i,:) = gamma;    betas = accum_beta(betas,q,d{i});  end  % vb-mstep  alpha = newton_alpha(gammas);  beta = mnormalize(betas,1);  % converge?  lik = lda_lik(d,beta,gammas);  fprintf(1,'likelihood = %g\t',lik);  if (j > 1) && converged(lik,plik,1.0e-4)    if (j < 5)      fprintf(1,'\n');      [alpha,beta] = lda(d,k,emmax,demmax); % try again!      return;    end    fprintf(1,'\nconverged.\n');    return;  end  plik = lik;  % ETA  elapsed = toc;  fprintf(1,'ETA:%s (%d sec/step)\r', ...	  rtime(elapsed * (emmax / j  - 1)),round(elapsed / j));endfprintf(1,'\n');% $Id: lda.m,v 1.7 2004/11/08 12:31:46 dmochiha Exp $

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -