ICA is used to classify text as an extension of the latent semantic indexing (LSI) framework. ICA has been shown to align well with the context grouping structure in a human sense [1], so it can be used for unsupervised classification. The demonstration shows this on medical abstracts (the MED dataset), using BIC to estimate the number of classes and producing keywords for each class. The icaML algorithm is used.
Tags: ICA extension framework classify
Upload date: 2013-12-22
Uploaded by: himbly
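A latent semantic analysis (LSA) algorithm for semantic analysis of text, including detailed function descriptions and an analysis of the underlying principles.
Tags: Analysis Semantic Latent LSA
Upload date: 2016-10-05
Uploaded by: qilin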
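For readers unfamiliar with LSA, the core idea can be sketched as a truncated SVD of a term-document matrix. This is a minimal sketch, not the code in the package above: the toy corpus, the function names (`term_document_matrix`, `lsa`, `cosine_similarity`), and the choice of raw term counts with `k=2` are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical toy corpus (not the data shipped with the package).
docs = [
    "heart attack blood pressure patient",
    "blood pressure heart disease",
    "neural network training data",
    "training data neural model",
]


def term_document_matrix(docs):
    """Build a raw-count term-document matrix (rows: terms, columns: documents)."""
    vocab = sorted({w for d in docs for w in d.split()})
    index = {w: i for i, w in enumerate(vocab)}
    X = np.zeros((len(vocab), len(docs)))
    for j, d in enumerate(docs):
        for w in d.split():
            X[index[w], j] += 1.0
    return X, vocab


def lsa(X, k):
    """Rank-k LSA: keep the k largest singular triplets of X."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    term_vecs = U[:, :k] * s[:k]     # one k-dimensional row per term
    doc_vecs = Vt[:k, :].T * s[:k]   # one k-dimensional row per document
    return term_vecs, doc_vecs


def cosine_similarity(A):
    """Pairwise cosine similarity between the rows of A."""
    unit = A / np.linalg.norm(A, axis=1, keepdims=True)
    return unit @ unit.T


X, vocab = term_document_matrix(docs)
term_vecs, doc_vecs = lsa(X, k=2)
sim = cosine_similarity(doc_vecs)
```

In this toy setup the two medical documents end up close to each other in the latent space and far from the two machine-learning documents. Real corpora usually call for a weighting such as TF-IDF and a larger `k`.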
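A Matlab implementation of pLSA (Probabilistic Latent Semantic Analysis) for text analysis, including test data and an introduction to the principles of the algorithm. It can also be used for image analysis.
Tags: Probability Analysis Semantic Latent
Upload date: 2013-12-03
Uploaded by: chens000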
A C implementation of latent Dirichlet allocation (LDA). LDA is a classic ML algorithm used mainly for text classification.
Tags: allocation dirichlet Latent code
Upload date: 2014-11-26
Uploaded by: cx111111