952.txt

来自「This complete matlab for neural network」· 文本 代码 · 共 30 行

TXT
30
字号
发信人: fervvac (高远), 信区: DataMining
标  题: Re: 下一步该看些什么
发信站: 南京大学小百合站 (Mon Dec 17 19:40:12 2001), 站内信件

One EDBT paper (this year) on boundary finding for transactions from 
proxy log, but you will probably wait till the electronic edition is
avilable.

【 在 roamingo (漫步鸥) 的大作中提到: 】
: I think it's better to use a configurable variable when you are writing
: a sessionizor.  Then, you can find some reasonable interval by yourself.
: For my experiments, 10 - 30 minites are all fine. (This is the inactive
: period of a session, not the total time span of it.) 
: For the second question, there are many, depending on what kind of patterns
: you are going to find. For example:
:  * Association: find those pages that tend to be accessed togeter.
:  * Sequenatial analysis: find the frequent path.
:  * Markov chain model: predict the next access, often used to do prefetch.
:  * Clustering (usually the categorial value oriented method, like ROCK 
:    mentioned in Han's textbook):
:    - session clustering
:    - page clustering
:  * And some combinations of the above.
: 【 在 hwe (xiaohui) 的大作中提到: 】
: : 好像是30分钟


--
※ 来源:.南京大学小百合站 bbs.nju.edu.cn.[FROM: 饮水思源BBS]

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?