30.txt
来自「This complete matlab for neural network」· 文本 代码 · 共 45 行
TXT
45 行
发信人: roamingo (漫步鸥), 信区: DataMining
标 题: Re: 下一步该看些什么
发信站: 南京大学小百合站 (Fri Dec 21 13:38:52 2001), 站内信件
I happened to be running Apache under Linux, and it is also very easy to
customize the log file format under Apache. I have put the cookie
field in it, and it makes unique user identification very accurate.
Of course, you still have to use the timestamp field to perform session
identification. However, the cookie field will always change for those users
who turn the cookie acceptance option off in their browers. Those
log entries have to be discarded or treated differently.
If no cookie information is available, the IP field with optional
information such as "refer" and "user agent" can be used to do
session identification, more adaptive but not so accurate. Mobasher
and Cooley's work around 1998 have detailed discussion on this topic.
This way is also useful for carrying out experiments on some external
weblog data, such as the freely available http log from Berkeley CS server.
http://www.cs.berkeley.edu/logs/
Ronny Kohavi has proposed some new insights on the level of information
the web usage mining algorithm should be carried out. He think it is
better to do it at the E-Commerce application level, not the raw log file
level.
// sorry for my absence. I have just finished my thesis draft yesterday.
【 在 iamgufeng (古风) 的大作中提到: 】
: If you open the IIS MMC to configure your application, you will find many
: extended properties can be selected in W3C-Extended-Log-Format. Such as
: cs(cookie),cs(Refer),etc. These maybe facilitate many works. I don't know
: if other HTTP servers like Apache can log such info as cookie,etc. Since
: so many dynamic page languages adopt cookie to label and manage user session
: infomation. Perhaps there are a lot of other methods.
: 【 在 fervvac (高远) 的大作中提到: 】
: : One technical question: Is that session information available in the log?
: : If so, as you have put it, the exact session boundaris could be obtained
: : and this will nullify many works, :-)
--
Read digitally, save a tree.
※ 来源:.南京大学小百合站 bbs.nju.edu.cn.[FROM: 202.120.7.27]
⌨️ 快捷键说明
复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?