📄 page_226.html
字号:
<HTML> <HEAD> <!--SCRIPT LANGUAGE="JavaScript" SRC="http://a1835.g.akamai.net/f/1835/276/3h/www.netlibrary.com/include/js/dictionary_library.js"></SCRIPT> <SCRIPT LANGUAGE="JavaScript"> if (!opener){document.onkeyup=parent.turnBookPage;} </SCRIPT!--> <META HTTP-EQUIV="Cache-Control" CONTENT="no-cache"> <META HTTP-EQUIV="Pragma" CONTENT="no-cache"> <META HTTP-EQUIV="Expires" CONTENT="-1"><META http-equiv="Content-Type" content="text/html; charset=windows-1252"><SCRIPT>var PrevPage="Page_225";var NextPage="Page_227";var CurPage="Page_226";var PageOrder="235";</SCRIPT> <TITLE>Document</TITLE> </HEAD> <BODY BGCOLOR="#FFFFFF"><CENTER><TABLE BORDER=0 WIDTH=100% CELLPADDING=0><TR><TD ALIGN=CENTER> <TABLE BORDER=0 CELLPADDING=2 CELLSPACING=0 WIDTH=100%> <TR> <TD ALIGN=LEFT><A HREF='Page_225.html'>Previous</A></TD> <TD ALIGN=RIGHT><A HREF='Page_227.html'>Next</A></TD> </TR> </TABLE></TD></TR><TR><TD ALIGN=LEFT><P><A NAME='JUMPDEST_Page_226'/><A NAME='{7BD}'/><TABLE BORDER=0 CELLSPACING=0 CELLPADDING=0 WIDTH='100%'><TR><TD ALIGN=RIGHT><FONT FACE='Times New Roman, Times, Serif' SIZE=2 COLOR=#FF0000>Page 226</FONT></TD></TR></TABLE><A NAME='{7BE}'/><TABLE BORDER=0 CELLSPACING=0 CELLPADDING=0><TR> <TD ROWSPAN=5></TD> <TD COLSPAN=3 HEIGHT=12></TD> <TD ROWSPAN=5></TD></TR><TR> <TD COLSPAN=3></TD></TR><TR><TD></TD> <TD><FONT FACE='Times New Roman, Times, Serif' SIZE=3>ses and what type of customer attributes you need to capture for sales, marketing, strategic planning, etc. For example, do you want to mine your data on a daily basis, or will a weekly sample of 10 percent of your online transactions suffice? Keep in mind that irrelevant, redundant, or covariant variables in a database constitute noise to any data analysis and can weaken your data mining predictive models. The Common Log Format was not designed for data mining; it was designed to measure server traffic, and as such it contains a lot of redundant and useless information that can be discarded from the mining analysis. In some instances you will need to append or merge log or database information created from registration forms into a data set more suitable for inductive data analysis (mining).</FONT></TD><TD></TD></TR><TR> <TD COLSPAN=3></TD></TR><TR> <TD COLSPAN=3 HEIGHT=1></TD></TR></TABLE><A NAME='{7BF}'/><TABLE BORDER=0 CELLSPACING=0 CELLPADDING=0><TR> <TD ROWSPAN=5></TD> <TD COLSPAN=3 HEIGHT=12></TD> <TD ROWSPAN=5></TD></TR><TR> <TD COLSPAN=3></TD></TR><TR><TD></TD> <TD><FONT FACE='Times New Roman, Times, Serif' SIZE=3>Another issue you need to deal with is dirty data. You must fix inconsistent data whenever possible including deciding how to deal with missing values. Some data mining tools provide the options and utilities for fixing spotty databases and some do not, so be prepared to do some cleaning.</FONT></TD><TD></TD></TR><TR> <TD COLSPAN=3></TD></TR><TR> <TD COLSPAN=3 HEIGHT=1></TD></TR></TABLE><A NAME='{7C0}'/><TABLE BORDER=0 CELLSPACING=0 CELLPADDING=0><TR> <TD ROWSPAN=5></TD> <TD COLSPAN=3 HEIGHT=12></TD> <TD ROWSPAN=5></TD></TR><TR> <TD COLSPAN=3></TD></TR><TR><TD></TD> <TD><FONT FACE='Times New Roman, Times, Serif' SIZE=3>Lastly, consideration must be given on how you will represent your data. For example, how do you want to deal with categorical attributes such as gender or ZIP codes? Do you want to group them into classes? How do you want to deal with continuous value variables? Do you need to create ratios from them? Compounded by all of these issues is the fact that data from a website is highly dynamic. Keep in mind that these elements are not set in stone, and that their customization is the norm rather than the exception. The data components generated from your website are only the starting point in the data mining process, for their value increases exponentially when merged with your data warehouse and third party information.</FONT><FONT FACE='Times New Roman, Times, Serif' SIZE=3 COLOR=#FFFF00><!-- break --></FONT></TD><TD></TD></TR><TR> <TD COLSPAN=3></TD></TR><TR> <TD COLSPAN=3 HEIGHT=1></TD></TR></TABLE><A NAME='{7C1}'/></FORM></P></TD></TR></TABLE><P><FONT SIZE=0 COLOR=WHITE></CENTER><A NAME="bottom"> </A><!-- netLibrary.com Copyright Notice --> </BODY></HTML>
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -