⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 spam.rd

📁 这是核学习的一个基础软件包
💻 RD
字号:
\name{spam}\alias{spam}\title{Spam E-mail Database}\description{A data set collected at Hewlett-Packard Labs, that classifies 4601e-mails as spam or non-spam. In addition to this class label there are 57variables indicating the frequency of certain words and characters in thee-mail.}\usage{data(spam)}\format{A data frame with 4601 observations and 58 variables.The first 48 variables contain the frequency of the variable name(e.g., business) in the e-mail. If the variable name starts with num (e.g.,num650) the it indicates the frequency of the corresponding number (e.g., 650).The variables 49-54 indicate the frequency of the characters `;', `(', `[', `!',`\$', and `\#'. The variables 55-57 contain the average, longest and total run-length of captial letters. Variable 58 indicates the type of themail and is either \code{"nonspam"} or \code{"spam"}, i.e. unsolicitedcommercial e-mail.}\details{The data set contains 2788 e-mails classified as \code{"nonspam"} and 1813classified as \code{"spam"}.The ``spam'' concept is diverse: advertisements for products/websites, make money fast schemes, chain letters, pornography...This collection of spam e-mails came from the collectors' postmaster andindividuals who had filed spam.  The collection of non-spame-mails came from filed work and personal e-mails, and hencethe word 'george' and the area code '650' are indicators ofnon-spam.  These are useful when constructing a personalizedspam filter.  One would either have to blind such non-spamindicators or get a very wide collection of non-spam togenerate a general purpose spam filter.}\source{\itemize{\item Creators: Mark Hopkins, Erik Reeber, George Forman, Jaap Suermondt atHewlett-Packard Labs, 1501 Page Mill Rd., Palo Alto, CA 94304\item Donor: George Forman (gforman at nospam hpl.hp.com)  650-857-7835}These data have been taken from the UCI Repository Of Machine LearningDatabases at \url{http://www.ics.uci.edu/~mlearn/MLRepository.html}}\references{T. Hastie, R. Tibshirani, J.H. Friedman. \emph{The Elements of StatisticalLearning.} Springer, 2001.}\keyword{datasets}

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -