📄 classagreement.rd
字号:
\name{classAgreement}\alias{classAgreement}%- Also NEED an `\alias' for EACH other topic documented here.\title{Coefficients comparing classification agreement}\description{ \code{classAgreement()} computes several coefficents of agreement between the columns and rows of a 2-way contingency table.}\usage{classAgreement(tab, match.names=FALSE)}%- maybe also `usage' for other objects documented here.\arguments{ \item{tab}{A 2-dimensional contingency table.} \item{match.names}{Flag whether row and columns should be matched by name.}}\details{Suppose we want to compare two classifications summarized by thecontingency table \eqn{T=[t_{ij}]} where \eqn{i,j=1,\ldots,K} and \eqn{t_{ij}}denotes the number of data points which are in class \eqn{i} in thefirst partition and in class \eqn{j} in the second partition. If bothclassifications use the same labels, then obviously the twoclassification agree completely if only elements in the main diagonalof the table are non-zero. On the other hand, large off-diagonalelements correspond to smaller agreement between the twoclassifications. If \code{match.names} is \code{TRUE}, the class labelsas given by the row and column names are matched, i.e. only columns androws with the same dimnames are used for the computation.If the two classification do not use the same set of labels, or ifidentical labels can have different meaning (e.g., two outcomes ofcluster analysis on the same data set), then the situation is a littlebit more complicated. Let \eqn{A} denote the number of all pairs of datapoints which are either put into the same cluster by both partitions orput into different clusters by both partitions. Conversely, let \eqn{D}denote the number of all pairs of data points that are put into onecluster in one partition, but into different clusters by the otherpartition. Hence, the partitions disagree for all pairs \eqn{D} andagree for all pairs \eqn{A}. We can measure the agreement by the Randindex \eqn{A/(A+D)} which is invariant with respect to permutations ofthe columns or rows of \eqn{T}.Both indices have to be corrected for agreement by chance if the sizesof the classes are not uniform.}\value{ A list with components \item{diag}{Percentage of data points in the main diagonal of \code{tab}.} \item{kappa}{\code{diag} corrected for agreement by chance.} \item{rand}{Rand index.} \item{crand}{Rand index corrected for agreement by chance.}}\references{J.~Cohen. A coefficient of agreement for nominal scales.Educational and Psychological Measurement, 20, 37--46, 1960.Lawrence Hubert and Phipps Arabie. Comparing partitions.Journal of Classification, 2, 193--218, 1985.}\author{Friedrich Leisch}\seealso{\code{\link{matchClasses}}}\examples{## no class correlations: both kappa and crand almost zerog1 <- sample(1:5, size=1000, replace=TRUE)g2 <- sample(1:5, size=1000, replace=TRUE)tab <- table(g1, g2)classAgreement(tab)## let pairs (g1=1,g2=1) and (g1=3,g2=3) agree betterk <- sample(1:1000, size=200)g1[k] <- 1g2[k] <- 1k <- sample(1:1000, size=200)g1[k] <- 3g2[k] <- 3tab <- table(g1, g2)## both kappa and crand should be significantly larger than beforeclassAgreement(tab)}\keyword{category}
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -