.EN
.TH "RIPPER AUXILIARY PROGRAMS" 1
.SH SYNOPSIS
.PP
data2text \- convert feature-vector data to a single text-valued
field
.PP
eliminate \- eliminate obviously redundant conditions from a ruleset
.PP
filter-text \- perform feature selection on text, optionally converting
it to an attribute-value format
.PP
pair \- do a paired test
.PP
partition \- randomly subsample a dataset
.PP
pprint-rules \- print a ruleset
.PP
predict \- make predictions using a ripper-generated ruleset
.PP
rocchio \- strawman text categorizer
.PP
select-class \- remove all but two classes from a dataset
.PP
summarize \- summarize output of predict
.PP
test-rules \- test individual ripper rules on data
.PP
verify \- check a dataset for consistency
.SH DESCRIPTION
Miscellaneous programs to assist in evaluating the rulesets
produced by RIPPER. All of these programs will summarize
their legal options if invoked with a -h option.
Most of them will also read from standard input if invoked
with a -s option.
.SH PREDICT
.B predict [options] filestem
.PP
Uses the ruleset filestem.hyp to predict classes of the examples
in the file filestem.test.
(If the -s option is used then
examples are taken from standard input instead of filestem.test).
Output is a series of lines of the form
.PP
predicted-class p n actual-class
.PP
where p and n are the numbers of positive and negative examples
classified by the rule used to pick the predicted-class.
.SH SUMMARIZE
.B summarize [options] filestem
.PP
Summarize the predictions appearing on the standard input.
Reports the error rate, recall, precision, and a confusion matrix.
.SH VERIFY
.B verify [options] filestem
.PP
Do a consistency check on the dataset filestem.data.
.SH ELIMINATE
.B eliminate [options] filestem
.PP
Eliminate redundant conditions from the ruleset filestem.hyp.
.SH TEST-RULES
.B test-rules [options] filestem
.PP
Gives a rule-by-rule breakdown of error rates of the rules
in filestem.hyp on the examples in filestem.test.
.SH PAIR
.B pair predfile-1 predfile-2
.PP
Does a paired test of the classifiers used to generate the
predictions in the files predfile-1 and predfile-2. Predfile-1
and predfile-2 should be the outputs of predict. Outputs
the following information: number of "wins" (times the classifier
used for predfile-1 is right and the classifier of predfile-2
is wrong); number of "losses"; number of "ties" (times both
classifiers agree); an estimate of the probability of a win
given a disagreement, and the standard error of the estimate;
and how far the probability of winning is from 0.5 in
standard units (standard errors).
.PP
Standard errors are computed using a normal approximation
to the binomial distribution.
Generally, much more than 2 standard
units means that the difference is statistically significant.
.SH PARTITION
.B partition [-p q | -m N] [stem]
.PP
Chooses a random subset of stem.data. The size of the subset
is specified by either a percentage to retain (-p) or by
a subset size (-m).
.SH SELECT-CLASS
.B select-class -c C [stem]
.PP
For examples in stem.data, changes the class of any example
not in class C to 'default_class', and
writes the modified dataset to the standard output.
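.PP
The statistics described under PAIR can be sketched in Python as follows.
This is an illustration only, not the code used by pair: the function names
are hypothetical, a "tie" is assumed to mean that both classifiers are right
or both are wrong, and the standard error is assumed to be the usual
binomial estimate sqrt(p(1-p)/n) taken over the disagreements. The input
lines are assumed to follow the predict output format given above.

```python
import math
import sys


def parse_prediction(line):
    """Split one line of the form: predicted-class p n actual-class."""
    predicted, p, n, actual = line.split()
    return predicted, float(p), float(n), actual


def paired_test(lines1, lines2):
    """Compare two prediction lists made on the same examples, in the same order."""
    wins = losses = ties = 0
    for a, b in zip(lines1, lines2):
        pred1, _, _, actual1 = parse_prediction(a)
        pred2, _, _, actual2 = parse_prediction(b)
        right1 = pred1 == actual1
        right2 = pred2 == actual2
        if right1 and not right2:
            wins += 1
        elif right2 and not right1:
            losses += 1
        else:
            # Assumption: both right or both wrong counts as a tie.
            ties += 1

    result = {"wins": wins, "losses": losses, "ties": ties,
              "p_win": None, "std_err": None, "std_units": None}
    disagreements = wins + losses
    if disagreements == 0:
        return result

    # Estimated probability of a win given that the classifiers disagree.
    p_win = wins / disagreements
    # Assumed form of the standard error (normal approximation to the binomial).
    std_err = math.sqrt(p_win * (1.0 - p_win) / disagreements)
    result["p_win"] = p_win
    result["std_err"] = std_err
    if std_err > 0:
        # Distance of p_win from 0.5, measured in standard errors.
        result["std_units"] = (p_win - 0.5) / std_err
    return result


if __name__ == "__main__" and len(sys.argv) == 3:
    with open(sys.argv[1]) as f1, open(sys.argv[2]) as f2:
        lines1 = [ln for ln in f1 if ln.strip()]
        lines2 = [ln for ln in f2 if ln.strip()]
    print(paired_test(lines1, lines2))
```

Saved under a hypothetical name such as paired_sketch.py, it could be run on
two files of predict output, for example the foo-default.pred and foo-L2.pred
files from the EXAMPLES section. When every disagreement goes the same way the
assumed standard error is zero, so the sketch leaves the standard-units value
unset rather than dividing by zero.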
.SH PPRINT-RULES
.B pprint-rules [options] stem
.PP
Print a RIPPER ruleset to standard output in a human-readable
format.
.SH EXAMPLES
.PP
Here are some common ways of evaluating RIPPER.
.PP
#generate a ruleset
.PP
ripper foo
.PP
#eliminate redundant conditions
.PP
eliminate foo
.PP
#test ruleset on foo.test and summarize
.PP
predict foo | summarize foo
.PP
#save predictions in a file
.PP
predict foo > foo-default.pred
.PP
#re-run experiment with different RIPPER options
.PP
ripper -L2 foo
.PP
predict foo > foo-L2.pred
.PP
#do a paired test
.PP
pair foo-default.pred foo-L2.pred
.SH BUGS
Probably.