@chapter Bag Of Words Library README

@c set the vars BOW_VERSION
@include version.texi

@samp{libbow}, version @value{BOWVERSION}.

@include libbow-desc.texi

@section Rainbow

@samp{Rainbow} is a standalone program that does document
classification.  Here are some examples:

@itemize @bullet

@item
@example
rainbow -i ./training/positive ./training/negative
@end example

Using the text files found under the directories
@file{./training/positive} and @file{./training/negative},
tokenize, build word vectors, and write the resulting data structures
to disk.

@item
@example
rainbow --query=./testing/254
@end example

Tokenize the text document @file{./testing/254} and classify it,
producing output like:

@example
/home/mccallum/training/positive 0.72
/home/mccallum/training/negative 0.28
@end example

@item
@example
rainbow --test-set=0.5 -t 5
@end example

Perform 5 trials.  Each trial makes a new random test/train split and
outputs the classification of the test documents.

@end itemize

Typing @samp{rainbow --help} prints a list of all rainbow options.

After you have compiled @samp{libbow} and @samp{rainbow}, you can run
the shell script @file{./demo/script} to see an annotated demonstration
of the classifier in action.

More information and documentation are available at
http://www.cs.cmu.edu/~mccallum/bow

@format
Rainbow improvements coming eventually:
   Better documentation.
   Incremental model training.
@end format

@section Arrow

@samp{Arrow} is a standalone program that does document retrieval by
TFIDF.  Index all the documents in directory @file{foo} by typing

@example
arrow --index foo
@end example

Make a single query by typing

@example
arrow --query
@end example

then typing your query and pressing Control-D.

If you want to make many queries, it is more efficient to run arrow
as a server and query it multiple times without restarts by
communicating through a socket.  Type, for example,

@example
arrow --query-server=9876
@end example

and access it through port number 9876.  For example:

@example
telnet localhost 9876
@end example

In this mode there is no need to press Control-D to end a query.  Simply
type your query on one line and press return.

@section Crossbow

@samp{Crossbow} is a standalone program that does document clustering.
Sorry, there is no documentation yet.

@section Archer

@samp{Archer} is a standalone program that does document retrieval with
AltaVista-type queries, using +, -, "", etc.  The commands in the
@samp{arrow} examples above also work for @samp{archer}.  See
@samp{archer --help} for more information.
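
The socket interface of @samp{arrow --query-server} described in the
Arrow section can also be driven from a script instead of telnet.  The
following is a minimal Python sketch, not part of libbow.  The function
name @code{query_arrow}, its parameters, and the timeout-based way of
deciding that a reply is complete are all assumptions, since this README
does not document the server's reply format.

```python
import socket


def query_arrow(query, host="localhost", port=9876, timeout=2.0):
    """Send one query line to an arrow query server and return the reply text.

    Assumption: the reply format is not documented in the README, so this
    collects bytes until the server goes quiet or closes the connection.
    """
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as conn:
        # One query per line, ended by a newline (the "press return" step).
        conn.sendall(query.encode("utf-8") + b"\n")
        try:
            while True:
                data = conn.recv(4096)
                if not data:  # server closed the connection
                    break
                chunks.append(data)
        except socket.timeout:
            pass  # nothing more arrived within the timeout; stop reading
    return b"".join(chunks).decode("utf-8", errors="replace")
```

With a server started as in the Arrow section
(@samp{arrow --query-server=9876}), calling
@code{query_arrow("machine learning")} would return whatever text the
server printed for that query.  This sketch opens a new connection for
each call.  A longer-lived client could keep one connection open and send
several query lines over it, as the telnet session does.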
