@chapter Bag Of Words Library README

@c set the vars BOW_VERSION
@include version.texi

@samp{libbow}, version @value{BOWVERSION}.

@include libbow-desc.texi

@section Rainbow

@samp{Rainbow} is a standalone program that does document
classification.  Here are some examples:

@itemize @bullet

@item
@example
rainbow -i ./training/positive ./training/negative
@end example

Using the text files found under the directories
@file{./training/positive} and @file{./training/negative},
tokenize, build word vectors, and write the resulting data structures
to disk.

@item
@example
rainbow --query=./testing/254
@end example

Tokenize the text document @file{./testing/254} and classify it,
producing output like:

@example
/home/mccallum/training/positive 0.72
/home/mccallum/training/negative 0.28
@end example

@item
@example
rainbow --test-set=0.5 -t 5
@end example

Perform 5 trials, each with a new random test/train split, and output
the classification of the test documents.

@end itemize

Typing @samp{rainbow --help} prints a list of all rainbow options.

After you have compiled @samp{libbow} and @samp{rainbow}, you can run
the shell script @file{./demo/script} to see an annotated demonstration
of the classifier in action.

More information and documentation are available at
http://www.cs.cmu.edu/~mccallum/bow

@format
Rainbow improvements coming eventually:
   Better documentation.
   Incremental model training.
@end format

@section Arrow

@samp{Arrow} is a standalone program that does document retrieval by
TFIDF.  Index all the documents in directory @samp{foo} by typing

@example
arrow --index foo
@end example

Make a single query by typing

@example
arrow --query
@end example

then typing your query and pressing Control-D.

If you want to make many queries, it is more efficient to run arrow
as a server and query it repeatedly, without restarts, by
communicating through a socket.  Type, for example,

@example
arrow --query-server=9876
@end example

and access it through port number 9876.  For example:

@example
telnet localhost 9876
@end example

In this mode there is no need to press Control-D to end a query.  Simply
type your query on one line and press return.

@section Crossbow

@samp{Crossbow} is a standalone program that does document clustering.
Sorry, there is no documentation yet.

@section Archer

@samp{Archer} is a standalone program that does document retrieval with
AltaVista-type queries, using @samp{+}, @samp{-}, @samp{""}, etc.  The
commands in the @samp{arrow} examples above also work for archer.  See
@samp{archer --help} for more information.
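
The query-server mode described for arrow can also be driven from a
script instead of telnet.  The following is a minimal sketch, not part
of libbow: the helper name @code{query_arrow}, its default timeout, and
the read-until-silent strategy are all assumptions, since this README
does not document the format of the server's reply.  It only relies on
what is stated above: connect to the port, send the query on one line,
and end it with a newline.

```python
import socket


def query_arrow(query, host="localhost", port=9876, timeout=2.0):
    """Send one query line to a running query server and return the raw reply."""
    # A query is a single line ended by a newline (the "press return" step).
    line = query.strip() + "\n"
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as conn:
        conn.sendall(line.encode("utf-8"))
        # Assumption: the reply format is not documented in this README, so
        # read until the server closes the connection or stays silent for
        # `timeout` seconds, and hand back whatever text arrived.
        while True:
            try:
                data = conn.recv(4096)
            except socket.timeout:
                break
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")
```

With @samp{arrow --query-server=9876} running, calling
@code{query_arrow("bag of words")} returns the server's reply as one
string.  Parsing that reply is left out on purpose, because its layout
is not specified here.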