<chapter id="fileformats">
<title>File Formats</title>
<para>All files produced by <application>Select</application> should be encoded in UTF-8.</para>
<sect1><title>Dictionary</title>
<para>A saved dictionary looks like this:
<screen>9237;264=5:hello;7562=4:luck;4=8:umbrella; ... 92=8:bookcase;</screen>
The first number is the number of items in the dictionary.</para>
</sect1>
<sect1><title>Vector</title>
<para>The vector (12, 0.34, 0, 0, 34.1200, 6, 6, 6, 6, 0.12) is represented as:
<screen>8;0=12,0.34;4=34.12,6*4,0.12;</screen>
The first number is the number of nonzero items in the vector.</para>
</sect1>
<sect1><title>Vectorizer</title>
<para>A boolean vectorizer:
<screen>vectorizer bool
tokenizer alpha
normalizer
autobias 1
dictionary <replaceable>dictionary</replaceable></screen>
</para>
<para>A term frequency vectorizer:
<screen>vectorizer tf
tokenizer alpha
normalizer
autobias 1
dictionary <replaceable>dictionary</replaceable></screen>
</para>
<para>A TF-IDF vectorizer:
<screen>vectorizer tfidf
tokenizer alpha
normalizer
autobias 1
dictionary <replaceable>dictionary</replaceable>
df <replaceable>vector</replaceable></screen>
</para>
</sect1>
<sect1><title>Classifier</title>
<para>A naive Bayes classifier:
<screen>classifier naivebayes
type multi_one
noc 4 # Number of classes
[global] # Global section: read by load_db
nod 578
now 9912
[0] # Class section: read by load_class
nod 154
now 3359
tf <replaceable>vector</replaceable>
[1]
nod 259
now 6527
tf <replaceable>vector</replaceable>
[2]
nod 53
now 923
tf <replaceable>vector</replaceable>
[3]
nod 112
now 2534
tf <replaceable>vector</replaceable></screen>
</para>
</sect1>
<sect1><title>Folder database</title>
<para><screen>folders <replaceable>dictionary</replaceable></screen></para>
</sect1>
</chapter>
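<!--
Editor's sketch (not part of Select): the dictionary and vector formats in this
chapter can be decoded with a few lines of Python. The function names
parse_vector and parse_dictionary are hypothetical. The sketch assumes that a
vector run has the form start=value,value*repeat,... and that a dictionary
entry has the form number=length:word; where length counts characters. Both
readings are inferred from the examples in this chapter, not from the Select
source code.

```python
def parse_vector(text: str) -> dict[int, float]:
    """Decode 'count;start=v,v*n,...;' into a sparse {index: value} map."""
    fields = [f for f in text.strip().split(";") if f]
    declared = int(fields[0])  # leading count of nonzero items
    items: dict[int, float] = {}
    for run in fields[1:]:
        start, values = run.split("=", 1)
        index = int(start)
        for token in values.split(","):
            # Assumption: 'v*n' means the value v repeated n times.
            value, _, repeat = token.partition("*")
            count = int(repeat) if repeat else 1
            for offset in range(count):
                items[index + offset] = float(value)
            index += count
    nonzero = sum(1 for v in items.values() if v != 0.0)
    if nonzero != declared:
        raise ValueError(f"expected {declared} nonzero items, found {nonzero}")
    return items


def parse_dictionary(text: str) -> list[tuple[int, str]]:
    """Decode 'count;number=length:word;...' into (number, word) pairs."""
    head, _, body = text.strip().partition(";")
    declared = int(head)  # leading count of entries
    entries: list[tuple[int, str]] = []
    pos = 0
    while pos < len(body):
        eq = body.index("=", pos)
        colon = body.index(":", eq)
        number = int(body[pos:eq])
        # Assumption: the prefix before ':' is the word length in characters,
        # so a word may itself contain ';' without confusing the parser.
        length = int(body[eq + 1:colon])
        end = colon + 1 + length
        word = body[colon + 1:end]
        if body[end:end + 1] != ";":
            raise ValueError(f"entry at offset {pos} is not terminated by ';'")
        entries.append((number, word))
        pos = end + 1
    if len(entries) != declared:
        raise ValueError(f"expected {declared} entries, found {len(entries)}")
    return entries


if __name__ == "__main__":
    print(parse_vector("8;0=12,0.34;4=34.12,6*4,0.12;"))
    print(parse_dictionary("2;264=5:hello;7562=4:luck;"))
```

The length prefix is used instead of splitting on ';' so that the decoder does
not depend on words being free of separator characters. Both functions check
the leading count and raise ValueError when it disagrees with the data.
-->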