📄 ref_selectd.xml
字号:
<refmeta><refentrytitle>selectd</refentrytitle><manvolnum>8</manvolnum></refmeta><refnamediv><refname>selectd</refname><refpurpose>email classification daemon</refpurpose></refnamediv><refsynopsisdiv><cmdsynopsis><command>selectd</command><arg><option>-a <replaceable>address</replaceable></option></arg><arg><option>-f <replaceable>configfile</replaceable></option></arg></cmdsynopsis></refsynopsisdiv><refsect1><title>Description</title><para>To be written.</para></refsect1><refsect1><title>Command Line Options</title><variablelist><varlistentry><term><option>-a</option> or <option>--address</option></term><listitem><simpara>Selects socket address to use.This overrides the value given in the configuration file.</simpara></listitem></varlistentry><varlistentry><term><option>-f</option> or <option>--conf-file</option></term><listitem><simpara>Selects which configuration file to use.Default value is <filename>$HOME/.select/selectd.conf</filename>.</simpara></listitem></varlistentry></variablelist></refsect1><refsect1><title>Configuration</title><para>Configuration options for <application>selectd</application> is given in a file.The default name of this file is <filename>$HOME/.select/selectd.conf</filename>.</para><refsect2><title>Location options</title><para>Options describing location of directories etc.</para><refsect3><title>address</title><para>Specifies the daemon address.Addresses beginning with the character @ are interpreted as abstract addresses, otherwise they are filename addresses.Default value is @my_select_classifier.<programlisting>address # Default addressaddress qwerty # Use the specified address</programlisting></para></refsect3><refsect3><title>data_dir</title><para>Data directory.Indicates which directory database files are stored in.Default value is to not save databases.<programlisting>data_dir # Default: Don't save databasesdata_dir /home/mikael/.select # Use the specified directory</programlisting></para></refsect3><refsect3><title>share_dir</title><para>Share directory.Indicates which directory language files are stored in.Default value is <filename>/usr/local/share/select</filename>.<programlisting>share_dir # Default: /usr/local/share/selectshare_dir share # Use the specified directory</programlisting></para></refsect3><refsect3><title>plugin_dir</title><para>Plugin directory.Indicates which directory classifiers and stemmers are stored in.Default value is <filename>/usr/local/lib/select</filename>.<programlisting>plugin_dir # Default: /usr/local/lib/selectplugin_dir plugins # Use the specified directory</programlisting></para></refsect3></refsect2><refsect2><title>Language options</title><para>Options describing language handling.</para><refsect3><title>language</title><para>Specifies a language.May be present several times. The first presence sets the default language.Default value is no languages.</para><para>A language is specified by its name followed by a default locale string.The name must be the same as is used by its stopwords and stemmer.The stopwords and stemmers distibuted with <application>Select</application> use the language codes specified in ISO 639 as names.If you use a custom stopword list (for example for several languages) a new name should be invented for it.<programlisting>language # No languageslanguage en C # Englishlanguage sv sv_SE # Swedishlanguage sven sv_SE # Swenglish</programlisting></para></refsect3><refsect3><title>identifier</title><para>Language identification.Default value is off.<programlisting>identifier on # Use language identificationidentifier off # Don't use language identification</programlisting></para></refsect3><refsect3><title>stemming</title><para>Stemming.Default value is off.<programlisting>stemming on # Perform stemming for chosen languagesstemming off # Don't perform stemming</programlisting></para></refsect3><refsect3><title>stopwords</title><para>Stopwords.Default value is off.<programlisting>stopwords on # Use stopwords for chosen languagesstopwords off # Don't use any stopwords</programlisting></para></refsect3></refsect2><refsect2><title>Classifier Options</title><para>Classifiers are configured in special classifier sections in the configuration file.A classifier selection is started by a line like this:<screen>[classifier]</screen>There may be several classifiers specified.</para><refsect3><title>name</title><para>Classifier name.No default value.<programlisting>name Henry # Call the classifier Henry</programlisting></para></refsect3><refsect3><title>classifier</title><para>Classifier algorithm.No default value.<programlisting># Multi classifiersclassifier Alma # Alma classifierclassifier NaiveBayes # Naive bayes classifierclassifier Perceptron # Perceptron classifierclassifier Rocchio # Rocchio classifierclassifier Romma # Romma classifierclassifier WidrowHoff # Widrow-Hoff (LMS) classifierclassifier WMA # Weighted Majority Algorithm# Document classifiersclassifier From # Sender classifierclassifier Reply # Thread classifier</programlisting></para></refsect3><refsect3><title>type</title><para>Classifier type.No default value.<programlisting>type multi_one # Multi classifier, type ONE_MAXtype multi_rest # Multi classifier, type REST_MAXtype multi_linmax # Multi classifier, type LIN_MAXtype multi_uc # Multi classifier, type UC_MAXtype document # Document classifier</programlisting></para></refsect3><refsect3><title>options</title><para>Classifier options.A string which is passed on to the classifier for interpretation.Is used for changing the behavior of the classifier.The string consists of a comma-separated list of entries of the form key=value.Valid keys and values depends on the classifier.Default value is none.<programlisting>options # No optionsoptions n=200,o=fifo # Classifier dependent options</programlisting></para></refsect3><refsect3><title>tokenizer</title><para>Tokenizer.Specifies which method is used for transforming texts into text-tokens.Default value is alpha.<programlisting>tokenizer alpha # Alpha tokenizertokenizer wspace.byte # Whitespace byte tokenizertokenizer ngram.byte # N-gram byte tokenizertokenizer null # Null tokenizer</programlisting></para></refsect3><refsect3><title>vectorizer</title><para>Vectorization mode.Specifies which method is used for transforming text-tokens into a vector.Default value is tf.<programlisting>vectorizer bool # Booleanvectorizer tf # Term Frequencyvectorizer tfidf # TF-IDF</programlisting></para></refsect3><refsect3><title>normalizer</title><para>Normalization mode.Specifies how vectors should be normalized.Default value is none.<programlisting>normalizer # No normalizationnormalizer cosine # Cosine normalization</programlisting></para></refsect3></refsect2><refsect2><title>Example</title><para>Here is a complete example of a <application>selectd</application> configuration file:</para><informalexample><programlisting>address # Use default addressdata_dir /home/mikael/.select # Save databases in specified directoryshare_dir # Default: /usr/local/share/selectplugin_dir # Default: /usr/local/lib/selectlanguage sven sv_SE # Swenglishidentifier off # Don't use language identificationstemming off # Don't perform stemmingstopwords on # Use stopwords for chosen languages[classifier]name nb # Name the classifier nbclassifier NaiveBayes # NaiveBayes classifiertype multi_one # Multi classifier, type ONE_MAXoptions # No optionstokenizer alpha # Alpha tokenizervectorizer tfidf # TF-IDF vectorizernormalizer # No normalization</programlisting></informalexample></refsect2></refsect1>
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -