<refmeta>
  <refentrytitle>selectd</refentrytitle>
  <manvolnum>8</manvolnum>
</refmeta>
<refnamediv>
  <refname>selectd</refname>
  <refpurpose>email classification daemon</refpurpose>
</refnamediv>
<refsynopsisdiv>
  <cmdsynopsis>
    <command>selectd</command>
    <arg><option>-a <replaceable>address</replaceable></option></arg>
    <arg><option>-f <replaceable>configfile</replaceable></option></arg>
  </cmdsynopsis>
</refsynopsisdiv>
<refsect1>
  <title>Description</title>
  <para>To be written.</para>
</refsect1>
<refsect1>
  <title>Command Line Options</title>
  <variablelist>
    <varlistentry>
      <term><option>-a</option> or <option>--address</option></term>
      <listitem>
        <simpara>Selects the socket address to use. This overrides the value given in the configuration file.</simpara>
      </listitem>
    </varlistentry>
    <varlistentry>
      <term><option>-f</option> or <option>--conf-file</option></term>
      <listitem>
        <simpara>Selects which configuration file to use. Default value is <filename>$HOME/.select/selectd.conf</filename>.</simpara>
      </listitem>
    </varlistentry>
  </variablelist>
</refsect1>
<refsect1>
  <title>Configuration</title>
  <para>Configuration options for <application>selectd</application> are given in a file. The default name of this file is <filename>$HOME/.select/selectd.conf</filename>.</para>
  <refsect2>
    <title>Location options</title>
    <para>Options describing the location of directories etc.</para>
    <refsect3>
      <title>address</title>
      <para>Specifies the daemon address. Addresses beginning with the character @ are interpreted as abstract addresses; all other addresses are interpreted as filename addresses. Default value is @my_select_classifier.
<programlisting>address                                 # Default address
address         qwerty                  # Use the specified address</programlisting></para>
    </refsect3>
    <refsect3>
      <title>data_dir</title>
      <para>Data directory. Indicates the directory in which database files are stored. By default, databases are not saved.
<programlisting>data_dir                                # Default: Don't save databases
data_dir        /home/mikael/.select    # Use the specified directory</programlisting></para>
    </refsect3>
    <refsect3>
      <title>share_dir</title>
      <para>Share directory. Indicates the directory in which language files are stored. Default value is <filename>/usr/local/share/select</filename>.
<programlisting>share_dir                               # Default: /usr/local/share/select
share_dir       share                   # Use the specified directory</programlisting></para>
    </refsect3>
    <refsect3>
      <title>plugin_dir</title>
      <para>Plugin directory. Indicates the directory in which classifiers and stemmers are stored. Default value is <filename>/usr/local/lib/select</filename>.
<programlisting>plugin_dir                              # Default: /usr/local/lib/select
plugin_dir      plugins                 # Use the specified directory</programlisting></para>
    </refsect3>
  </refsect2>
  <refsect2>
    <title>Language options</title>
    <para>Options describing language handling.</para>
    <refsect3>
      <title>language</title>
      <para>Specifies a language. May be present several times.
The first occurrence sets the default language. By default, no languages are specified.</para>
      <para>A language is specified by its name followed by a default locale string. The name must be the same as the one used by its stopwords and stemmer. The stopwords and stemmers distributed with <application>Select</application> use the language codes specified in ISO 639 as names. If you use a custom stopword list (for example, one covering several languages), a new name should be invented for it.
<programlisting>language                                # No languages
language        en C                    # English
language        sv sv_SE                # Swedish
language        sven sv_SE              # Swenglish</programlisting></para>
    </refsect3>
    <refsect3>
      <title>identifier</title>
      <para>Language identification. Default value is off.
<programlisting>identifier      on                      # Use language identification
identifier      off                     # Don't use language identification</programlisting></para>
    </refsect3>
    <refsect3>
      <title>stemming</title>
      <para>Stemming. Default value is off.
<programlisting>stemming        on                      # Perform stemming for chosen languages
stemming        off                     # Don't perform stemming</programlisting></para>
    </refsect3>
    <refsect3>
      <title>stopwords</title>
      <para>Stopwords. Default value is off.
<programlisting>stopwords       on                      # Use stopwords for chosen languages
stopwords       off                     # Don't use any stopwords</programlisting></para>
    </refsect3>
  </refsect2>
  <refsect2>
    <title>Classifier Options</title>
    <para>Classifiers are configured in special classifier sections in the configuration file. A classifier section is started by a line like this:
<screen>[classifier]</screen>
There may be several classifiers specified.</para>
    <refsect3>
      <title>name</title>
      <para>Classifier name. No default value.
<programlisting>name            Henry                   # Call the classifier Henry</programlisting></para>
    </refsect3>
    <refsect3>
      <title>classifier</title>
      <para>Classifier algorithm. No default value.
<programlisting># Multi classifiers
classifier      Alma                    # Alma classifier
classifier      NaiveBayes              # Naive Bayes classifier
classifier      Perceptron              # Perceptron classifier
classifier      Rocchio                 # Rocchio classifier
classifier      Romma                   # Romma classifier
classifier      WidrowHoff              # Widrow-Hoff (LMS) classifier
classifier      WMA                     # Weighted Majority Algorithm
# Document classifiers
classifier      From                    # Sender classifier
classifier      Reply                   # Thread classifier</programlisting></para>
    </refsect3>
    <refsect3>
      <title>type</title>
      <para>Classifier type. No default value.
<programlisting>type            multi_one               # Multi classifier, type ONE_MAX
type            multi_rest              # Multi classifier, type REST_MAX
type            multi_linmax            # Multi classifier, type LIN_MAX
type            multi_uc                # Multi classifier, type UC_MAX
type            document                # Document classifier</programlisting></para>
    </refsect3>
    <refsect3>
      <title>options</title>
      <para>Classifier options. A string that is passed on to the classifier for interpretation. It is used to change the behavior of the classifier. The string consists of a comma-separated list of entries of the form key=value. Valid keys and values depend on the classifier. Default value is none.
<programlisting>options                                 # No options
options         n=200,o=fifo            # Classifier dependent options</programlisting></para>
    </refsect3>
    <refsect3>
      <title>tokenizer</title>
      <para>Tokenizer. Specifies which method is used for transforming texts into text-tokens. Default value is alpha.
<programlisting>tokenizer       alpha                   # Alpha tokenizer
tokenizer       wspace.byte             # Whitespace byte tokenizer
tokenizer       ngram.byte              # N-gram byte tokenizer
tokenizer       null                    # Null tokenizer</programlisting></para>
    </refsect3>
    <refsect3>
      <title>vectorizer</title>
      <para>Vectorization mode. Specifies which method is used for transforming text-tokens into a vector. Default value is tf.
<programlisting>vectorizer      bool                    # Boolean
vectorizer      tf                      # Term Frequency
vectorizer      tfidf                   # TF-IDF</programlisting></para>
    </refsect3>
    <refsect3>
      <title>normalizer</title>
      <para>Normalization mode. Specifies how vectors should be normalized. Default value is none.
<programlisting>normalizer                              # No normalization
normalizer      cosine                  # Cosine normalization</programlisting></para>
    </refsect3>
  </refsect2>
  <refsect2>
    <title>Example</title>
    <para>Here is a complete example of a <application>selectd</application> configuration file:</para>
    <informalexample>
<programlisting>address                                 # Use default address
data_dir        /home/mikael/.select    # Save databases in specified directory
share_dir                               # Default: /usr/local/share/select
plugin_dir                              # Default: /usr/local/lib/select
language        sven sv_SE              # Swenglish
identifier      off                     # Don't use language identification
stemming        off                     # Don't perform stemming
stopwords       on                      # Use stopwords for chosen languages
[classifier]
name            nb                      # Name the classifier nb
classifier      NaiveBayes              # NaiveBayes classifier
type            multi_one               # Multi classifier, type ONE_MAX
options                                 # No options
tokenizer       alpha                   # Alpha tokenizer
vectorizer      tfidf                   # TF-IDF vectorizer
normalizer                              # No normalization</programlisting>
    </informalexample>
  </refsect2>
</refsect1>
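<refsect1>
  <title>Example with several classifiers</title>
  <para>The configuration section states that the language option may be present several times and that several classifier sections may be specified. The following sketch illustrates how that might look in one file. It uses only option names and values listed in this page. The classifier names words and sender are hypothetical, and the pairing of each algorithm with its type, as well as the assumption that options left out of a classifier section take their default values, has not been checked against a running <application>selectd</application>.</para>
  <informalexample>
<programlisting>address                                 # Use default address
data_dir        /home/mikael/.select    # Save databases in specified directory
share_dir                               # Default: /usr/local/share/select
plugin_dir                              # Default: /usr/local/lib/select
language        en C                    # English (listed first, so it is the default)
language        sv sv_SE                # Swedish
identifier      on                      # Use language identification
stemming        on                      # Perform stemming for chosen languages
stopwords       on                      # Use stopwords for chosen languages
[classifier]
name            words                   # Hypothetical name
classifier      Perceptron              # Perceptron classifier
type            multi_rest              # Multi classifier, type REST_MAX
options                                 # No options
tokenizer       alpha                   # Alpha tokenizer
vectorizer      tfidf                   # TF-IDF
normalizer      cosine                  # Cosine normalization
[classifier]
name            sender                  # Hypothetical name
classifier      From                    # Sender classifier
type            document                # Document classifier (assumed pairing)</programlisting>
  </informalexample>
  <para>Each [classifier] line starts a new section, so the name, classifier and type lines that follow it apply only to that classifier.</para>
</refsect1>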