Transduction options (see [<A href="http://www.cs.cornell.edu/People/tj/svm_light/#References">Joachims, 1999c</A>], [<A href="http://www.cs.cornell.edu/People/tj/svm_light/#References">Joachims, 2002a</A>]):
-p [0..1] - fraction of unlabeled examples to be classified
into the positive class (default is the ratio of
positive and negative examples in the training data)
Kernel options:
-t int - type of kernel function:
0: linear (default)
1: polynomial (s a*b+c)^d
2: radial basis function exp(-gamma ||a-b||^2)
3: sigmoid tanh(s a*b + c)
4: user defined kernel from kernel.h
-d int - parameter d in polynomial kernel
-g float - parameter gamma in rbf kernel
-s float - parameter s in sigmoid/poly kernel
-r float - parameter c in sigmoid/poly kernel
-u string - parameter of user defined kernel
Optimization options (see [<A href="http://www.cs.cornell.edu/People/tj/svm_light/#References">Joachims, 1999a</A>], [<A href="http://www.cs.cornell.edu/People/tj/svm_light/#References">Joachims, 2002a</A>]):
-q [2..] - maximum size of QP-subproblems (default 10)
-n [2..q] - number of new variables entering the working set
in each iteration (default n = q). Set n<q to prevent
zig-zagging.
-m [5..] - size of cache for kernel evaluations in MB (default 40)
The larger the cache, the faster the optimization runs.
-e float - eps: Allow that error for termination criterion
[y [w*x+b] - 1] = eps (default 0.001)
-h [5..] - number of iterations a variable needs to be
optimal before considered for shrinking (default 100)
-f [0,1] - do final optimality check for variables removed by
shrinking. Although this test is usually positive, there
is no guarantee that the optimum was found if the test is
omitted. (default 1)
-y string - if option is given, reads alphas from the file with the
given name and uses them as the starting point (default 'disabled')
-# int - terminate optimization if there has been no progress after
this number of iterations (default 100000)
Output options:
-l char - file to write predicted labels of unlabeled examples
into after transductive learning
-a char - write all alphas to this file after learning (in the
same order as in the training set)</PRE></DIR>
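<P>For example, a call combining the kernel and optimization options above
might look as follows (the file names are placeholders): </P>
<DIR><TT>
<P>svm_learn -t 2 -g 0.5 -m 100 train.dat model</P></TT></DIR>
<P>This trains with an RBF kernel (<TT>-t 2</TT>), sets gamma to 0.5
(<TT>-g 0.5</TT>), and enlarges the kernel cache to 100 MB (<TT>-m 100</TT>). </P>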
<P>A more detailed description of the parameters and how they link to the
respective algorithms is given in the appendix of [<A
href="http://www.cs.cornell.edu/People/tj/svm_light/#References">Joachims,
2002a</A>]. </P>
<P>The input file <TT>example_file</TT> contains the training examples. The
first lines may contain comments and are ignored if they start with #. Each of
the following lines represents one training example and is of the following
format: </P>
<DIR><TT><line> .=. <target> <feature>:<value> <feature>:<value> ... <feature>:<value> # <info></TT><BR>
<TT><target> .=. +1 | -1 | 0 | <float></TT><BR>
<TT><feature> .=. <integer> | "qid"</TT><BR>
<TT><value> .=. <float></TT><BR>
<TT><info> .=. <string></TT></DIR>
<P>The target value and each of the feature/value pairs are separated by a space
character. Feature/value pairs MUST be ordered by increasing feature number.
Features with value zero can be skipped. The string <TT><info></TT> can be
used to pass additional information to the kernel (e.g., non-feature-vector
data). Check the <A
href="http://www.cs.cornell.edu/People/tj/svm_light/svm_light_faq.html">FAQ</A>
for more details on how to implement your own kernel.</P>
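<P>As a minimal sketch of such a kernel (an illustration, not the
distribution's reference implementation): option <TT>-t 4</TT> uses the
function <TT>custom_kernel()</TT> in <TT>kernel.h</TT>. The sketch assumes the
<TT>SVECTOR</TT> type and the sparse inner product <TT>sprod_ss()</TT> from
<TT>svm_common.h</TT>, and that the <TT>userdefined</TT> field holds the
<TT><info></TT> string of each example: </P>
<BLOCKQUOTE><PRE>/* kernel.h -- sketch of a user-defined kernel for "svm_learn -t 4".
   Assumes SVECTOR and sprod_ss() as declared in svm_common.h. */
double custom_kernel(KERNEL_PARM *kernel_parm, SVECTOR *a, SVECTOR *b)
{
  /* a->userdefined and b->userdefined are assumed to hold the <info>
     strings given after '#' in the example file; parse them here if the
     kernel needs per-example side information. */
  double k = sprod_ss(a, b);       /* sparse inner product a*b */
  return (1.0 + k) * (1.0 + k);    /* example: quadratic kernel (1 + a*b)^2 */
}</PRE></BLOCKQUOTE>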
<P>In classification mode, the target value denotes the class of the example:
a target value of +1 marks a positive example, -1 a negative example. So, for
example, the line </P>
<BLOCKQUOTE>
<P><TT>-1 1:0.43 3:0.12 9284:0.2 # abcdef</TT> </P></BLOCKQUOTE>
<P>specifies a negative example for which feature number 1 has the value 0.43,
feature number 3 has the value 0.12, feature number 9284 has the value 0.2, and
all the other features have value 0. In addition, the string <TT>abcdef</TT> is
stored with the vector, which can serve as a way of providing additional
information for user defined kernels. A class label of 0 indicates that this
example should be classified using transduction. The predictions for the
examples classified by transduction are written to the file specified through
the -l option. The order of the predictions is the same as in the training data.
</P>
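<P>To illustrate (the feature values here are made up), a transductive
training file mixes labeled and unlabeled lines: </P>
<BLOCKQUOTE>
<P><TT>+1 1:0.7 2:0.1<BR>-1 1:0.2 2:0.9<BR>0 1:0.6 2:0.2</TT> </P></BLOCKQUOTE>
<P>The first two lines are labeled training examples; the third has target 0
and is classified by transduction, with its prediction written to the file
given via the -l option. </P>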
<P>In regression mode, the <target> contains the real-valued target
value.</P>
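<P>For example (again with made-up values), the line </P>
<BLOCKQUOTE>
<P><TT>1.3 1:0.43 3:0.12</TT> </P></BLOCKQUOTE>
<P>specifies a training example whose real-valued target is 1.3. </P>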
<P>In ranking mode [<A
href="http://www.cs.cornell.edu/People/tj/svm_light/#References">Joachims,
2002c</A>], the target value is used to generate pairwise preference
constraints (see <A href="http://striver.joachims.org/">STRIVER</A>). A
preference constraint is included for all pairs of examples in the
<TT>example_file</TT> for which the target value differs. The special feature
"qid" can be used to restrict the generation of constraints: two examples are
considered for a pairwise preference constraint only if their "qid" values are
the same. For example, given the <TT>example_file</TT></P>
<BLOCKQUOTE dir=ltr style="MARGIN-RIGHT: 0px">
<P><TT>3 qid:1 1:0.53 2:0.12<BR>2 qid:1 1:0.13 2:0.1<BR>7 qid:2
1:0.87 2:0.12 </TT></P></BLOCKQUOTE>
<P>a preference constraint is included only for the first and the second
example (i.e., the first should be ranked higher than the second), but not for
the third example, since it has a different "qid".</P>
<P>In all modes, the result of <TT>svm_learn</TT> is the model which is learned
from the training data in <TT>example_file</TT>. The model is written to
<TT>model_file</TT>. To make predictions on test examples, <TT>svm_classify</TT>
reads this file. <TT>svm_classify</TT> is called with the following parameters:
</P>
<DIR><TT>
<P>svm_classify [options] example_file model_file output_file</P></TT></DIR>
<P>Available options are: </P>
<BLOCKQUOTE><PRE>-h Help.
-v [0..3] Verbosity level (default 2).
-f [0,1] 0: old output format of V1.0
1: output the value of decision function (default)</PRE></BLOCKQUOTE>
<P>The test examples in <TT>example_file</TT> are given in the same format as
the training examples; again, the <TT><target></TT> can have the value zero,
indicating that the label is unknown. For each test example in
<TT>example_file</TT>, the predicted value is written to <TT>output_file</TT>,
one line per test example, containing the value of the decision function on
that example. For classification, the sign of this value determines the
predicted class. For regression, it is the predicted value itself, and for
ranking the value can be used to order the test examples. </P>
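<P>As a minimal sketch (assuming the one-value-per-line format described
above; the file name <TT>predictions</TT> is a placeholder), the predicted
classes can be recovered from <TT>output_file</TT> like this: </P>
<BLOCKQUOTE><PRE>#include <stdio.h>

/* Read the decision values (one per line) written by svm_classify and
   print the predicted class sign(value). "predictions" is a placeholder
   file name. */
int main(void)
{
  FILE *fp = fopen("predictions", "r");
  double value;
  if (!fp) { perror("predictions"); return 1; }
  while (fscanf(fp, "%lf", &value) == 1)
    printf("%+d\n", (value >= 0.0) ? +1 : -1);
  fclose(fp);
  return 0;
}</PRE></BLOCKQUOTE>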
<P>If you want to find out more, try this <A
href="http://www.cs.cornell.edu/People/tj/svm_light/svm_light_faq.html">FAQ</A>.
</P>
<H2>Getting started: some Example Problems</H2>
<H3>Inductive SVM</H3>
<P>You will find an example text classification problem at </P>
<DIR>
<P><A href="http://download.joachims.org/svm_light/examples/example1.tar.gz"
target=_top>http://download.joachims.org/svm_light/examples/example1.tar.gz</A></P></DIR>
<P>Download this file into your svm_light directory and unpack it with </P>
<DIR><TT>
<P>gunzip -c example1.tar.gz | tar xvf -</P></TT></DIR>
<P>This will create a subdirectory <TT>example1</TT>. Documents are represented
as feature vectors. Each feature corresponds to a word stem (9947 features). The
task is to learn which <A
href="http://www.daviddlewis.com/resources/testcollections/reuters21578/"
target=_top>Reuters articles</A> are about "corporate acquisitions". There are
1000 positive and 1000 negative examples in the file <TT>train.dat</TT>. The
file <TT>test.dat</TT> contains 600 test examples. The feature numbers
correspond to the line numbers in the file <TT>words</TT>. To run the example,
execute the commands: </P>
<DIR><TT>
<P>svm_learn example1/train.dat example1/model<BR></TT><TT>svm_classify
example1/test.dat example1/model example1/predictions</P></TT></DIR>
<P>The accuracy on the test set is printed to stdout. </P>
<H3>Transductive SVM</H3>
<P>To try out the transductive learner, you can use the following dataset (see
also <A href="http://sgt.joachims.org/">Spectral Graph Transducer</A>). I
compiled it from the same Reuters articles as used in the example for the
inductive SVM. The dataset consists of only 10 training examples (5 positive and
5 negative) and the same 600 test examples as above. You find it at </P>
<DIR>
<P><A href="http://download.joachims.org/svm_light/examples/example2.tar.gz"
target=_top>http://download.joachims.org/svm_light/examples/example2.tar.gz</A></P></DIR>
<P>Download this file into your svm_light directory and unpack it with </P>
<DIR><TT>
<P>gunzip -c example2.tar.gz | tar xvf -</P></TT></DIR>
<P>This will create a subdirectory <TT>example2</TT>. To run the example,
execute the commands: </P>
<DIR>
<P><TT>svm_learn example2/train_transduction.dat example2/model</TT>
<BR><TT>svm_classify example2/test.dat example2/model
example2/predictions</TT></P></DIR>
<P>The classification module is called only to get the accuracy printed. The
transductive learner is invoked automatically, since <TT>train_transduction.dat</TT>
contains unlabeled examples (i.e., the 600 test examples). You can compare
the results to those of the inductive SVM by running: </P>
<BLOCKQUOTE><TT>svm_learn example2/train_induction.dat example2/model</TT>
<BR><TT>svm_classify example2/test.dat example2/model
example2/predictions</TT> </BLOCKQUOTE>
<P>The file <TT>train_induction.dat</TT> contains the same 10 (labeled) training
examples as <TT>train_transduction.dat</TT>. </P>
<H3>Ranking SVM</H3>
<P>For the ranking SVM [<A
href="http://www.cs.cornell.edu/People/tj/svm_light/#References">Joachims,
2002c</A>], I created a toy example. It consists of only 12 training examples in
3 groups and 4 test examples. You find it at </P>
<DIR>
<P><A href="http://download.joachims.org/svm_light/examples/example3.tar.gz"
target=_top>http://download.joachims.org/svm_light/examples/example3.tar.gz</A></P></DIR>
<P>Download this file into your svm_light directory and unpack it with </P>
<DIR><TT>
<P>gunzip -c example3.tar.gz | tar xvf -</P></TT></DIR>
<P>This will create a subdirectory <TT>example3</TT>. To run the example,
execute the commands: </P>
<DIR>
<P><TT>svm_learn -z p example3/train.dat example3/model</TT>
<BR><TT>svm_classify example3/test.dat example3/model
example3/predictions</TT></P></DIR>
<P>The output in the predictions file can be used to rank the test examples. If
you do so, you will see that it predicts the correct ranking. The values in the
predictions file do not have a meaning in an absolute sense. They are only used
for ordering. </P>
<P>It can also be interesting to look at the "training error" of the
ranking SVM. The equivalent of training error for a ranking SVM is the number of
training pairs that are misordered by the learned model. To find those pairs,
one can apply the model to the training file: </P>
<BLOCKQUOTE dir=ltr style="MARGIN-RIGHT: 0px">
<P><TT>svm_classify example3/train.dat example3/model
example3/predictions.train </TT></P></BLOCKQUOTE>
<P>Again, the predictions file shows the ordering implied by the model. The
model ranks all training examples correctly. </P>
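<P>As a sketch of this check (the decision values below are made up; in
practice the targets and qids would be parsed from <TT>train.dat</TT> and the
predictions from <TT>predictions.train</TT>), counting misordered pairs within
each qid group could look like this: </P>
<BLOCKQUOTE><PRE>#include <stdio.h>

/* Count training pairs misordered by the model: for every pair i,j with
   qid[i]==qid[j] and target[i]>target[j], the model should also predict
   pred[i]>pred[j]. The arrays are hard-coded from the three-line ranking
   example above; pred[] holds hypothetical decision values. */
int main(void)
{
  double target[] = {3, 2, 7};        /* first value of each training line */
  int    qid[]    = {1, 1, 2};        /* value of the "qid" feature        */
  double pred[]   = {1.2, 0.4, 0.9};  /* decision values from svm_classify */
  int n = 3, misordered = 0;

  for (int i = 0; i < n; i++)
    for (int j = 0; j < n; j++)
      if (qid[i] == qid[j] && target[i] > target[j] && pred[i] <= pred[j])
        misordered++;

  printf("misordered pairs: %d\n", misordered);  /* prints 0 here */
  return 0;
}</PRE></BLOCKQUOTE>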
<P>Note that ranks are comparable only between examples with the same qid. Note
also that the target value (first value in each line of the data files) is only
used to define the order of the examples. Its absolute value does not matter, as
long as the ordering relative to the other examples with the same qid remains
the same.</P>
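<P>For example, replacing the targets in the ranking example above as in </P>
<BLOCKQUOTE>
<P><TT>10 qid:1 1:0.53 2:0.12<BR>1 qid:1 1:0.13 2:0.1<BR>7 qid:2
1:0.87 2:0.12</TT> </P></BLOCKQUOTE>
<P>induces exactly the same preference constraints, since only the ordering
within each qid matters. </P>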
<H2>Extensions and Additions</H2>
<UL>
<LI><A href="http://ai-nlp.info.uniroma2.it/moschitti/" target=_top>Tree
Kernels</A>: a kernel for classifying trees with SVM<I><SUP>light</SUP></I>,
written by <A href="http://ai-nlp.info.uniroma2.it/moschitti/"
target=_top>Alessandro Moschitti</A>
<LI><A
href="http://search.cpan.org/~kwilliams/Algorithm-SVMLight/lib/Algorithm/SVMLight.pm"
target=_top>Perl Interface</A>: a Perl interface to SVM<I><SUP>light</SUP></I>,
written by <A href="mailto:kwilliams@cpan.org" target=_top>Ken Williams</A>
<LI><A href="http://www.cis.tugraz.at/igi/aschwaig/software.html"
target=_top>Matlab Interface</A>: a MATLAB interface to
SVM<I><SUP>light</SUP></I>, written by <A
href="http://www.cis.tugraz.at/igi/aschwaig/index.html" target=_top>Anton
Schwaighofer</A> (for <A
href="http://svmlight.joachims.org/old/svm_light_v4.00.html">SVM<I><SUP>light</SUP></I>
V4.00</A>)
<LI><A href="http://www.ship.edu/~thb/mexsvm/" target=_top>Matlab
Interface</A>: a MATLAB MEX-interface to SVM<I><SUP>light</SUP></I>, written by
<A href="http://www.ship.edu/~thb/" target=_top>Tom Briggs</A>
<LI><A
href="http://www-cad.eecs.berkeley.edu/~hwawen/research/projects/jsvm/doc/manual/index.html"
target=_top>jSVM</A>: a Java interface to SVM<I><SUP>light</SUP></I>, written
by <A href="http://www-cad.eecs.berkeley.edu/~hwawen/" target=_top>Heloise
Hwawen Hse</A> (for <A
href="http://www-ai.cs.uni-dortmund.de/SOFTWARE/SVM_LIGHT/svm_light_v2.01.eng.html">SVM<I><SUP>light</SUP></I>
V2.01</A>)
<LI>A <A
href="http://sourceforge.net/project/showfiles.php?group_id=16036">special
version of SVM<I><SUP>light</SUP></I></A> is integrated into the virtual file
system <A href="http://witme.sourceforge.net/libferris.web/">libferris</A> by
Ben Martin
<LI><A
href="http://www.soarcorp.com/svm_light_data_helper.jsp">LightDataAgent</A>: a
tool to translate comma/tab-delimited data into SVM<I><SUP>light</SUP></I>
format, written by Ophir Gottlieb
<LI><A href="http://www.aifb.uni-karlsruhe.de/WBS/sbl/software/jnikernel/">JNI
Kernel</A>: an interface for SVM<I><SUP>light</SUP></I> to access kernel
functions implemented in Java, written by Stephan Bloehdorn (for <A
href="http://www-ai.cs.uni-dortmund.de/SOFTWARE/SVM_LIGHT/svm_light_v6.01.eng.html">SVM<I><SUP>light</SUP></I>
V6.01</A>)
</UL>