meme.ino_up800.tcm

来自「EM算法的改进」· TCM 代码 · 共 377 行 · 第 1/2 页

TCM
377
字号
********************************************************************************MEME - Motif discovery tool********************************************************************************MEME version 3.5.3 (Release date: 2006-03-08 09:30:25 +1000 (Wed, 08 Mar 2006))For further information on how to interpret these results or to geta copy of the MEME software please access http://meme.nbcr.net.This file may be used as input to the MAST algorithm for searchingsequence databases for matches to groups of motifs.  MAST is availablefor interactive use and downloading at http://meme.nbcr.net.****************************************************************************************************************************************************************REFERENCE********************************************************************************If you use this program in your research, please cite:Timothy L. Bailey and Charles Elkan,"Fitting a mixture model by expectation maximization to discovermotifs in biopolymers", Proceedings of the Second InternationalConference on Intelligent Systems for Molecular Biology, pp. 28-36,AAAI Press, Menlo Park, California, 1994.****************************************************************************************************************************************************************TRAINING SET********************************************************************************DATAFILE= /home/tbailey/tests/INO_up800.sALPHABET= ACGTSequence name            Weight Length  Sequence name            Weight Length  -------------            ------ ------  -------------            ------ ------  CHO1                     1.0000    800  CHO2                     1.0000    800  FAS1                     1.0000    800  FAS2                     1.0000    800  ACC1                     1.0000    800  INO1                     1.0000    800  OPI3                     1.0000    800  ****************************************************************************************************************************************************************COMMAND LINE SUMMARY********************************************************************************This information can also be useful in the event you wish to report aproblem with the MEME software.command: meme /home/tbailey/tests/INO_up800.s -mod tcm -dna -revcomp -bfile /home/tbailey/tests/yeast.nc.6.freq -nostatus -nmotifs 2 model:  mod=           tcm    nmotifs=         2    evt=           infobject function=  E-value of product of p-valueswidth:  minw=            8    maxw=           50    minic=        0.00width:  wg=             11    ws=              1    endgaps=       yesnsites: minsites=        2    maxsites=       35    wnsites=       0.8theta:  prob=            1    spmap=         uni    spfuzz=        0.5em:     prior=   dirichlet    b=            0.01    maxiter=        50        distance=    1e-05data:   n=            5600    N=               7strands: + -sample: seed=            0    seqfrac=         1Letter frequencies in dataset:A 0.304 C 0.196 G 0.196 T 0.304 Background letter frequencies (from /home/tbailey/tests/yeast.nc.6.freq):A 0.324 C 0.176 G 0.176 T 0.324 ****************************************************************************************************************************************************************MOTIF  1	width =   15   sites =  14   llr = 174   E-value = 2.3e-003********************************************************************************--------------------------------------------------------------------------------	Motif 1 Description--------------------------------------------------------------------------------Simplified        A  :::8:a:::442411pos.-specific     C  :1a19:117527644probability       G  :::1:::83:4:11:matrix            T  a9:11:91:111:44         bits    2.5   *                             2.3   *                             2.0   *                             1.8   * *   *      Information      1.5 * * ** **      content          1.3 *** *****  *   (17.9 bits)      1.0 *** *****  **                   0.8 ********** **                   0.5 ********** ****                 0.3 ***************                 0.0 ---------------Multilevel           TTCACATGCCACCCCconsensus                    GAGAATTsequence                       C                                        ----------------------------------------------------------------------------------------------------------------------------------------------------------------	Motif 1 sites sorted by position p-value--------------------------------------------------------------------------------Sequence name            Strand  Start   P-value                 Site    -------------            ------  ----- ---------            ---------------FAS1                         +     95  6.14e-09 GGCCAAAAAC TTCACATGCCGCCCA GCCAAGCATAINO1                         -    617  2.63e-08 GACAATACTT TTCACATGCCGCATT TAGCCGCCGAFAS2                         +    567  1.80e-07 CTCCCGCGTT TTCACATGCTACCTC ATTCGCCTCGCHO1                         +    611  2.03e-07 ACTTTGAACG TTCACACGGCACCCT CACGCCTTTGACC1                         +     83  4.22e-07 CGTTAAAATC TTCACATGGCCCGGC CGCGCGCGCGCHO2                         +    354  5.33e-07 TGCCACACTT TTCTCATGCCGCATT CATTATTCGCCHO1                         +    640  7.59e-07 CCTTTGAGCT TTCACATGGACCCAT CTAAAGATGAINO1                         -    568  8.34e-07 CTCCGCATAT TTCACATTCAACACT TTCGATTCCGOPI3                         +    637  1.02e-06 CTTTTCAATC TTCATATGCAAACCC ACACATGCATINO1                         +    687  1.10e-06 ACTTATTTAA TTCACATGGAGCAGA GAAAGCGCACOPI3                         -    582  1.84e-06 GGTGCAACTT TCCACATGCACTCTC ATTGACACCTINO1                         -    547  2.76e-06 CACTTTCGAT TCCGCATCCAACCCC GGAACACCGGCHO1                         +    163  3.42e-06 TCTGGCGCCC TTCCCATTCCGAACC ATGTTATATTCHO2                         -    478  7.37e-06 AATCTGTATT TTCATATGCCTACTT ATAAATTCAC----------------------------------------------------------------------------------------------------------------------------------------------------------------	Motif 1 block diagrams--------------------------------------------------------------------------------SEQUENCE NAME            POSITION P-VALUE  MOTIF DIAGRAM-------------            ----------------  -------------FAS1                              6.1e-09  94_[+1]_691INO1                              2.8e-06  546_[-1]_6_[-1]_34_[-1]_55_[+1]_99FAS2                              1.8e-07  566_[+1]_219CHO1                              7.6e-07  162_[+1]_433_[+1]_14_[+1]_146ACC1                              4.2e-07  82_[+1]_703CHO2                              5.3e-07  353_[+1]_109_[-1]_308OPI3                                1e-06  581_[-1]_40_[+1]_149----------------------------------------------------------------------------------------------------------------------------------------------------------------	Motif 1 in BLOCKS format--------------------------------------------------------------------------------BL   MOTIF 1 width=15 seqs=14FAS1                     (   95) TTCACATGCCGCCCA  1 INO1                     (  617) TTCACATGCCGCATT  1 FAS2                     (  567) TTCACATGCTACCTC  1 CHO1                     (  611) TTCACACGGCACCCT  1 ACC1                     (   83) TTCACATGGCCCGGC  1 CHO2                     (  354) TTCTCATGCCGCATT  1 CHO1                     (  640) TTCACATGGACCCAT  1 INO1                     (  568) TTCACATTCAACACT  1 OPI3                     (  637) TTCATATGCAAACCC  1 INO1                     (  687) TTCACATGGAGCAGA  1 OPI3                     (  582) TCCACATGCACTCTC  1 INO1                     (  547) TCCGCATCCAACCCC  1 CHO1                     (  163) TTCCCATTCCGAACC  1 CHO2                     (  478) TTCATATGCCTACTT  1 //----------------------------------------------------------------------------------------------------------------------------------------------------------------	Motif 1 position-specific scoring matrix--------------------------------------------------------------------------------log-odds matrix: alength= 4 w= 15 n= 5502 bayes= 8.45817 E= 2.3e-003  -1045  -1045  -1045    162  -1045    -30  -1045    140  -1045    251  -1045  -1045    128   -130   -130   -218  -1045    229  -1045   -118    162  -1045  -1045  -1045  -1045   -130  -1045    152  -1045   -130    216   -118  -1045    202     70  -1045     40    151  -1045   -218     14     29    102   -218    -60    202  -1045   -218     14    170   -130  -1045   -218    129    -30     14   -118    129  -1045     40 ----------------------------------------------------------------------------------------------------------------------------------------------------------------	Motif 1 position-specific probability matrix--------------------------------------------------------------------------------letter-probability matrix: alength= 4 w= 15 nsites= 14 E= 2.3e-003  0.000000  0.000000  0.000000  1.000000  0.000000  0.142857  0.000000  0.857143  0.000000  1.000000  0.000000  0.000000  0.785714  0.071429  0.071429  0.071429  0.000000  0.857143  0.000000  0.142857  1.000000  0.000000  0.000000  0.000000  0.000000  0.071429  0.000000  0.928571  0.000000  0.071429  0.785714  0.142857  0.000000  0.714286  0.285714  0.000000  0.428571  0.500000  0.000000  0.071429  0.357143  0.214286  0.357143  0.071429  0.214286  0.714286  0.000000  0.071429 

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?