📄 fastdnaml_scripts.txt

📁 fastDNAml is an attempt to solve the same problem as DNAML, but to do so faster and using less memo
💻 TXT
📖 第 1 页 / 共 2 页
字号:
上一页 12
initially placing a new sequence in the growing tree (but does not change thetime required to subsequently test rearrangements).  This will probably becomethe default program behavior in the near future.Any possible downside of the quickadd option would be a decreased frequency offinding the globally optimal tree.  Since you should NEVER depend on a singleorder of addition yielding the best tree, multiple jumble runs will still bethe best way to check the reproducibility of any presumptively optimal tree.Quickadd should let you do this more quickly!RESTART (R)  Format:  restart  checkpoint_file_name  Example: quickadd <archaea.phylip | restart checkpoint.13728 | fastDNAmlAdds a restart option and appends the last tree in the checkpoint file to theinput.  This conflicts with a usertree option.  Note that this script makes avaliant attempt to figure out the beginning location of the last tree in thefile.  In particular, you should not modify lines in the checkpoint file (orany other treefile that you supply) so that they end with a semicolon or aperiod (other than the final line of a tree).  The example will continuebuilding a tree for the archaea.phylip data from whereever the checkpointfile left off.TRANSITION-TRANSVERSION RATIO (T)  Format:  transition  ratio_value  Example: transition 2.0 <archaea.phylip | global 4 | fastDNAmlAdds the transition-transversion option and appropriate auxiliary data lineto the input.  This option with a value of 2.0 (the program's default value)can be used before a global or treefile option with auxiliary data (see theexample).TREEFILE (Y)  Format:  treefile  [format_number]Adds the treefile option, and, if requested, the treefile format auxiliarydata line to the input data.  In this latter case, it is essential that theinput contain (or the treefile command be preceded by) a B, J or T option.The only formats at present are 1 (default Newick format) and 2 (a prologphylip_tree fact).  Example: outgroup 5 <archaea.phylip | treefile | fastDNAml > archaea.out  Example: transition 2.0 <archaea.phylip | treefile 2 | fastDNAmlThe first example will infer a tree for the archaea data, root it on sequence5, and write a tree to treefile.PID, where PID is a number (the process ID offastDNAml).  The second example writes a prolog fact for the tree inferred fromthese same data.  The transition/transversion ratio is supplied with itsdefault value, only so that the treefile format will be found in the input file.USERLENGTHS (L)  Format:  userlengths  Example: usertree archaea.tree <archaea.phylip | userlengths | fastDNAmlAdds the userlengths option to the input.  This must be used in conjunctionwith the usertree option, so it is usually easier to add the second parameterto the usertree script (see below).  This script is useful if the usertree isalready part of the input, and you only need to alter the lengths option.USERTREE (U)  Format: usertree  tree_file [ L ]Adds the usertree option and appends the tree count and all of the trees inthe named file to the input.  If the L parameter is included, it is added tothe options line to invoke the program userlengths option.  Example: usertree archaea.tree <archaea.phylip | userlengths | fastDNAml  Example  usertree archaea.tree L <archaea.phylip | fastDNAmlThese two examples are equivalent.All of the trees in the file are added, so this is not an appropriate way toevaluate the last tree in a checkpoint file.  Restart is the script to use forthat latter task.WEIGHTS (W)  Format:  weights  weight_data_file  Example: weights  archaea.weight <archaea.phylip | fastDNAmlAdds the weights option (W) and the corresponding data to the input.  The datamust have the format specified for PHYLIP dnaml 3.3.  The first line must bethe word Weights and AT LEAST 3 BLANK SPACES, followed by one weight value persequence position.  The weights 0 - 35 are represented by the series 0, 1, 2,3, ..., 8, 9, A, B, C, ..., Y, Z.  These latter data can be on one or morelines.  For example,  Weights     111111111111001100000100011111100000000000000110000110000000              111101111111111111111111011100000111001011100000000011WEIGHTS and CATEGORIES (W C)  Format: weights_categories  weight_and_category_data_fileAdds both the userweights and categories options to the input, along with theweights and categories data in the named file.  This is useful for importingthe data from the DNArates program into fastDNAml.  The format of the datafile should conform to that of weights auxiliary data followed by categoriesauxiliary data.  As noted for the categories option above, in order togenerate output compatible with PHYLIP dnaml, this should be the first optionadded (so that the categories data are inserted immediately before thesequence data).              SCRIPTS THAT PERFORM OTHER USEFUL FUNCTIONSclean_checkpointsUsage:  clean_checkpointsLocates checkpoint files for which there is a corresponding treefile, anddeletes the former.  This is useful in cleaning up the directory when thetreefile option is in use.clean_jumblesUsage:  clean_jumbles  [ -nosummary ]  file_name_rootFinds all files with the form "file_name_root.PID", where PID is the processID appended by fastDNAml_boot and fastDNAml_loop to results from analyses withdifferent jumble seeds.  Unless the nosummary flag is included, all likelihoodsare summarized in a file named "file_name_root.summary".  The likelihoods arescreened for the best value.  An output file with the optimal score is renamedto "file_name_root.out", and the corresponding treefile (if any) is renamed"file_name_root.tree".  All other treefiles and output files matching the nameare deleted.  Corresponding checkpoint files are also deleted.fastDNAml_bootUsage:  fastDNAml_boot [-seed seed]  [-boots nboot] \                       [-max maxjumble]  [-nice nicevalue]  [-noclean] \                       in_file  n_best  [ "dnaml_opt1 [ | dnaml_opt2 ... ]" ]For the current bootstrap seed, the sequence input order is jumbled (up tomaxjumble times) until the same best tree is found n_best times.  The outputfiles are then reduced to a summary of the scores produced by jumbling, and oneexample of the best tree.  The number process is then repeated with newbootstrap seeds until nboot samples have been analyzed.Boot, jumble, treefile and quickadd are included by the script and should notbe specified by the user or in the data file.  Additional爁astDNAmlprogram options are enclosed in quotes, and separated by vertical bars (|).Flags and parameters:  in_file         -- name of the input data file  n_best          -- input order is jumbled (up to maxjumble times) until                     same tree is found n_best times  -boots nboot    -- number of different bootstrap samples (Default=1)  -seed seed      -- seed for first bootstrap (Default is based on the process                     ID and time of day)  -max maxjumble  -- maximum attempts at replicating inferred tree                     (Default=10)  -nice nicevalue -- run fastDNAml with specified nice value (Default=10)  -noclean        -- inhibits cleanup of the output files for the individual                     seedsExample:  fastDNAml_boot -boots 100 -seed 137 -max 8 -nice 20  test_data.phylip  2fastDNAml_loopUsage:  fastDNAml_loop [-max maxjumble]  [-nice nicevalue]  [-noclean] \                       in_file  n_best  [ "dnaml_opt1 [ | dnaml_opt2 ... ]" ]For the given input file, the sequence input order is jumbled (up to maxjumbletimes) until the same best tree is found n_best times.  The output files arethen reduced to a summary of the scores produced by jumbling, and one exampleof the best tree.Jumble, treefile and quickadd are included by the script and should notbe specified by the user or in the data file.  Additional爁astDNAmlprogram options are enclosed in quotes, and separated by vertical bars (|).Flags and parameters:  in_file         -- name of the input data file  n_best          -- input order is jumbled (up to maxjumble times) until                     same tree is found n_best times  -max maxjumble  -- maximum attempts at replicating inferred tree                     (Default=10)  -nice nicevalue -- run fastDNAml with specified nice value (Default=10)  -noclean        -- inhibits cleanup of the output files for the individual                     seedsExample:  fastDNAml_loop -max 15 -nice 20  test_data.phylip  3out.PIDUsage:  out.PID  program  outfile   or   out.PID  -o outfile  program  program_argsWhen a process sends its output to standard out, this script puts it in afile whose name is the outfile.PID, where outfile is the name given by theuser and PID is the process ID of the running program.  This is useful whena program is run a number of times, and output needs to be put in differentfiles.  Since the fastDNAml program uses the PID in generating the checkpointfile name and the tree file name, it is useful to be able to put the programoutput itself in a file with the corresponding suffix.scoresUsage:  scores  filesExtracts and sorts likelihood values from fastDNAml output files.  Thecorresponding file names are also provided.trees2NEXUSUsage:  trees2NEXUS  [ treefile ... ]Concatenate Newick format trees from named file(s) or from standard inputinto a single file suitable for the PAUP (tested) or MacClade (untested)programs.  For example, it is possible to combine trees from bootstrapreplicates in order to use the PAUP concensus tree functions.trees2prologUsage:  trees2prolog  [ treefile ... ]Accepts Newick format trees from named file(s) or from standard input andas converts them to prolog facts of the form pseudoNewick(Comment, Tree).Each tree must be entirely on one line, and they must be similar to theform produced by the fastDNAml program.  In particular, the only permittedcomment is one preceeding the whole tree.  If there is no leading commment,then the fact produced will have arity 1.
上一页 12
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -