📄 readme.pvm_3.3

📁 序列对齐 Compare a protein sequence to a protein sequence database or a DNA sequence to a DNA sequenc
💻 3
字号:
 $Name: fa35_03_06 $ - $Id: readme.pvm_3.3,v 1.15 2007/06/29 20:23:58 wrp Exp $================pvcomp* - FAQ's, November, 1999(The comments below apply to the pv3comp* programs.  This problem hasbeen addressed in the pv4comp* programs, by dramatically changingthe way databases are distributed.)I believe that the number one reason why the pvcomp* programs do notwork properly is that the second library must be fully specified.If you simply type:	pv3compfa query.lib database.libThe program will not be able to find database.lib on the worker machines.You need to use:	pv3compfa query.lib /home/user/lib/database.liband /home/user/lib/database.lib must be accessible to all of the workernodes.To find error messages from the workers, look at /tmp/pvml.uid, whereuid is your unix uid.================Program summary:Programs to produce conventional scores and alignments:pv3compfa	protein vs protein, DNA vs DNApv3compsw	protein vs protein, DNA vs DNApv3compfx/	DNA vs proteinpv3comptfx/y	protein vs DNAPrograms to summarize the effectiveness of a search (requiresuper-family-labeled databases):ps3compfa	protein vs protein, DNA vs DNAps3compsw	protein vs protein, DNA vs DNAps3compfx/	DNA vs proteinps3comptfx/y	protein vs DNAPrograms to report the scores and alignments of the highest scoringunrelated sequence (require super-family-labeled databases). Theseprograms are used to evaluate the super-family labeling.pu3compfa	protein vs protein, DNA vs DNApu3compsw	protein vs protein, DNA vs DNApucompfx/	DNA vs proteinpu3comptfx/y	protein vs DNANote that the current parallel implementations distribute the seconddatabase among 'N' parallel workers by approximately dividing thedatabase into 'N' parts by seeking into the middle of the database andfinding the next entry.  This strategy fails when the database is asingle long sequence (the first worker gets the entire database, theothers get nothing).================Release notes:--> July 18, 2000Increase SQSZ in pxgetaa.c to 200000 for long Genbank entries.  Thismay still not be long enough.  This increase may allow overlaps tooccur.--> July 10, 2000Corrections to the code for breaking up very long sequences.  The lastportion of a long sequence did not have the correct offset.--> July 1, 2000Modified pxgetaa.c to read Genbank flatfiles.Additional pieces of a long sequence no longer have a '+' at thebeginning.--> June 12, 2000Restructured p_complib.c, p_workcomp.c to make the -m 9 display moreconsistent with the fast33(_t) set of programs.  The alignment (%_id,swscore, boundary) information is now calculated at the do_opt() stageof the calculation.  This rearrangement uncovered a problem with thedo_opt() stage (s_func=1) that has been fixed.  This has not yet beentested with the MPI implementation.Many changes were made to allow k_H, k_comp information to be passedback so that the -z 6 scaleswn.c (proc_hist_mle2) function could beused.--> February 6, 2000Corrected some problems with proc_hist_ml() to correctly reinitializehist_db_size and num_db_entries.--> January 20, 2000   The structure of the p[vsu]comp* programs has not changed, but thethe code has been modified to accomodate both PVM and MPI versions ofthe programs from the same source code.  Thus, all of the PVM-specificcode is now surrounded by #ifdef PVM_SRC/#endif.  The source filespvcomplib.c and pvworkcomp.c have been replaced by p_complib.c andp_workcomp.c, respectively.  Additional changes were made to ensurethat "FIRSTNODE" is used appropriately.  In general, FIRSTNODE=0 forPVM programs (although with > 8 nodes, FIRSTNODE=1 may be moreeffective), but FIRSTNODE=1 for MPI programs.  Modest changes were made to reduce warning messages duringcompilation.--> January, 2000   Modification to hxgetaa.c, pxgetaa.c to handle library sequences,such as those from NCBI/NR, with very long comment lines.  Additionalmodifications to correct problems with long comments, long DNAsequences with pv3comptfx/tfy.--> v3.33	December, 1999Substantial updates to pvcomplib.c/pvworkcomp.c to improve efficiencyand to provide pv3compf[xy] and pv3comptf[xy].  Previous versions ofpvcomplib.c/pvworkcomp.c passed the entire struct mngmsg (structs.h)each time a new query was initiated or alignments were required.  Thisversion sends struct mngmsg only once and sends struct qmng_str(w_msg.h), which is much smaller, for the queries and alignments. Inaddition, the buffer size for results is now variable (but can be aslarge as 1200, vs 600 previously), which may improve performance whenlarge numbers of workers are available.  The maximum number of librarysequences per worker has been raised to 200,000 from 50,000.Nevertheless, very large databases (est_human) may have too manyentries to be examined by 4 workers.It is likely that pv3comptf[xy] may have problems with very longsequences.  pv3compf[xy]/tf[xy] have not been tested extensively.--> v3.32 December, 1999Substantial corrections to showsum.c (showbest()) for the case of DNAqueries, where two scores are calculated for each query.  As a resultof the changes, bptr[] no longer mapped exactly to best[], whichcaused a bug that was very difficult to track down.  To ensure thatbptr[]=best[], bptr[] is now re-initialized for each query.The output format has changed significantly as well.  Lots ofredundant /** **/ comments have been removed.  An E() value has beenadded to the "equ num:" line in showsum.c.The organization of the inner while() loop in pvcomplib.c has beenmodified so that new query sequences can be sent to workersimmediately as soon as a worker is available, rather than waiting forall to finish and the statistical analysis.--> v3.30	October, 1999The p*comp*/c.work* programs have been renamed to pv3compfa,ps3compfa, etc.  and c3.work* so that the older version 3.2 programscan co-exist with this version.Corrected problem with "-n" option that prevented it from functioningproperly.  Include "ACGTCN" in check for DNA query library.a(from readme.pvm_3.2)--> August, 1999Corrected problem with opt_cut initialization that only appearedwith p?compfa programs.--> v3.26	July, 1999pvcomp* programs now use the same method for working with forward andreverse strands as the standard fast*3(_t) programs.  Thus, statisticsfor DNA sequences should be very similar for pvcompfa and fasta3 orfasta3_t. 		February, 1999With release fasta32t02 of the FASTA package, the alignmentroutines for pvcompfa, pvcompsw, etc now work properlyagain.The PVM versions of the FASTA and Smith-Waterman search programsshould now be functionally identical to the multithreaded (fasta3_t,ssearch3_t) and non-threaded (fasta3, ssearch3) versions.The programs have also been updated to provide similar -m 10information to the non-pvm versions.  There are some slightdifferences, because the pvcomp* versions are designed to work withmultiple sequences.  But, in general, a script that looks for /^>>>/to start an alignment set and /^>>><<</ to end the set workproperly.--> v3.23	March, 1999 Modified Makefile.pvm, showsum.c so that showsum.c is used byboth the complib/_thr and pvcomplib (pvm parallel) versions.Corrected bug in reading first query for DNA sequences.--> v3.25	May, 1999Fixed pvm_showalign.c so that FIRSTNODE (in msg.h) can be 1, ratherthan 0.  #define FIRSTNODE 1 is recommended when the virtual machinehas 8 or more nodes.
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -