📄 backtranambig.txt

📁 emboss的linux版本的源代码
💻 TXT
字号:
                               backtranambig Function   Back translate a protein sequence to ambiguous codonsDescription   backtranambig takes a protein sequence and makes the nucleic acid   sequence it could have come from. It does this by generating   nucleotide ambiguity codes that represent all possible codons for each   amino acid.   The resulting ambiguous nucleotide sequence can be translated to the   original protein using transeq, which will recognise highly redundant   codons (for example "WSN" for serine) as being produced by a program   such as backtranambig.  Genetic code   backtranambig needs a genetic code to generate the ambiguous codons.   The default genetic code is the standard ('Universal') code.Usage   Here is a sample session with backtranambig% backtranambig Back translate a protein sequence to ambiguous codonsInput (gapped) protein sequence: tsw:opsd_human(gapped) nucleotide output sequence [opsd_human.fasta]:    Go to the input files for this example   Go to the output files for this exampleCommand line arguments   Standard (Mandatory) qualifiers:  [-sequence]          sequence   (Gapped) protein sequence filename and                                  optional format, or reference (input USA)  [-outfile]           seqout     [.] (Gapped) nucleotide                                  sequence filename and optional format                                  (output USA)   Additional (Optional) qualifiers:   -table              menu       [0] Genetic code to use (Values: 0                                  (Standard); 1 (Standard (with alternative                                  initiation codons)); 2 (Vertebrate                                  Mitochondrial); 3 (Yeast Mitochondrial); 4                                  (Mold, Protozoan, Coelenterate Mitochondrial                                  and Mycoplasma/Spiroplasma); 5                                  (Invertebrate Mitochondrial); 6 (Ciliate                                  Macronuclear and Dasycladacean); 9                                  (Echinoderm Mitochondrial); 10 (Euplotid                                  Nuclear); 11 (Bacterial); 12 (Alternative                                  Yeast Nuclear); 13 (Ascidian Mitochondrial);                                  14 (Flatworm Mitochondrial); 15                                  (Blepharisma Macronuclear); 16                                  (Chlorophycean Mitochondrial); 21 (Trematode                                  Mitochondrial); 22 (Scenedesmus obliquus);                                  23 (Thraustochytrium Mitochondrial))   Advanced (Unprompted) qualifiers: (none)   Associated qualifiers:   "-sequence" associated qualifiers   -sbegin1            integer    Start of the sequence to be used   -send1              integer    End of the sequence to be used   -sreverse1          boolean    Reverse (if DNA)   -sask1              boolean    Ask for begin/end/reverse   -snucleotide1       boolean    Sequence is nucleotide   -sprotein1          boolean    Sequence is protein   -slower1            boolean    Make lower case   -supper1            boolean    Make upper case   -sformat1           string     Input sequence format   -sdbname1           string     Database name   -sid1               string     Entryname   -ufo1               string     UFO features   -fformat1           string     Features format   -fopenfile1         string     Features file name   "-outfile" associated qualifiers   -osformat2          string     Output seq format   -osextension2       string     File name extension   -osname2            string     Base file name   -osdirectory2       string     Output directory   -osdbname2          string     Database name to add   -ossingle2          boolean    Separate file for each entry   -oufo2              string     UFO features   -offormat2          string     Features format   -ofname2            string     Features file name   -ofdirectory2       string     Output directory   General qualifiers:   -auto               boolean    Turn off prompts   -stdout             boolean    Write standard output   -filter             boolean    Read standard input, write standard output   -options            boolean    Prompt for standard and additional values   -debug              boolean    Write debug output to program.dbg   -verbose            boolean    Report some/full command line options   -help               boolean    Report command line options. More                                  information on associated and general                                  qualifiers can be found with -help -verbose   -warning            boolean    Report warnings   -error              boolean    Report errors   -fatal              boolean    Report fatal errors   -die                boolean    Report dying program messagesInput file format   Any DNA sequence USA.  Input files for usage example   'tsw:opsd_human' is a sequence entry in the example protein database   'tsw'  Database entry: tsw:opsd_humanID   OPSD_HUMAN     STANDARD;      PRT;   348 AA.AC   P08100; Q16414;DT   01-AUG-1988 (Rel. 08, Created)DT   01-AUG-1988 (Rel. 08, Last sequence update)DT   15-JUL-1999 (Rel. 38, Last annotation update)DE   RHODOPSIN.GN   RHO.OS   Homo sapiens (Human).OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia;OC   Eutheria; Primates; Catarrhini; Hominidae; Homo.RN   [1]RP   SEQUENCE FROM N.A.RX   MEDLINE; 84272729.RA   NATHANS J., HOGNESS D.S.;RT   "Isolation and nucleotide sequence of the gene encoding humanRT   rhodopsin.";RL   Proc. Natl. Acad. Sci. U.S.A. 81:4851-4855(1984).RN   [2]RP   SEQUENCE OF 1-120 FROM N.A.RA   BENNETT J., BELLER B., SUN D., KARIKO K.;RL   Submitted (NOV-1994) to the EMBL/GenBank/DDBJ databases.RN   [3]RP   REVIEW ON ADRP VARIANTS.RX   MEDLINE; 94004905.RA   AL-MAGHTHEH M., GREGORY C., INGLEHEARN C., HARDCASTLE A.,RA   BHATTACHARYA S.;RT   "Rhodopsin mutations in autosomal dominant retinitis pigmentosa.";RL   Hum. Mutat. 2:249-255(1993).RN   [4]RP   VARIANT ADRP HIS-23.RX   MEDLINE; 90136922.RA   DRYJA T.P., MCGEE T.L., REICHEI E., HAHN L.B., COWLEY G.S.,RA   YANDELL D.W., SANDBERG M.A., BERSON E.L.;RT   "A point mutation of the rhodopsin gene in one form of retinitisRT   pigmentosa.";RL   Nature 343:364-366(1990).RN   [5]RP   VARIANTS ADRP.RX   MEDLINE; 91051574.RA   FARRAR G.J., KENNA P., REDMOND R., MCWILLIAM P., BRADLEY D.G.,RA   HUMPHRIES M.M., SHARP E.M., INGLEHEARN C.F., BASHIR R., JAY M.,RA   WATTY A., LUDWIG M., SCHINZEL A., SAMANNS C., GAL A.,RA   BHATTACHARYA S.S., HUMPHRIES P.;RT   "Autosomal dominant retinitis pigmentosa: absence of the rhodopsinRT   proline-->histidine substitution (codon 23) in pedigrees fromRT   Europe.";RL   Am. J. Hum. Genet. 47:941-945(1990).RN   [6]RP   VARIANTS ADRP HIS-23; ARG-58; LEU-347 AND SER-347.RX   MEDLINE; 91015273.  [Part of this file has been deleted for brevity]FT                                /FTId=VAR_004816.FT   VARIANT     209    209       V -> M (EFFECT NOT KNOWN).FT                                /FTId=VAR_004817.FT   VARIANT     211    211       H -> P (IN ADRP).FT                                /FTId=VAR_004818.FT   VARIANT     211    211       H -> R (IN ADRP).FT                                /FTId=VAR_004819.FT   VARIANT     216    216       M -> K (IN ADRP).FT                                /FTId=VAR_004820.FT   VARIANT     220    220       F -> C (IN ADRP).FT                                /FTId=VAR_004821.FT   VARIANT     222    222       C -> R (IN ADRP).FT                                /FTId=VAR_004822.FT   VARIANT     255    255       MISSING (IN ADRP).FT                                /FTId=VAR_004823.FT   VARIANT     264    264       MISSING (IN ADRP).FT                                /FTId=VAR_004824.FT   VARIANT     267    267       P -> L (IN ADRP).FT                                /FTId=VAR_004825.FT   VARIANT     267    267       P -> R (IN ADRP).FT                                /FTId=VAR_004826.FT   VARIANT     292    292       A -> E (IN CSNB4).FT                                /FTId=VAR_004827.FT   VARIANT     296    296       K -> E (IN ADRP).FT                                /FTId=VAR_004828.FT   VARIANT     297    297       S -> R (IN ADRP).FT                                /FTId=VAR_004829.FT   VARIANT     342    342       T -> M (IN ADRP).FT                                /FTId=VAR_004830.FT   VARIANT     345    345       V -> L (IN ADRP).FT                                /FTId=VAR_004831.FT   VARIANT     345    345       V -> M (IN ADRP).FT                                /FTId=VAR_004832.FT   VARIANT     347    347       P -> A (IN ADRP).FT                                /FTId=VAR_004833.FT   VARIANT     347    347       P -> L (IN ADRP; COMMON VARIANT).FT                                /FTId=VAR_004834.FT   VARIANT     347    347       P -> Q (IN ADRP).FT                                /FTId=VAR_004835.FT   VARIANT     347    347       P -> R (IN ADRP).FT                                /FTId=VAR_004836.FT   VARIANT     347    347       P -> S (IN ADRP).FT                                /FTId=VAR_004837.SQ   SEQUENCE   348 AA;  38892 MW;  07443BEA CRC32;     MNGTEGPNFY VPFSNATGVV RSPFEYPQYY LAEPWQFSML AAYMFLLIVL GFPINFLTLY     VTVQHKKLRT PLNYILLNLA VADLFMVLGG FTSTLYTSLH GYFVFGPTGC NLEGFFATLG     GEIALWSLVV LAIERYVVVC KPMSNFRFGE NHAIMGVAFT WVMALACAAP PLAGWSRYIP     EGLQCSCGID YYTLKPEVNN ESFVIYMFVV HFTIPMIIIF FCYGQLVFTV KEAAAQQQES     ATTQKAEKEV TRMVIIMVIA FLICWVPYAS VAFYIFTHQG SNFGPIFMTI PAFFAKSAAI     YNPVIYIMMN KQFRNCMLTT ICCGKNPLGD DEASATVSKT ETSQVAPA//Output file format   The output is a nucleotide sequence containing the most favoured back   translation of the specified protein, and using the specified   translation table (which defaults to human).  Output files for usage example  File: opsd_human.fasta>OPSD_HUMAN P08100 RHODOPSIN.ATGAAYGGNACNGARGGNCCNAAYTTYTAYGTNCCNTTYWSNAAYGCNACNGGNGTNGTNMGNWSNCCNTTYGARTAYCCNCARTAYTAYYTNGCNGARCCNTGGCARTTYWSNATGYTNGCNGCNTAYATGTTYYTNYTNATHGTNYTNGGNTTYCCNATHAAYTTYYTNACNYTNTAYGTNACNGTNCARCAYAARAARYTNMGNACNCCNYTNAAYTAYATHYTNYTNAAYYTNGCNGTNGCNGAYYTNTTYATGGTNYTNGGNGGNTTYACNWSNACNYTNTAYACNWSNYTNCAYGGNTAYTTYGTNTTYGGNCCNACNGGNTGYAAYYTNGARGGNTTYTTYGCNACNYTNGGNGGNGARATHGCNYTNTGGWSNYTNGTNGTNYTNGCNATHGARMGNTAYGTNGTNGTNTGYAARCCNATGWSNAAYTTYMGNTTYGGNGARAAYCAYGCNATHATGGGNGTNGCNTTYACNTGGGTNATGGCNYTNGCNTGYGCNGCNCCNCCNYTNGCNGGNTGGWSNMGNTAYATHCCNGARGGNYTNCARTGYWSNTGYGGNATHGAYTAYTAYACNYTNAARCCNGARGTNAAYAAYGARWSNTTYGTNATHTAYATGTTYGTNGTNCAYTTYACNATHCCNATGATHATHATHTTYTTYTGYTAYGGNCARYTNGTNTTYACNGTNAARGARGCNGCNGCNCARCARCARGARWSNGCNACNACNCARAARGCNGARAARGARGTNACNMGNATGGTNATHATHATGGTNATHGCNTTYYTNATHTGYTGGGTNCCNTAYGCNWSNGTNGCNTTYTAYATHTTYACNCAYCARGGNWSNAAYTTYGGNCCNATHTTYATGACNATHCCNGCNTTYTTYGCNAARWSNGCNGCNATHTAYAAYCCNGTNATHTAYATHATGATGAAYAARCARTTYMGNAAYTGYATGYTNACNACNATHTGYTGYGGNAARAAYCCNYTNGGNGAYGAYGARGCNWSNGCNACNGTNWSNAARACNGARACNWSNCARGTNGCNCCNGCNData files   The codon usage table is read by default from "Ehum.cut" in the   'data/CODONS' directory of the EMBOSS distribution. If the name of a   codon usage file is specified on the command line, then this file will   first be searched for in the current directory and then in the   'data/CODONS' directory of the EMBOSS distribution.   EMBOSS data files are distributed with the application and stored in   the standard EMBOSS data directory, which is defined by the EMBOSS   environment variable EMBOSS_DATA.   To see the available EMBOSS data files, run:% embossdata -showall   To fetch one of the data files (for example 'Exxx.dat') into your   current directory for you to inspect or modify, run:% embossdata -fetch -file Exxx.dat   Users can provide their own data files in their own directories.   Project specific files can be put in the current directory, or for   tidier directory listings in a subdirectory called ".embossdata".   Files for all EMBOSS runs can be put in the user's home directory, or   again in a subdirectory called ".embossdata".   The directories are searched in the following order:     * . (your current directory)     * .embossdata (under your current directory)     * ~/ (your home directory)     * ~/.embossdataNotes   None.References   None.Warnings   None.Diagnostic Error Messages   "Corrupt codon index file" - the codon usage file is incomplete or   empty.   "The file 'drosoph.cut' does not exist" - the codon usage file cannot   be opened.Exit status   This program always exits with a status of 0, unless the codon usage   table cannot be opened.Known bugs   None.See also   Program name                        Description   backtranseq  Back translate a protein sequence   charge       Protein charge plot   checktrans   Reports STOP codons and ORF statistics of a protein   coderet      Extract CDS, mRNA and translations from feature tables   compseq      Count composition of dimer/trimer/etc words in a sequence   emowse       Protein identification by mass spectrometry   freak        Residue/base frequency table or plot   iep          Calculates the isoelectric point of a protein   mwcontam     Shows molwts that match across a set of files   mwfilter     Filter noisy molwts from mass spec output   octanol      Displays protein hydropathy   pepinfo      Plots simple amino acid properties in parallel   pepstats     Protein statistics   pepwindow    Displays protein hydropathy   pepwindowall Displays protein hydropathy of a set of sequences   plotorf      Plot potential open reading frames   prettyseq    Output sequence with translated ranges   remap        Display sequence with restriction sites, translation etc   showorf      Pretty output of DNA translations   showseq      Display a sequence with features, translation etc   sixpack      Display a DNA sequence with 6-frame translation and ORFs   transeq      Translate nucleic acid sequencesAuthor(s)   Alan Bleasby (ajb
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -