📄 biosed.txt

📁 emboss的linux版本的源代码
💻 TXT
字号:
                                  biosed Function   Replace or delete sequence sectionsDescription   biosed is a simple sequence editing utility that searches for a target   subsequence in one or more input sequences and replaces it with a   specified second subsequence (or optionally just deletes the found   target subsequence).   biosed was inspired by the useful UNIX utility sed which searches for   a pattern in text and can replace or delete the found pattern.   If the target subsequence occurs more than once, then each instance of   the target is replaced.   The target subsequence is not any sort of an ambiguity pattern, it is   just a short sequence. A simple string match is done and if it exactly   matches then the replacement is done. The matching is independent of   the case of the sequence or the target - both uppercase and lowercase   will match.Usage   Here is a sample session with biosed   Replace all 'T's with 'U's to create an RNA sequence% biosed tembl:hsfau hsfau.rna -target T -replace U Replace or delete sequence sections   Go to the input files for this example   Go to the output files for this example   Example 2   Replace all 'PPP' protein motifs with 'XXPPPXX'% biosed tsw:AMIR_PSEAE AMIR_PSEAE.pep -target PPP -replace XXPPPXX Replace or delete sequence sections   Go to the input files for this example   Go to the output files for this exampleCommand line arguments   Standard (Mandatory) qualifiers (* if not always prompted):  [-sequence]          seqall     (Gapped) sequence(s) filename and optional                                  format, or reference (input USA)   -target             string     [N] Sequence section to match (Any string is                                  accepted)*  -replace            string     [A] Replacement sequence section (Any string                                  is accepted)  [-outseq]            seqout     [.] Sequence filename and                                  optional format (output USA)   Additional (Optional) qualifiers:   -position           integer    [0] Sequence position to match (Integer 0 or                                  more)   Advanced (Unprompted) qualifiers:   -delete             toggle     [N] Delete the target sequence sections   Associated qualifiers:   "-sequence" associated qualifiers   -sbegin1            integer    Start of each sequence to be used   -send1              integer    End of each sequence to be used   -sreverse1          boolean    Reverse (if DNA)   -sask1              boolean    Ask for begin/end/reverse   -snucleotide1       boolean    Sequence is nucleotide   -sprotein1          boolean    Sequence is protein   -slower1            boolean    Make lower case   -supper1            boolean    Make upper case   -sformat1           string     Input sequence format   -sdbname1           string     Database name   -sid1               string     Entryname   -ufo1               string     UFO features   -fformat1           string     Features format   -fopenfile1         string     Features file name   "-outseq" associated qualifiers   -osformat2          string     Output seq format   -osextension2       string     File name extension   -osname2            string     Base file name   -osdirectory2       string     Output directory   -osdbname2          string     Database name to add   -ossingle2          boolean    Separate file for each entry   -oufo2              string     UFO features   -offormat2          string     Features format   -ofname2            string     Features file name   -ofdirectory2       string     Output directory   General qualifiers:   -auto               boolean    Turn off prompts   -stdout             boolean    Write standard output   -filter             boolean    Read standard input, write standard output   -options            boolean    Prompt for standard and additional values   -debug              boolean    Write debug output to program.dbg   -verbose            boolean    Report some/full command line options   -help               boolean    Report command line options. More                                  information on associated and general                                  qualifiers can be found with -help -verbose   -warning            boolean    Report warnings   -error              boolean    Report errors   -fatal              boolean    Report fatal errors   -die                boolean    Report dying program messagesInput file format   It reads the USA of one or more nucleic acid or protein sequences.  Input files for usage example   'tembl:hsfau' is a sequence entry in the example nucleic acid database   'tembl'  Database entry: tembl:hsfauID   HSFAU      standard; RNA; HUM; 518 BP.XXAC   X65923;XXSV   X65923.1XXDT   13-MAY-1992 (Rel. 31, Created)DT   23-SEP-1993 (Rel. 37, Last updated, Version 10)XXDE   H.sapiens fau mRNAXXKW   fau gene.XXOS   Homo sapiens (human)OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;OC   Eutheria; Primates; Catarrhini; Hominidae; Homo.XXRN   [1]RP   1-518RA   Michiels L.M.R.;RT   ;RL   Submitted (29-APR-1992) to the EMBL/GenBank/DDBJ databases.RL   L.M.R. Michiels, University of Antwerp, Dept of Biochemistry,RL   Universiteisplein 1, 2610 Wilrijk, BELGIUMXXRN   [2]RP   1-518RX   MEDLINE; 93368957.RA   Michiels L., Van der Rauwelaert E., Van Hasselt F., Kas K., Merregaert J.;RT   " fau cDNA encodes a ubiquitin-like-S30 fusion protein and is expressed asRT   an antisense sequences in the Finkel-Biskis-Reilly murine sarcoma virus";RL   Oncogene 8:2537-2546(1993).XXDR   SWISS-PROT; P35544; UBIM_HUMAN.DR   SWISS-PROT; Q05472; RS30_HUMAN.XXFH   Key             Location/QualifiersFHFT   source          1..518FT                   /chromosome="11q"FT                   /db_xref="taxon:9606"FT                   /organism="Homo sapiens"FT                   /tissue_type="placenta"FT                   /clone_lib="cDNA"FT                   /clone="pUIA 631"FT                   /map="13"FT   misc_feature    57..278FT                   /note="ubiquitin like part"FT   CDS             57..458FT                   /db_xref="SWISS-PROT:P35544"FT                   /db_xref="SWISS-PROT:Q05472"FT                   /gene="fau"FT                   /protein_id="CAA46716.1"FT                   /translation="MQLFVRAQELHTFEVTGQETVAQIKAHVASLEGIAPEDQVVLLAGFT                   APLEDEATLGQCGVEALTTLEVAGRMLGGKVHGSLARAGKVRGQTPKVAKQEKKKKKTGFT                   RAKRRMQYNRRFVNVVPTFGKKKGPNANS"FT   misc_feature    98..102FT                   /note="nucleolar localization signal"FT   misc_feature    279..458FT                   /note="S30 part"FT   polyA_signal    484..489FT   polyA_site      509XXSQ   Sequence 518 BP; 125 A; 139 C; 148 G; 106 T; 0 other;     ttcctctttc tcgactccat cttcgcggta gctgggaccg ccgttcagtc gccaatatgc        60     agctctttgt ccgcgcccag gagctacaca ccttcgaggt gaccggccag gaaacggtcg       120     cccagatcaa ggctcatgta gcctcactgg agggcattgc cccggaagat caagtcgtgc       180     tcctggcagg cgcgcccctg gaggatgagg ccactctggg ccagtgcggg gtggaggccc       240     tgactaccct ggaagtagca ggccgcatgc ttggaggtaa agttcatggt tccctggccc       300     gtgctggaaa agtgagaggt cagactccta aggtggccaa acaggagaag aagaagaaga       360     agacaggtcg ggctaagcgg cggatgcagt acaaccggcg ctttgtcaac gttgtgccca       420     cctttggcaa gaagaagggc cccaatgcca actcttaagt cttttgtaat tctggctttc       480     tctaataaaa aagccactta gttcagtcaa aaaaaaaa                               518//  Input files for usage example 2   'tsw:AMIR_PSEAE' is a sequence entry in the example protein database   'tsw'  Database entry: tsw:AMIR_PSEAEID   AMIR_PSEAE     STANDARD;      PRT;   196 AA.AC   P10932;DT   01-JUL-1989 (Rel. 11, Created)DT   01-JUL-1989 (Rel. 11, Last sequence update)DT   15-DEC-1998 (Rel. 37, Last annotation update)DE   ALIPHATIC AMIDASE REGULATOR.GN   AMIR.OS   Pseudomonas aeruginosa.OC   Bacteria; Proteobacteria; gamma subdivision; Pseudomonas group;OC   Pseudomonas.RN   [1]RP   SEQUENCE FROM N.A.RC   STRAIN=PAC;RX   MEDLINE; 89211409.RA   LOWE N., RICE P.M., DREW R.E.;RT   "Nucleotide sequence of the aliphatic amidase regulator gene (amiR)RT   of Pseudomonas aeruginosa.";RL   FEBS Lett. 246:39-43(1989).RN   [2]RP   CHARACTERIZATION.RX   MEDLINE; 95286483.RA   WILSON S.A., DREW R.E.;RT   "Transcriptional analysis of the amidase operon from PseudomonasRT   aeruginosa.";RL   J. Bacteriol. 177:3052-3057(1995).CC   -!- FUNCTION: POSITIVE CONTROLLING ELEMENT OF AMIE, THE GENE FORCC       ALIPHATIC AMIDASE. ACTS AS A TRANSCRIPTIONAL ANTITERMINATIONCC       FACTOR. IT IS THOUGHT TO ALLOW RNA POLYMERASE READ THROUGH A RHO-CC       INDEPENDENT TRANSCRIPTION TERMINATOR BETWEEN THE AMIE PROMOTER ANDCC       GENE.CC   --------------------------------------------------------------------------CC   This SWISS-PROT entry is copyright. It is produced through a collaborationCC   between  the Swiss Institute of Bioinformatics  and the  EMBL outstation -CC   the European Bioinformatics Institute.  There are no  restrictions on  itsCC   use  by  non-profit  institutions as long  as its content  is  in  no  wayCC   modified and this statement is not removed.  Usage  by  and for commercialCC   entities requires a license agreement (See http://www.isb-sib.ch/announce/CC   or send an email to license@isb-sib.ch).CC   --------------------------------------------------------------------------DR   EMBL; X13776; CAA32023.1; -.DR   PIR; S03884; S03884.KW   Transcription regulation; Activator.SQ   SEQUENCE   196 AA;  21776 MW;  560A8AE3 CRC32;     MSANSLLGSL RELQVLVLNP PGEVSDALVL QLIRIGCSVR QCWPPPEAFD VPVDVVFTSI     FQNGHHDEIA ALLAAGTPRT TLVALVEYES PAVLSQIIEL ECHGVITQPL DAHRVLPVLV     SARRISEEMA KLKQKTEQLQ DRIAGQARIN QAKVLLMQRH GWDEREAHQH LSREAMKRRE     PILKIAQELL GNEPSA//Output file format   The edited sequence is output.   The sequence will be in uppercase.  Output files for usage example  File: hsfau.rna>HSFAU X65923.1 H.sapiens fau mRNAUUCCUCUUUCUCGACUCCAUCUUCGCGGUAGCUGGGACCGCCGUUCAGUCGCCAAUAUGCAGCUCUUUGUCCGCGCCCAGGAGCUACACACCUUCGAGGUGACCGGCCAGGAAACGGUCGCCCAGAUCAAGGCUCAUGUAGCCUCACUGGAGGGCAUUGCCCCGGAAGAUCAAGUCGUGCUCCUGGCAGGCGCGCCCCUGGAGGAUGAGGCCACUCUGGGCCAGUGCGGGGUGGAGGCCCUGACUACCCUGGAAGUAGCAGGCCGCAUGCUUGGAGGUAAAGUUCAUGGUUCCCUGGCCCGUGCUGGAAAAGUGAGAGGUCAGACUCCUAAGGUGGCCAAACAGGAGAAGAAGAAGAAGAAGACAGGUCGGGCUAAGCGGCGGAUGCAGUACAACCGGCGCUUUGUCAACGUUGUGCCCACCUUUGGCAAGAAGAAGGGCCCCAAUGCCAACUCUUAAGUCUUUUGUAAUUCUGGCUUUCUCUAAUAAAAAAGCCACUUAGUUCAGUCAAAAAAAAAA  Output files for usage example 2  File: AMIR_PSEAE.pep>AMIR_PSEAE P10932 ALIPHATIC AMIDASE REGULATOR.MSANSLLGSLRELQVLVLNPPGEVSDALVLQLIRIGCSVRQCWXXPPPXXEAFDVPVDVVFTSIFQNGHHDEIAALLAAGTPRTTLVALVEYESPAVLSQIIELECHGVITQPLDAHRVLPVLVSARRISEEMAKLKQKTEQLQDRIAGQARINQAKVLLMQRHGWDEREAHQHLSREAMKRREPILKIAQELLGNEPSAData files   None.Notes   The edited sequence will be output in uppercase.References   None.Warnings   No check is made on the replacement subsequence.   Any text can be used as the replacement, including characters only   used in proteins (e.g. D, E, F, etc.), characters rarely used in   proteins (e.g. U, J, O, etc), digits and punctuation characters.Diagnostic Error Messages   None.Exit status
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -