📄 squid-format.tex

📁 hmmer源程序

💻 TEX

字号:

\newpage\section {SQUID format}SQUID format is a sequence database format similar to the PIR,GenBank, and EMBL formats. The primary difference is that SQUID formatmay optionally contain secondary structure annotation information forthe sequence. No other sequence format allows secondary structureannotation, which is why SQUID format became necessary.An example SQUID format file:\begin{verbatim}NAM  DY9990SRC  HSTGYA M27547 76..169::196DES  Human Tyr-tRNA gene, clone pM6.SEQ  +SS       1 ccttcgatagctcagctggtagagcggaggactgtagactgcggaaacgt         >>>>>>>..>>>>........<<<<.>>>>>...................      51 ttgtggacatccttaggtcgctggttcaattccggctcgaagga         .........<<<<<.....>>>>>.......<<<<<<<<<<<<.++NAM  DY9991SRC  HSTRNAYE M55611 1..93::93DES  Human Tyr-tRNA precursor.SEQ  +SS       1 ccttcgatagctcagctggtagagcggaggactgtagcctgtagaaacat         >>>>>>>..>>>>........<<<<.>>>>>...................      51 ttgtggacatccttaggtcgctggttcgattccggctcgaagg         .........<<<<<.....>>>>>.......<<<<<<<<<<<<++NAM  DA0260SEQ       1 GGGCGAAUAGUGUCAGCGGGAGCACACCAGACUUGCAAUCUGGUAGGGAG      51 GGUUCGAGUCCCUCUUUGUCCACCA++\end{verbatim}\subsection {Specification of a SQUID file}\begin{enumerate}\item There must be a line of the form \verb+NAM  <sequence name>+.\item There may be an optional line \verb+SRC <id> <acc><start>..<stop>::<olen>+, which specified a database source for thissequence, giving the database identifier (name), accession number,start and end position in the database sequence, and the originallength of the database sequence, respectively.  If a \verb+SRC+ lineis present, all of these values must be specified.  If any values areunknown, they may be set to \verb+-+ in the case of \verb+<id>+ and\verb+<acc>+ and \verb+0+ in the case of \verb+<start>+, \verb+<stop>,and \verb+<olen>+, and in these cases the values will be ignored.\item There may be an optional line  \verb+DES <description>+ givinga one-line description of the sequence.\item There must be a line of the form \verb-SEQ +SS- or \verb-SEQ-.If the line contains \verb-+SS-, it means that the record containssecondary structure annotation interleaved with the sequence.\item The sequence (and optional structure) immediately follow. There may beoptional numbering either before or after the sequence. The number ofcharacters per line is unimportant. Spaces and tabs are ignored.There must be no non-numeric non-space characters on any lines exceptsequence or structure annotation characters. Structure annotation isfairly free-form; any alphabetic character or character in the set\verb/_.-*?<>{}[]()!@#$%^&=+;:'|`~"\/ is accepted. There mustbe one such character for every sequence character (preferably alignedto the sequence, but in fact this is not checked for). Note thatspaces in the secondary structure annotation are not permitted,except where they are aligned to gaps in the sequence.\item Sequence records are separated by a line of the form \verb-++-.\end{enumerate}

💿 文件大小 1007 K

👤 上传用户 haowoainime

📂 所属分类 *行业应用

🏷️ 相关标签

#hmmer #源程序

⌨️ 快捷键说明

复制代码 Ctrl + C

搜索代码 Ctrl + F

全屏模式 F11

切换主题 Ctrl + Shift + D

显示快捷键 ?

增大字号 Ctrl + =

减小字号 Ctrl + -