📄 squid-format.tex
字号:
\newpage\section {SQUID format}SQUID format is a sequence database format similar to the PIR,GenBank, and EMBL formats. The primary difference is that SQUID formatmay optionally contain secondary structure annotation information forthe sequence. No other sequence format allows secondary structureannotation, which is why SQUID format became necessary.An example SQUID format file:\begin{verbatim}NAM DY9990SRC HSTGYA M27547 76..169::196DES Human Tyr-tRNA gene, clone pM6.SEQ +SS 1 ccttcgatagctcagctggtagagcggaggactgtagactgcggaaacgt >>>>>>>..>>>>........<<<<.>>>>>................... 51 ttgtggacatccttaggtcgctggttcaattccggctcgaagga .........<<<<<.....>>>>>.......<<<<<<<<<<<<.++NAM DY9991SRC HSTRNAYE M55611 1..93::93DES Human Tyr-tRNA precursor.SEQ +SS 1 ccttcgatagctcagctggtagagcggaggactgtagcctgtagaaacat >>>>>>>..>>>>........<<<<.>>>>>................... 51 ttgtggacatccttaggtcgctggttcgattccggctcgaagg .........<<<<<.....>>>>>.......<<<<<<<<<<<<++NAM DA0260SEQ 1 GGGCGAAUAGUGUCAGCGGGAGCACACCAGACUUGCAAUCUGGUAGGGAG 51 GGUUCGAGUCCCUCUUUGUCCACCA++\end{verbatim}\subsection {Specification of a SQUID file}\begin{enumerate}\item There must be a line of the form \verb+NAM <sequence name>+.\item There may be an optional line \verb+SRC <id> <acc><start>..<stop>::<olen>+, which specified a database source for thissequence, giving the database identifier (name), accession number,start and end position in the database sequence, and the originallength of the database sequence, respectively. If a \verb+SRC+ lineis present, all of these values must be specified. If any values areunknown, they may be set to \verb+-+ in the case of \verb+<id>+ and\verb+<acc>+ and \verb+0+ in the case of \verb+<start>+, \verb+<stop>,and \verb+<olen>+, and in these cases the values will be ignored.\item There may be an optional line \verb+DES <description>+ givinga one-line description of the sequence.\item There must be a line of the form \verb-SEQ +SS- or \verb-SEQ-.If the line contains \verb-+SS-, it means that the record containssecondary structure annotation interleaved with the sequence.\item The sequence (and optional structure) immediately follow. There may beoptional numbering either before or after the sequence. The number ofcharacters per line is unimportant. Spaces and tabs are ignored.There must be no non-numeric non-space characters on any lines exceptsequence or structure annotation characters. Structure annotation isfairly free-form; any alphabetic character or character in the set\verb/_.-*?<>{}[]()!@#$%^&=+;:'|`~"\/ is accepted. There mustbe one such character for every sequence character (preferably alignedto the sequence, but in fact this is not checked for). Note thatspaces in the secondary structure annotation are not permitted,except where they are aligned to gaps in the sequence.\item Sequence records are separated by a line of the form \verb-++-.\end{enumerate}
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -