⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 第四章.htm

📁 这是一些经典算法的描述
💻 HTM
📖 第 1 页 / 共 5 页
字号:
"Times New Roman"'>)</span><span lang=EN-US style='font-size:10.0pt;font-family:
"Times New Roman"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span><span
style='font-size:10.0pt;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:
"Times New Roman"'>数据质量</span><span lang=EN-US style='font-size:10.0pt;
font-family:"Times New Roman"'><span style="mso-spacerun:
yes">&nbsp;&nbsp;&nbsp; </span></span><span style='font-size:10.0pt;mso-ascii-font-family:
"Times New Roman";mso-hansi-font-family:"Times New Roman"'>必须保证数据库中数据的质量,数据库管理机构应对数据来源进行检查,并且关注数据库用户和专家提出的意见。</span><span
style='font-size:10.0pt;font-family:"Times New Roman"'> </span></p>

<p class=MsoPlainText style='margin-left:36.0pt;text-indent:-36.0pt;line-height:
150%;tab-stops:list 36.0pt'><span style='font-size:10.0pt;mso-ascii-font-family:
"Times New Roman";mso-hansi-font-family:"Times New Roman"'>(</span><span
lang=EN-US style='font-size:10.0pt;font-family:"Times New Roman"'>5</span><span
style='font-size:10.0pt;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:
"Times New Roman"'>)</span><span lang=EN-US style='font-size:10.0pt;font-family:
"Times New Roman"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span><span
style='font-size:10.0pt;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:
"Times New Roman"'>集成性</span><span lang=EN-US style='font-size:10.0pt;
font-family:"Times New Roman"'><span style="mso-spacerun:
yes">&nbsp;&nbsp;&nbsp; </span></span><span style='font-size:10.0pt;mso-ascii-font-family:
"Times New Roman";mso-hansi-font-family:"Times New Roman"'>三种基本生物分子数据库(核酸序列、蛋白质序列、蛋白质结构)的集成对于用户来说是非常重要的。对于数据库中的每一个数据对象,必须与其它数据库中的相关数据联系起来,这样可以从某些分子数据出发得到一系列的相关信息。例如,从某个核酸序列出发,通过交叉索引,可进一步得到对应的基因、蛋白质序列、蛋白质结构,甚至得到蛋白质功能的信息。</span><span
style='font-size:10.0pt;font-family:"Times New Roman"'> </span></p>

<p class=MsoNormal style='mso-margin-top-alt:auto;mso-margin-bottom-alt:auto;
text-indent:21.25pt;line-height:150%'><span style='font-size:10.0pt;font-family:
宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>分子生物学研究领域虽各有重点,但是研究对象之间存在着密切的联系,比如</span><span
lang=EN-US style='font-size:10.0pt'>DNA</span><span style='font-size:10.0pt;
font-family:宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:
"Times New Roman"'>序列与蛋白质序列之间的联系,基因调控信息与基因表达数据之间的联系。因而,实验数据之间就必然存在着关联,一个方面的相关数据可能会影响或促进另一个方面的研究工作。现有的各类数据库已经成为分子生物学各方面交叉研究的桥梁。</span></p>

<p class=MsoNormal style='mso-margin-top-alt:auto;mso-margin-bottom-alt:auto;
text-indent:21.25pt;line-height:150%'><span style='font-size:10.0pt;font-family:
宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>生物分子数据库目前的发展状况有几个明显的特征:</span></p>

<p class=MsoNormal style='mso-margin-top-alt:auto;mso-margin-bottom-alt:auto;
margin-left:36.0pt;text-indent:-36.0pt;line-height:150%;tab-stops:list 36.0pt'><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>(</span><span lang=EN-US
style='font-size:10.0pt'>1</span><span style='font-size:10.0pt;font-family:
宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>)</span><span
lang=EN-US style='font-size:10.0pt'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>生物分子数据库最突出的特征就是数据库的更新速度不断加快,数据量呈指数增长趋势。例如,核酸序列数据的年增长幅度约为</span><span
lang=EN-US style='font-size:10.0pt'>100%</span><span style='font-size:10.0pt;
font-family:宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:
"Times New Roman"'>。</span></p>

<p class=MsoNormal style='mso-margin-top-alt:auto;mso-margin-bottom-alt:auto;
margin-left:36.0pt;text-indent:-36.0pt;line-height:150%;tab-stops:list 36.0pt'><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>(</span><span lang=EN-US
style='font-size:10.0pt'>2</span><span style='font-size:10.0pt;font-family:
宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>)</span><span
lang=EN-US style='font-size:10.0pt'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>数据库使用频率增长更快。人们越来越感到生物分子数据的重要性,也认识到它们的价值,因此,各种数据库的使用人员在不断增加。</span><span
style='font-size:10.0pt'> </span><span style='font-size:10.0pt;font-family:
宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>据统计,数据库的平均使用频率每年增长幅度接近于</span><span
lang=EN-US style='font-size:10.0pt'>500%</span><span style='font-size:10.0pt;
font-family:宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:
"Times New Roman"'>。</span></p>

<p class=MsoNormal style='mso-margin-top-alt:auto;mso-margin-bottom-alt:auto;
margin-left:36.0pt;text-indent:-36.0pt;line-height:150%;tab-stops:list 36.0pt'><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>(</span><span lang=EN-US
style='font-size:10.0pt'>3</span><span style='font-size:10.0pt;font-family:
宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>)</span><span
lang=EN-US style='font-size:10.0pt'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>数据库的复杂程度不断增加。数据库中除了基本数据之外,还包括大量的注释、链接、参考文献等信息,例如,在</span><span
lang=EN-US style='font-size:10.0pt'>SWISS-PROT</span><span style='font-size:
10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:
"Times New Roman"'>数据库中,注释项涉及蛋白质的功能、结构域和活性位点、二级结构、四级结构、翻译后修饰、与其他蛋白质的相似性、与该蛋白质关联的疾病、序列变化等。</span></p>

<p class=MsoNormal style='mso-margin-top-alt:auto;mso-margin-bottom-alt:auto;
margin-left:36.0pt;text-indent:-36.0pt;line-height:150%;tab-stops:list 36.0pt'><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>(</span><span lang=EN-US
style='font-size:10.0pt'>4</span><span style='font-size:10.0pt;font-family:
宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>)</span><span
lang=EN-US style='font-size:10.0pt'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>数据库网络化。几乎所有的数据库都可以在国际互联网上访问,并且公共数据库之间相互链接,使用户可以迅速得到大量的相关生物分子信息。有的系统则将多个生物分子数据库整合在一起,形成集成的数据库系统。</span></p>

<p class=MsoNormal style='mso-margin-top-alt:auto;mso-margin-bottom-alt:auto;
margin-left:36.0pt;text-indent:-36.0pt;line-height:150%;tab-stops:list 36.0pt'><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>(</span><span lang=EN-US
style='font-size:10.0pt'>5</span><span style='font-size:10.0pt;font-family:
宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>)</span><span
lang=EN-US style='font-size:10.0pt'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>面向应用。首先,各个数据库服务器除了提供数据之外,还提供许多分析工具,如核酸数据库提供的序列搜索、基因识别程序等,生物大分子结构数据库提供的结构比较程序、结构模拟程序等。此外,还在原始数据库的基础上开发了许多面向特殊应用的二级数据库,如蛋白质分类数据库、蛋白质二级结构数据库等。</span></p>

<p class=MsoNormal style='mso-margin-top-alt:auto;mso-margin-bottom-alt:auto;
margin-left:36.0pt;text-indent:-36.0pt;line-height:150%;tab-stops:list 36.0pt'><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>(</span><span lang=EN-US
style='font-size:10.0pt'>6</span><span style='font-size:10.0pt;font-family:
宋体;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>)</span><span
lang=EN-US style='font-size:10.0pt'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span><span
style='font-size:10.0pt;font-family:宋体;mso-ascii-font-family:"Times New Roman";
mso-hansi-font-family:"Times New Roman"'>先进的软硬件配置。从计算机硬件方面来看,许多数据库服务器已从工作站升级到大型服务器,使数据库能够高效地管理数据和为用户服务,并在专门的硬件(如并行机)上运行服务程序。而在系统软件方面,使用大型数据库管理系统,面向对象的数据库管理方法正在逐步取代旧的模式,数据库服务广泛采用服务器客户式结构。</span></p>

<p class=MsoBodyTextIndent style='line-height:150%'><span lang=EN-US
style='font-size:10.0pt;mso-ascii-font-family:"Times New Roman"'>&nbsp;&nbsp;&nbsp;</span><span
lang=EN-US style='font-size:10.0pt;font-family:"Times New Roman";mso-hansi-font-family:
宋体'> </span><span style='font-size:10.0pt;mso-ascii-font-family:"Times New Roman"'>一般而言,生物分子数据库可以分为一级数据库和二级数据库。一级数据库中的数据直接来源于实验获得的原始数据,只经过简单的归类整理和注释;二级数据库是对原始生物分子数据进行整理、分类的结果,是在一级数据库、实验数据和理论分析的基础上针对特定的应用目标而建立的。与蛋白质相关的二级数据库比较多。</span><span
style='font-family:"Times New Roman"'> </span></p>

<p> </p>

<p align=right style='text-align:right'><b><span lang=EN-US style='font-size:
18.0pt;font-family:隶书'><!--[if gte vml 1]><v:shape id="_x0000_i1037" type="#_x0000_t75"
 alt="" style='width:36pt;height:43.5pt'>
 <v:imagedata src="./第四章.files/image005.jpg" o:href="http://www.lmbe.seu.edu.cn/chenyuan/xsun/bioinfomatics/web/images/mytemp1.jpg"/>
</v:shape><![endif]--><![if !vml]><img width=48 height=58
src="./第四章.files/image005.jpg" v:shapes="_x0000_i1037"><![endif]><a
href="http://www.lmbe.seu.edu.cn/chenyuan/xsun/bioinfomatics/web/Index.html">返回总目录</a></span></b></p>

<p align=right style='text-align:right'><span lang=EN-US><!--[if gte vml 1]><v:shape
 id="_x0000_i1038" type="#_x0000_t75" alt="" style='width:33.75pt;height:33pt'>
 <v:imagedata src="./第四章.files/image006.jpg" o:href="http://www.lmbe.seu.edu.cn/chenyuan/xsun/bioinfomatics/web/images/mytemp2.jpg"/>
</v:shape><![endif]--><![if !vml]><img border=0 width=45 height=44
src="./第四章.files/image006.jpg" v:shapes="_x0000_i1038"><![endif]></span><b><span
lang=EN-US style='font-size:18.0pt;font-family:隶书'><a
href="http://www.lmbe.seu.edu.cn/chenyuan/xsun/bioinfomatics/web/CharpterFour/#mark1">返回页首
</a></span></b><b><span lang=EN-US style='font-size:18.0pt;mso-ascii-font-family:
隶书;mso-fareast-font-family:隶书'>&nbsp;</span></b></p>

<h1 align=center style='text-align:center'><span style='font-family:隶书;
color:#EFCE8F'>第五章</span><span lang=EN-US style='font-size:36.0pt;mso-ascii-font-family:
隶书;mso-fareast-font-family:隶书;color:#EFCE8F'>&nbsp;</span><span
style='font-family:隶书;color:#EFCE8F'>基因组信息分析</span></h1>

<p align=center style='text-align:center'><span lang=EN-US style='font-size:
36.0pt;font-family:隶书;color:#EFCE8F'><!--[if gte vml 1]><v:shape id="_x0000_i1039"
 type="#_x0000_t75" alt="" style='width:495.75pt;height:18.75pt'>
 <v:imagedata src="./第四章.files/image001.jpg" o:href="http://www.lmbe.seu.edu.cn/chenyuan/xsun/bioinfomatics/web/images/mytemp14.jpg"/>
</v:shape><![endif]--><![if !vml]><img border=0 width=661 height=25
src="./第四章.files/image007.jpg" v:shapes="_x0000_i1039"><![endif]></span></p>

<p><span lang=EN-US><SELECT NAME="str_sel">
<OPTION SELECTED>========= 选择章节 ==========
<OPTION VALUE="5.1.htm">5.1 关于遗传语言
<OPTION VALUE="5.2.htm">5.2 原核基因组特点
<OPTION VALUE="5.3.htm">5.3 真核基因组特点
<OPTION VALUE="5.4.htm">5.4 基因组序列分析
<OPTION VALUE="5.5.htm">5.5 基因识别方法
<OPTION VALUE="5.6.htm">5.6 非编码区域分析和调控元件识别
<OPTION VALUE="5.question.htm">问题与练习
<OPTION VALUE="5.referance.htm">参考文献
</SELECT></span></p>

<p style='line-height:150%'><span lang=EN-US style='font-size:10.0pt;
font-family:"Times New Roman"'>&nbsp;&nbsp;&nbsp; </span><span
style='font-size:10.0pt;mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:
"Times New Roman"'>人类基因组计划的主要成果是得到一本“天书”,这本天书既简单又复杂。说它简单,是因为这本天书仅仅由</span><span
lang=EN-US style='font-size:10.0pt'>4</span><span style='font-size:10.0pt;
mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>个字母构成,这</span><span
lang=EN-US style='font-size:10.0pt'>4</span><span style='font-size:10.0pt;
mso-ascii-font-family:"Times New Roman";mso-hansi-font-family:"Times New Roman"'>个字母分别是</span><span
lang=EN-US style='font-size:10.0pt'>A</span><span style='font-size:10.0pt;

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -