<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"><html><head><title>Heritrix - System Runtime Requirements</title><style type="text/css" media="all">          @import url("./style/maven-base.css");          			    @import url("./style/maven-theme.css");@import url("./style/project.css");</style><link rel="stylesheet" href="./style/print.css" type="text/css" media="print"></link><meta http-equiv="Content-Type" content="text/html; charset=UTF-8"></meta><meta name="author" content="St.Ack"></meta><meta name="email" content="stack at archive dot org"></meta></head><body class="composite"><div id="banner"><a href="http://www.archive.org/" id="organizationLogo"><img alt="Internet Archive" src="http://www.archive.org/images/logo.jpg"></img></a><a href="http://crawler.archive.org" id="projectLogo"><img alt="Heritrix" src="./images/logo.gif"></img></a><div class="clear"><hr></hr></div></div><div id="breadcrumbs"><div class="xleft">                	Last published: 06 May 2007                  | Doc for 1.12.1</div><div class="xright"></div><div class="clear"><hr></hr></div></div><div id="leftColumn"><div id="navcolumn"><div id="menuOverview"><h5>Overview</h5><ul><li class="none"><a href="license.html">License</a></li><li class="none"><strong><a href="requirements.html">System Requirements</a></strong></li><li class="none"><a href="downloads.html">Downloads</a></li><li class="none"><a href="articles/user_manual/index.html">User Manual</a></li><li class="none"><a href="articles/developer_manual/index.html">Developer Manual</a></li><li class="none"><a href="apidocs/index.html">Javadocs</a></li><li class="none"><a href="faq.html">FAQ</a></li><li class="none"><a href="http://webteam.archive.org/confluence/display/Heritrix/Home" class="externalLink" title="External Link">Wiki</a></li><li class="none"><a href="http://sourceforge.net/tracker/?group_id=73833&amp;atid=539099" class="externalLink" title="External 
Link">Browse/Submit a Bug</a></li><li class="expanded"><a href="">Related Projects</a><ul><li class="none"><a href="http://archive-access.sourceforge.net/" class="externalLink" title="External Link">Archive Access</a></li><li class="none"><a href="http://crawler.sourceforge.net/hcc" class="externalLink" title="External Link">Heritrix Cluster Controller (hcc)</a></li><li class="none"><a href="http://crawler.sourceforge.net/cmdline-jmxclient" class="externalLink" title="External Link">cmdline-jmxclient</a></li><li class="none"><a href="http://deduplicator.sourceforge.net" class="externalLink" title="External Link">Deduplicator</a></li><li class="none"><a href="http://www.zvents.com/labs/heritrix_hadoop" class="externalLink" title="External Link">Hadoop DFS Writer Processor</a></li></ul></li></ul></div><div id="menuProject_Documentation"><h5>Project Documentation</h5><ul><li class="none"><a href="index.html">About Heritrix</a></li><li class="collapsed"><a href="project-info.html">Project Info</a></li><li class="collapsed"><a href="maven-reports.html">Project Reports</a></li><li class="none"><a href="http://maven.apache.org/development-process.html" class="externalLink" title="External Link">Development Process</a></li></ul></div><a href="http://maven.apache.org/" title="Built by Maven" id="poweredBy"><img alt="Built by Maven" src="./images/logos/maven-button-1.png"></img></a></div></div><div id="bodyColumn"><div class="contentBox"><div class="section"><a name="System_Runtime_Requirements"></a><h2>System Runtime Requirements</h2><div class="subsection"><a name="Java_Runtime_Environment"></a><h3>Java Runtime Environment</h3><p>The Heritrix crawler is implemented purely in Java.  
This means that the only hard requirement for running it is an installed JRE.</p><p>The Heritrix crawler uses Java 5.0 features, so your JRE must be version 5.0 (1.5.0) or later.</p><p>The distribution package currently includes all of the free/open source third-party libraries needed to run Heritrix. See <a href="http://crawler.archive.org/dependencies.html" class="externalLink" title="External Link">dependencies</a> for the complete list. Licenses for all of the listed libraries appear in the dependencies section of the raw project.xml, found at the root of the src download or here on <a href="https://archive-crawler.svn.sourceforge.net/svnroot/archive-crawler/trunk/heritrix/project.xml">SourceForge</a>.</p></div><div class="subsection"><a name="Hardware"></a><h3>Hardware</h3><p>The default heap size is 256MB of RAM. This should be suitable for crawls that range over hundreds of hosts.</p></div><div class="subsection"><a name="Linux"></a><h3>Linux</h3><p>The Heritrix crawler has been built and tested primarily on Linux. It has seen some informal use on Macintosh, Windows 2000 and Windows XP, but it is not tested, packaged, or supported on platforms other than Linux at this time.</p></div></div></div></div><div class="clear"><hr></hr></div><div id="footer"><div class="xleft"><a href="http://sourceforge.net/projects/archive-crawler/" class="externalLink" title="External Link">            <img src="http://sourceforge.net/sflogo.php?group_id=archive-crawler&amp;type=1" border="0" alt="sf logo"></img></a></div><div class="xright">© 2003-2007, Internet Archive</div><div class="clear"><hr></hr></div></div></body></html>
