<html><head><META http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>2.&nbsp;Release 1.10.0 - 09/11/2006</title><link href="../docbook.css" rel="stylesheet" type="text/css"><meta content="DocBook XSL Stylesheets V1.67.2" name="generator"><link rel="start" href="index.html" title="Heritrix Release Notes"><link rel="up" href="index.html" title="Heritrix Release Notes"><link rel="prev" href="1_10_1.html" title="1.&nbsp;Release 1.10.1 - 09/27/2006"><link rel="next" href="1_8_0.html" title="3.&nbsp;Release 1.8.0 - 05/05/2006"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table summary="Navigation header" width="100%"><tr><th align="center" colspan="3">2.&nbsp;Release 1.10.0 - 09/11/2006</th></tr><tr><td align="left" width="20%"><a accesskey="p" href="1_10_1.html">Prev</a>&nbsp;</td><th align="center" width="60%">&nbsp;</th><td align="right" width="20%">&nbsp;<a accesskey="n" href="1_8_0.html">Next</a></td></tr></table><hr></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="1_10_0"></a>2.&nbsp;Release 1.10.0 - 09/11/2006</h2></div></div></div><div class="abstract"><p class="title"><b>Abstract</b></p><p>Release 1.10.0 adds new configuration options, experimental       new protocol and format support, and lots of fixes. 
43 tracked bugs have been fixed and 35 feature requests have been implemented.</p><p>Release 1.10.0 requires the Java facilities of JDK 1.5.x ("Java 5").</p></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="1_10_0_contributors"></a>2.1.&nbsp;Contributors</h3></div></div></div><p>Aside from the <a href="http://crawler.archive.org/team-list.html" target="_top">usual suspects</a>, the following contributed to this release: <div class="itemizedlist"><ul type="disc"><li><p>Eric C Jensen</p></li><li><p>Olaf Freyer</p></li><li><p>Karl Wright (of MetaCarta)</p></li><li><p>Frank McCown (of Old Dominion University)</p></li><li><p>Max Sch&ouml;fmann</p></li><li><p>S&oslash;ren Vejrup Carlsen (of Royal Library, Denmark)</p></li></ul></div> </p></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="1_10_0_limitations"></a>2.2.&nbsp;Known Limitations/Issues</h3></div></div></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="1_10_0_limitations_bdb_nfs"></a>2.2.1.&nbsp;java.io.IOException: No locks available</h4></div></div></div><p>See <a href="1_8_0.html#bdb_nfs" title="3.1.1.&nbsp;java.io.IOException: No locks available">Section&nbsp;3.1.1, &ldquo;java.io.IOException: No locks available&rdquo;</a> in the 1.8.0 Release Notes.</p></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="old_checkpoints_and_old_order_files"></a>2.3.&nbsp;Pre-1.10.0 checkpoints</h3></div></div></div><p>Checkpoints made with 1.8.0 are definitely not recoverable with 1.10.0.
</p></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="1_10_0_changes"></a>2.4.&nbsp;Changes</h3></div></div></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="admindefaults"></a>2.4.1.&nbsp;No default login/password for web UI and JMX</h4></div></div></div><p>The old default login of 'admin' and password of 'letmein'          for access to the crawler web UI (and JMX agent control) have been          eliminated. It is now necessary to specify an access username and         password to start Heritrix. This may be done with the -a or          --admin command-line argument or via the system property          'heritrix.cmdline.admin'. (These each take a colon-separated          username and password, like 'username:password'.)</p></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="localhostonly"></a>2.4.2.&nbsp;Web UI binds to localhost only by default</h4></div></div></div><p>Previously, the Jetty web server that runs the Heritrix         web UI listened on all available network interfaces.  In 1.10.0,         Jetty will only bind to localhost by default.  The -b or --bind         command-line argument can be used to specify a different interface         or list of interfaces to bind to instead. You may specify "-b /" to         get the old behavior -- binding on all interfaces -- but only take         this step after reading section 2.3 of the User Manual, "Security         Considerations".          </p></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="quotaretire"></a>2.4.3.&nbsp;QuotaEnforcer 'force-retire' option</h4></div></div></div><p>The optional QuotaEnforcer processor has a new setting,          'force-retire', which is by default 'true', and changes the          default behavior of QuotaEnforcer. 
Previously, when a URI was         noted as being over-quota, it would be marked with a special          over-quota failure code which caused it to complete processing         as an error. As a result, all over-quota URIs would quickly be         finished as errors and appear in the crawl.log, but there would         be no opportunity to raise the quota and continue crawling.</p><p>The new default behavior instead marks the URI with a          directive requesting its frontier queue be retired. If the          frontier supports this directive, the URI will be returned to          its queue as if never tried, and the whole queue retired from          active crawling. This offers the opportunity to raise the quota