📄 1_10_0.html
字号:
<html><head><META http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>2. Release 1.10.0 - 09/11/2006</title><link href="../docbook.css" rel="stylesheet" type="text/css"><meta content="DocBook XSL Stylesheets V1.67.2" name="generator"><link rel="start" href="index.html" title="Heritrix Release Notes"><link rel="up" href="index.html" title="Heritrix Release Notes"><link rel="prev" href="1_10_1.html" title="1. Release 1.10.1 - 09/27/2006"><link rel="next" href="1_8_0.html" title="3. Release 1.8.0 - 05/05/2006"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table summary="Navigation header" width="100%"><tr><th align="center" colspan="3">2. Release 1.10.0 - 09/11/2006</th></tr><tr><td align="left" width="20%"><a accesskey="p" href="1_10_1.html">Prev</a> </td><th align="center" width="60%"> </th><td align="right" width="20%"> <a accesskey="n" href="1_8_0.html">Next</a></td></tr></table><hr></div><div class="sect1" lang="en"><div class="titlepage"><div><div><h2 class="title" style="clear: both"><a name="1_10_0"></a>2. Release 1.10.0 - 09/11/2006</h2></div></div></div><div class="abstract"><p class="title"><b>Abstract</b></p><p>Release 1.10.0 adds new configuration options, experimental new protocol and format support, and lots of fixes. 43 tracked bugs have been fixed and 35 feature requests added.</p><p>Release 1.10.0 requires JDK 1.5.x ("Java 5") Java facilities.</p></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="1_10_0_contributors"></a>2.1. Contributors</h3></div></div></div><p>Aside from the <a href="http://crawler.archive.org/team-list.html" target="_top">usual suspects</a>, the following contributed to this release: <div class="itemizedlist"><ul type="disc"><li><p>Eric C Jensen</p></li><li><p>Olaf Freyer</p></li><li><p>Karl Wright (of MetaCarta)</p></li><li><p>Frank McCown (of Old Dominion University)</p></li><li><p>Max Schöfmann</p></li><li><p>Søren Vejrup Carlsen (of Royal Library, Denmark)</p></li></ul></div> </p></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="1_10_0_limitations"></a>2.2. Known Limitations/Issues</h3></div></div></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="1_10_0_limitations_bdb_nfs"></a>2.2.1. java.io.IOException: No locks available</h4></div></div></div><p>See <a href="1_8_0.html#bdb_nfs" title="3.1.1. java.io.IOException: No locks available">Section 3.1.1, “java.io.IOException: No locks available”</a> in 1.8.0 Release Notes.</p></div></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="old_checkpoints_and_old_order_files"></a>2.3. Pre-1.10.0 checkpoints</h3></div></div></div><p>For sure 1.8.0 checkpoints will not be recoverable with 1.10.0. </p></div><div class="sect2" lang="en"><div class="titlepage"><div><div><h3 class="title"><a name="1_10_0_changes"></a>2.4. Changes</h3></div></div></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="admindefaults"></a>2.4.1. No default login/password for web UI and JMX</h4></div></div></div><p>The old default login of 'admin' and password of 'letmein' for access to the crawler web UI (and JMX agent control) have been eliminated. It is now necessary to specify an access username and password to start Heritrix. This may be done with the -a or --admin command-line argument or via the system property 'heritrix.cmdline.admin'. (These each take a colon-separated username and password, like 'username:password'.)</p></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="localhostonly"></a>2.4.2. Web UI binds to localhost only by default</h4></div></div></div><p>Previously, the Jetty web server that runs the Heritrix web UI listened on all available network interfaces. In 1.10.0, Jetty will only bind to localhost by default. The -b or --bind command-line argument can be used to specify a different interface or list of interfaces to bind to instead. You may specify "-b /" to get the old behavior -- binding on all interfaces -- but only take this step after reading section 2.3 of the User Manual, "Security Considerations". </p></div><div class="sect3" lang="en"><div class="titlepage"><div><div><h4 class="title"><a name="quotaretire"></a>2.4.3. QuotaEnforcer 'force-retire' option</h4></div></div></div><p>The optional QuotaEnforcer processor has a new setting, 'force-retire', which is by default 'true', and changes the default behavior of QuotaEnforcer. Previously, when a URI was noted as being over-quota, it would be marked with a special over-quota failure code which caused it to complete processing as an error. As a result, all over-quota URIs would quickly be finished as errors and appear in the crawl.log, but there would be no opportunity to raise the quota and continue crawling.</p><p>The new default behavior instead marks the URI with a directive requesting its frontier queue be retired. If the frontier supports this directive, the URI will be returned to its queue as if never tried, and the whole queue retired from active crawling. This offers the opportunity to raise the quota
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -