⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 index.html

📁 用JAVA编写的,在做实验的时候留下来的,本来想删的,但是传上来,大家分享吧
💻 HTML
字号:
<html><head><META http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"><title>Heritrix User Manual</title><link href="../docbook.css" rel="stylesheet" type="text/css"><meta content="DocBook XSL Stylesheets V1.67.2" name="generator"><link rel="start" href="index.html" title="Heritrix User Manual"><link rel="next" href="intro.html" title="1.&nbsp;Introduction"></head><body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="navheader"><table summary="Navigation header" width="100%"><tr><th align="center" colspan="3">Heritrix User Manual</th></tr><tr><td align="left" width="20%">&nbsp;</td><th align="center" width="60%">&nbsp;</th><td align="right" width="20%">&nbsp;<a accesskey="n" href="intro.html">Next</a></td></tr></table><hr></div><div class="article" lang="en" id="N10001"><div class="titlepage"><div><div><h2 class="title"><a name="N10001"></a>Heritrix User Manual</h2></div><div><div class="authorgroup"><h3 class="corpauthor">Internet Archive</h3><div class="author"><h3 class="author"><span class="firstname">Kristinn</span> <span class="surname">Sigur&#273;sson</span></h3></div><div class="author"><h3 class="author"><span class="firstname">Michael</span> <span class="surname">Stack</span></h3></div><div class="author"><h3 class="author"><span class="firstname">Igor</span> <span class="surname">Ranitovic</span></h3></div></div></div></div><hr></div><div class="toc"><p><b>Table of Contents</b></p><dl><dt><span class="sect1"><a href="intro.html">1. Introduction</a></span></dt><dt><span class="sect1"><a href="install.html">2. Installing and running Heritrix</a></span></dt><dd><dl><dt><span class="sect2"><a href="install.html#N1003F">2.1. Obtaining and installing Heritrix</a></span></dt><dt><span class="sect2"><a href="install.html#N1008F">2.2. Running Heritrix</a></span></dt><dt><span class="sect2"><a href="install.html#security">2.3. Security Considerations</a></span></dt></dl></dd><dt><span class="sect1"><a href="wui.html">3. Web based user interface</a></span></dt><dt><span class="sect1"><a href="tutorial.html">4. A quick guide to running your first crawl job</a></span></dt><dt><span class="sect1"><a href="creating.html">5. Creating jobs and profiles</a></span></dt><dd><dl><dt><span class="sect2"><a href="creating.html#N10250">5.1. Crawl job</a></span></dt><dt><span class="sect2"><a href="creating.html#N102B4">5.2. Profile</a></span></dt></dl></dd><dt><span class="sect1"><a href="config.html">6. Configuring jobs and profiles</a></span></dt><dd><dl><dt><span class="sect2"><a href="config.html#modules">6.1. Modules (Scope, Frontier, and Processors)</a></span></dt><dt><span class="sect2"><a href="config.html#submodules">6.2. Submodules</a></span></dt><dt><span class="sect2"><a href="config.html#settings">6.3. Settings</a></span></dt><dt><span class="sect2"><a href="config.html#overrides">6.4. Overrides</a></span></dt><dt><span class="sect2"><a href="config.html#refinements">6.5. Refinements</a></span></dt></dl></dd><dt><span class="sect1"><a href="running.html">7. Running a job</a></span></dt><dd><dl><dt><span class="sect2"><a href="running.html#console">7.1. Web Console</a></span></dt><dt><span class="sect2"><a href="running.html#pendingjobs">7.2. Pending jobs</a></span></dt><dt><span class="sect2"><a href="running.html#N10AA9">7.3. Monitoring a running job</a></span></dt><dt><span class="sect2"><a href="running.html#editrun">7.4. Editing a running job</a></span></dt></dl></dd><dt><span class="sect1"><a href="analysis.html">8. Analysis of jobs</a></span></dt><dd><dl><dt><span class="sect2"><a href="analysis.html#completedjobs">8.1. Completed jobs</a></span></dt><dt><span class="sect2"><a href="analysis.html#logs">8.2. Logs</a></span></dt><dt><span class="sect2"><a href="analysis.html#reports">8.3. Reports</a></span></dt></dl></dd><dt><span class="sect1"><a href="outside.html">9. Outside the user interface</a></span></dt><dd><dl><dt><span class="sect2"><a href="outside.html#N10CF3">9.1. Generated files</a></span></dt><dt><span class="sect2"><a href="outside.html#N10DBF">9.2. Helpful scripts</a></span></dt><dt><span class="sect2"><a href="outside.html#recover">9.3. Recovery of Frontier State and recover.gz</a></span></dt><dt><span class="sect2"><a href="outside.html#checkpoint">9.4. Checkpointing</a></span></dt><dt><span class="sect2"><a href="outside.html#mon_com">9.5. Remote Monitoring and Control</a></span></dt><dt><span class="sect2"><a href="outside.html#ftp_support">9.6. Experimental FTP Support</a></span></dt></dl></dd><dt><span class="appendix"><a href="usecases.html">A. Common Heritrix Use Cases</a></span></dt><dt><span class="glossary"><a href="glossary.html">Glossary</a></span></dt></dl></div></div><div class="navfooter"><hr><table summary="Navigation footer" width="100%"><tr><td align="left" width="40%">&nbsp;</td><td align="center" width="20%">&nbsp;</td><td align="right" width="40%">&nbsp;<a accesskey="n" href="intro.html">Next</a></td></tr><tr><td valign="top" align="left" width="40%">&nbsp;</td><td align="center" width="20%">&nbsp;</td><td valign="top" align="right" width="40%">&nbsp;1.&nbsp;Introduction</td></tr></table></div></body></html>

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -