package-use.html

From "Web Crawler Open Source Code" · HTML code · 945 lines · page 1 of 4

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Collector of statistics for a 'subset' of a crawl, such as a server (host:port), host, or frontier group  (e.g. queue).</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CrawlSubstats.HasCrawlSubstats.html#org.archive.crawler.datamodel"><B>CrawlSubstats.HasCrawlSubstats</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CrawlURI.html#org.archive.crawler.datamodel"><B>CrawlURI</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Represents a candidate URI and the associated state it collects as it is crawled.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CredentialStore.html#org.archive.crawler.datamodel"><B>CredentialStore</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Front door to the credential store.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/FetchStatusCodes.html#org.archive.crawler.datamodel"><B>FetchStatusCodes</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Constant flag codes to be used, in lieu of per-protocol codes (like HTTP's 200, 404, etc.), when network/internal/ out-of-band conditions occur.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/RobotsExclusionPolicy.html#org.archive.crawler.datamodel"><B>RobotsExclusionPolicy</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;RobotsExclusionPolicy represents the actual policy adopted with  respect to a specific remote server, usually constructed from  consulting the robots.txt, if any, the server 
provided.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/RobotsHonoringPolicy.html#org.archive.crawler.datamodel"><B>RobotsHonoringPolicy</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;RobotsHonoringPolicy represents the strategy used by the crawler  for determining how robots.txt files will be honored.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/UriUniqFilter.HasUriReceiver.html#org.archive.crawler.datamodel"><B>UriUniqFilter.HasUriReceiver</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;URIs that have not been seen before 'visit' this 'Visitor'.</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.datamodel.credential"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Classes in <A HREF="../../../../org/archive/crawler/datamodel/package-summary.html">org.archive.crawler.datamodel</A> used by <A HREF="../../../../org/archive/crawler/datamodel/credential/package-summary.html">org.archive.crawler.datamodel.credential</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CrawlURI.html#org.archive.crawler.datamodel.credential"><B>CrawlURI</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Represents a candidate URI and the associated state it collects as it is crawled.</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.deciderules"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Classes in <A HREF="../../../../org/archive/crawler/datamodel/package-summary.html">org.archive.crawler.datamodel</A> used by <A 
HREF="../../../../org/archive/crawler/deciderules/package-summary.html">org.archive.crawler.deciderules</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CandidateURI.html#org.archive.crawler.deciderules"><B>CandidateURI</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;A URI, discovered or passed-in, that may be scheduled.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CoreAttributeConstants.html#org.archive.crawler.deciderules"><B>CoreAttributeConstants</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;CrawlURI attribute keys used by the core crawler classes.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CrawlURI.html#org.archive.crawler.deciderules"><B>CrawlURI</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Represents a candidate URI and the associated state it collects as it is crawled.</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.deciderules.recrawl"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Classes in <A HREF="../../../../org/archive/crawler/datamodel/package-summary.html">org.archive.crawler.datamodel</A> used by <A HREF="../../../../org/archive/crawler/deciderules/recrawl/package-summary.html">org.archive.crawler.deciderules.recrawl</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CoreAttributeConstants.html#org.archive.crawler.deciderules.recrawl"><B>CoreAttributeConstants</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;CrawlURI attribute keys used by the core crawler classes.</TD></TR><TR BGCOLOR="white" 
CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CrawlURI.html#org.archive.crawler.deciderules.recrawl"><B>CrawlURI</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Represents a candidate URI and the associated state it collects as it is crawled.</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.event"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Classes in <A HREF="../../../../org/archive/crawler/datamodel/package-summary.html">org.archive.crawler.datamodel</A> used by <A HREF="../../../../org/archive/crawler/event/package-summary.html">org.archive.crawler.event</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CrawlURI.html#org.archive.crawler.event"><B>CrawlURI</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Represents a candidate URI and the associated state it collects as it is crawled.</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.extractor"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Classes in <A HREF="../../../../org/archive/crawler/datamodel/package-summary.html">org.archive.crawler.datamodel</A> used by <A HREF="../../../../org/archive/crawler/extractor/package-summary.html">org.archive.crawler.extractor</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CoreAttributeConstants.html#org.archive.crawler.extractor"><B>CoreAttributeConstants</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;CrawlURI attribute keys used by the core crawler classes.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A 
HREF="../../../../org/archive/crawler/datamodel/class-use/CrawlURI.html#org.archive.crawler.extractor"><B>CrawlURI</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Represents a candidate URI and the associated state it collects as it is crawled.</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.fetcher"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Classes in <A HREF="../../../../org/archive/crawler/datamodel/package-summary.html">org.archive.crawler.datamodel</A> used by <A HREF="../../../../org/archive/crawler/fetcher/package-summary.html">org.archive.crawler.fetcher</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CoreAttributeConstants.html#org.archive.crawler.fetcher"><B>CoreAttributeConstants</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;CrawlURI attribute keys used by the core crawler classes.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CrawlHost.html#org.archive.crawler.fetcher"><B>CrawlHost</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Represents a single remote "host".</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CrawlURI.html#org.archive.crawler.fetcher"><B>CrawlURI</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Represents a candidate URI and the associated state it collects as it is crawled.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/FetchStatusCodes.html#org.archive.crawler.fetcher"><B>FetchStatusCodes</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Constant flag codes to be used, 
in lieu of per-protocol codes (like HTTP's 200, 404, etc.), when network/internal/ out-of-band conditions occur.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/ServerCache.html#org.archive.crawler.fetcher"><B>ServerCache</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Server and Host cache.</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.filter"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Classes in <A HREF="../../../../org/archive/crawler/datamodel/package-summary.html">org.archive.crawler.datamodel</A> used by <A HREF="../../../../org/archive/crawler/filter/package-summary.html">org.archive.crawler.filter</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CoreAttributeConstants.html#org.archive.crawler.filter"><B>CoreAttributeConstants</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;CrawlURI attribute keys used by the core crawler classes.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CrawlURI.html#org.archive.crawler.filter"><B>CrawlURI</B></A></B><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Represents a candidate URI and the associated state it collects as it is crawled.</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.framework"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Classes in <A HREF="../../../../org/archive/crawler/datamodel/package-summary.html">org.archive.crawler.datamodel</A> used by <A 
HREF="../../../../org/archive/crawler/framework/package-summary.html">org.archive.crawler.framework</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><B><A HREF="../../../../org/archive/crawler/datamodel/class-use/CandidateURI.html#org.archive.crawler.framework"><B>CandidateURI</B></A></B>
