crawluri.html

来自「网络爬虫开源代码」· HTML 代码 · 共 988 行 · 第 1/5 页

HTML
988
字号
<TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>&nbsp;boolean</CODE></FONT></TD><TD><CODE><B>HtmlFormCredential.</B><B><A HREF="../../../../../org/archive/crawler/datamodel/credential/HtmlFormCredential.html#populate(org.archive.crawler.datamodel.CrawlURI, org.apache.commons.httpclient.HttpClient, org.apache.commons.httpclient.HttpMethod, java.lang.String)">populate</A></B>(<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi,         org.apache.commons.httpclient.HttpClient&nbsp;http,         org.apache.commons.httpclient.HttpMethod&nbsp;method,         java.lang.String&nbsp;payload)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>abstract &nbsp;boolean</CODE></FONT></TD><TD><CODE><B>Credential.</B><B><A HREF="../../../../../org/archive/crawler/datamodel/credential/Credential.html#populate(org.archive.crawler.datamodel.CrawlURI, org.apache.commons.httpclient.HttpClient, org.apache.commons.httpclient.HttpMethod, java.lang.String)">populate</A></B>(<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi,         org.apache.commons.httpclient.HttpClient&nbsp;http,         org.apache.commons.httpclient.HttpMethod&nbsp;method,         java.lang.String&nbsp;payload)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>&nbsp;boolean</CODE></FONT></TD><TD><CODE><B>Credential.</B><B><A HREF="../../../../../org/archive/crawler/datamodel/credential/Credential.html#rootUriMatch(org.archive.crawler.framework.CrawlController, org.archive.crawler.datamodel.CrawlURI)">rootUriMatch</A></B>(<A HREF="../../../../../org/archive/crawler/framework/CrawlController.html" title="class in org.archive.crawler.framework">CrawlController</A>&nbsp;controller,             <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Test passed curi matches this credentials rootUri.</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.deciderules"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Uses of <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A> in <A HREF="../../../../../org/archive/crawler/deciderules/package-summary.html">org.archive.crawler.deciderules</A></FONT></TH></TR></TABLE>&nbsp;<P><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableSubHeadingColor"><TH ALIGN="left" COLSPAN="2">Methods in <A HREF="../../../../../org/archive/crawler/deciderules/package-summary.html">org.archive.crawler.deciderules</A> with parameters of type <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected &nbsp;boolean</CODE></FONT></TD><TD><CODE><B>FilterDecideRule.</B><B><A HREF="../../../../../org/archive/crawler/deciderules/FilterDecideRule.html#filtersAccept(org.archive.crawler.datamodel.CrawlURI)">filtersAccept</A></B>(<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Do all specified filters (if any) accept this CrawlURI?</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected &nbsp;boolean</CODE></FONT></TD><TD><CODE><B>FilterDecideRule.</B><B><A HREF="../../../../../org/archive/crawler/deciderules/FilterDecideRule.html#filtersAccept(org.archive.crawler.settings.MapType, org.archive.crawler.datamodel.CrawlURI)">filtersAccept</A></B>(<A HREF="../../../../../org/archive/crawler/settings/MapType.html" title="class in org.archive.crawler.settings">MapType</A>&nbsp;fs,              <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Do all specified filters (if any) accept this CrawlURI?</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.deciderules.recrawl"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Uses of <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A> in <A HREF="../../../../../org/archive/crawler/deciderules/recrawl/package-summary.html">org.archive.crawler.deciderules.recrawl</A></FONT></TH></TR></TABLE>&nbsp;<P><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableSubHeadingColor"><TH ALIGN="left" COLSPAN="2">Methods in <A HREF="../../../../../org/archive/crawler/deciderules/recrawl/package-summary.html">org.archive.crawler.deciderules.recrawl</A> with parameters of type <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>static&nbsp;boolean</CODE></FONT></TD><TD><CODE><B>IdenticalDigestDecideRule.</B><B><A HREF="../../../../../org/archive/crawler/deciderules/recrawl/IdenticalDigestDecideRule.html#hasIdenticalDigest(org.archive.crawler.datamodel.CrawlURI)">hasIdenticalDigest</A></B>(<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Utility method for testing if a CrawlURI's last two history  entiries (one being the most recent fetch) have identical  content-digest information.</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.event"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Uses of <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A> in <A HREF="../../../../../org/archive/crawler/event/package-summary.html">org.archive.crawler.event</A></FONT></TH></TR></TABLE>&nbsp;<P><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableSubHeadingColor"><TH ALIGN="left" COLSPAN="2">Methods in <A HREF="../../../../../org/archive/crawler/event/package-summary.html">org.archive.crawler.event</A> with parameters of type <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>&nbsp;void</CODE></FONT></TD><TD><CODE><B>CrawlURIDispositionListener.</B><B><A HREF="../../../../../org/archive/crawler/event/CrawlURIDispositionListener.html#crawledURIDisregard(org.archive.crawler.datamodel.CrawlURI)">crawledURIDisregard</A></B>(<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Notification of a crawled URI that is to be disregarded.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>&nbsp;void</CODE></FONT></TD><TD><CODE><B>CrawlURIDispositionListener.</B><B><A HREF="../../../../../org/archive/crawler/event/CrawlURIDispositionListener.html#crawledURIFailure(org.archive.crawler.datamodel.CrawlURI)">crawledURIFailure</A></B>(<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Notification of a failed crawling of a URI.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>&nbsp;void</CODE></FONT></TD><TD><CODE><B>CrawlURIDispositionListener.</B><B><A HREF="../../../../../org/archive/crawler/event/CrawlURIDispositionListener.html#crawledURINeedRetry(org.archive.crawler.datamodel.CrawlURI)">crawledURINeedRetry</A></B>(<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Notification of a failed crawl of a URI that will be retried (failure due to possible transient problems).</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>&nbsp;void</CODE></FONT></TD><TD><CODE><B>CrawlURIDispositionListener.</B><B><A HREF="../../../../../org/archive/crawler/event/CrawlURIDispositionListener.html#crawledURISuccessful(org.archive.crawler.datamodel.CrawlURI)">crawledURISuccessful</A></B>(<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Notification of a successfully crawled URI</TD></TR></TABLE>&nbsp;<P><A NAME="org.archive.crawler.extractor"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2">Uses of <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A> in <A HREF="../../../../../org/archive/crawler/extractor/package-summary.html">org.archive.crawler.extractor</A></FONT></TH></TR></TABLE>&nbsp;<P><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableSubHeadingColor"><TH ALIGN="left" COLSPAN="2">Fields in <A HREF="../../../../../org/archive/crawler/extractor/package-summary.html">org.archive.crawler.extractor</A> declared as <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>(package private) &nbsp;<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A></CODE></FONT></TD><TD><CODE><B>CrawlUriSWFAction.</B><B><A HREF="../../../../../org/archive/crawler/extractor/CrawlUriSWFAction.html#curi">curi</A></B></CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</TD></TR></TABLE>&nbsp;<P><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableSubHeadingColor"><TH ALIGN="left" COLSPAN="2">Methods in <A HREF="../../../../../org/archive/crawler/extractor/package-summary.html">org.archive.crawler.extractor</A> that return <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected &nbsp;<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A></CODE></FONT></TD><TD><CODE><B>ExtractorTool.</B><B><A HREF="../../../../../org/archive/crawler/extractor/ExtractorTool.html#getCrawlURI(org.archive.io.arc.ARCRecord, org.archive.util.HttpRecorder)">getCrawlURI</A></B>(<A HREF="../../../../../org/archive/io/arc/ARCRecord.html" title="class in org.archive.io.arc">ARCRecord</A>&nbsp;record,            <A HREF="../../../../../org/archive/util/HttpRecorder.html" title="class in org.archive.util">HttpRecorder</A>&nbsp;hr)</CODE><BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</TD></TR></TABLE>&nbsp;<P><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableSubHeadingColor"><TH ALIGN="left" COLSPAN="2">Methods in <A HREF="../../../../../org/archive/crawler/extractor/package-summary.html">org.archive.crawler.extractor</A> with parameters of type <A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected &nbsp;void</CODE></FONT></TD><TD><CODE><B>ExtractorHTTP.</B><B><A HREF="../../../../../org/archive/crawler/extractor/ExtractorHTTP.html#addHeaderLink(org.archive.crawler.datamodel.CrawlURI, org.apache.commons.httpclient.Header)">addHeaderLink</A></B>(<A HREF="../../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi,              org.apache.commons.httpclient.Header&nbsp;loc)</CODE>

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?