crawluri.html
来自「网络爬虫开源代码」· HTML 代码 · 共 1,029 行 · 第 1/5 页
HTML
1,029 行
Return the retained content-digest value, if any.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getContentDigestSchemeString()">getContentDigestSchemeString</A></B>()</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getContentDigestString()">getContentDigestString</A></B>()</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> long</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getContentLength()">getContentLength</A></B>()</CODE><BR> For completed HTTP transactions, the length of the content-body.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> long</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getContentSize()">getContentSize</A></B>()</CODE><BR> Get the size in bytes of this URI's recorded content, inclusive of things like protocol headers.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getContentType()">getContentType</A></B>()</CODE><BR> Get the content type of this URI.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getCrawlURIString()">getCrawlURIString</A></B>()</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.util.Set<<A HREF="../../../../org/archive/crawler/datamodel/credential/CredentialAvatar.html" title="class in org.archive.crawler.datamodel.credential">CredentialAvatar</A>></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getCredentialAvatars()">getCredentialAvatars</A></B>()</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> int</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getDeferrals()">getDeferrals</A></B>()</CODE><BR> Get the deferral count.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> int</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getEmbedHopCount()">getEmbedHopCount</A></B>()</CODE><BR> <B>Deprecated.</B> <I></I> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> int</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getFetchAttempts()">getFetchAttempts</A></B>()</CODE><BR> Get the number of attempts at getting the document referenced by this URI.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> int</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getFetchStatus()">getFetchStatus</A></B>()</CODE><BR> Return the overall/fetch status of this CrawlURI for its current trip through the processing loop.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.Object</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getHolder()">getHolder</A></B>()</CODE><BR> Return the 'holder' for the convenience of an external facility.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> int</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getHolderCost()">getHolderCost</A></B>()</CODE><BR> Return the 'holderCost' for convenience of external facility (frontier)</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.Object</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getHolderKey()">getHolderKey</A></B>()</CODE><BR> Return the 'holderKey' for convenience of an external facility (Frontier).</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> <A HREF="../../../../org/archive/util/HttpRecorder.html" title="class in org.archive.util">HttpRecorder</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getHttpRecorder()">getHttpRecorder</A></B>()</CODE><BR> Get the http recorder associated with this uri.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> int</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getLinkHopCount()">getLinkHopCount</A></B>()</CODE><BR> <B>Deprecated.</B> <I></I> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> long</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getOrdinal()">getOrdinal</A></B>()</CODE><BR> Get the ordinal (serial number) assigned at creation.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.util.Collection<<A HREF="../../../../org/archive/crawler/datamodel/CandidateURI.html" title="class in org.archive.crawler.datamodel">CandidateURI</A>></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getOutCandidates()">getOutCandidates</A></B>()</CODE><BR> Returns discovered candidate URIs.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.util.Collection<<A HREF="../../../../org/archive/crawler/extractor/Link.html" title="class in org.archive.crawler.extractor">Link</A>></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getOutLinks()">getOutLinks</A></B>()</CODE><BR> Returns discovered links.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.util.Collection<java.lang.Object></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getOutObjects()">getOutObjects</A></B>()</CODE><BR> Returns all of the outbound objects.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> st.ata.util.AList</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getPersistentAList()">getPersistentAList</A></B>()</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.Object</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getPrerequisiteUri()">getPrerequisiteUri</A></B>()</CODE><BR> Get the prerequisite for this URI.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> long</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getRecordedSize()">getRecordedSize</A></B>()</CODE><BR> Get size of data recorded (transferred)</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> int</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getThreadNumber()">getThreadNumber</A></B>()</CODE><BR> Get the number of the ToeThread responsible for processing this uri.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html#getUserAgent()">getUserAgent</A></B>()</CODE><BR> Get the user agent to use for crawling this URI.</TD>
⌨️ 快捷键说明
复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?