📄 serialized-form.html
字号:
<H3>currentDocsPerSecond</H3><PRE>double <B>currentDocsPerSecond</B></PRE><DL><DL></DL></DL><HR><H3>currentKBPerSec</H3><PRE>int <B>currentKBPerSec</B></PRE><DL><DL></DL></DL><HR><H3>totalKBPerSec</H3><PRE>long <B>totalKBPerSec</B></PRE><DL><DL></DL></DL><HR><H3>busyThreads</H3><PRE>int <B>busyThreads</B></PRE><DL><DL></DL></DL><HR><H3>totalProcessedBytes</H3><PRE>long <B>totalProcessedBytes</B></PRE><DL><DL></DL></DL><HR><H3>congestionRatio</H3><PRE>float <B>congestionRatio</B></PRE><DL><DL></DL></DL><HR><H3>deepestUri</H3><PRE>long <B>deepestUri</B></PRE><DL><DL></DL></DL><HR><H3>averageDepth</H3><PRE>long <B>averageDepth</B></PRE><DL><DL></DL></DL><HR><H3>mimeTypeDistribution</H3><PRE>java.util.Hashtable<K,V> <B>mimeTypeDistribution</B></PRE><DL><DD>Keep track of the file types we see (mime type -> count)<P><DL></DL></DL><HR><H3>mimeTypeBytes</H3><PRE>java.util.Hashtable<K,V> <B>mimeTypeBytes</B></PRE><DL><DL></DL></DL><HR><H3>statusCodeDistribution</H3><PRE>java.util.Hashtable<K,V> <B>statusCodeDistribution</B></PRE><DL><DD>Keep track of fetch status codes<P><DL></DL></DL><HR><H3>seedsCrawled</H3><PRE>int <B>seedsCrawled</B></PRE><DL><DL></DL></DL><HR><H3>seedsNotCrawled</H3><PRE>int <B>seedsNotCrawled</B></PRE><DL><DL></DL></DL><HR><H3>sExitMessage</H3><PRE>java.lang.String <B>sExitMessage</B></PRE><DL><DL></DL></DL><HR SIZE="4" NOSHADE><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableSubHeadingColor"><TH ALIGN="center"><FONT SIZE="+2"><B>Package</B> <B>org.archive.crawler.datamodel</B></FONT></TH></TR></TABLE><P><A NAME="org.archive.crawler.datamodel.CandidateURI"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableSubHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2"><B>Class <A HREF="org/archive/crawler/datamodel/CandidateURI.html" title="class in org.archive.crawler.datamodel">org.archive.crawler.datamodel.CandidateURI</A> extends java.lang.Object implements Serializable</B></FONT></TH></TR></TABLE><P><B>serialVersionUID: </B>-7152937921526560388L<P><A NAME="serialized_methods"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="1"><FONT SIZE="+2"><B>Serialization Methods</B></FONT></TH></TR></TABLE><P><A NAME="readObject(java.io.ObjectInputStream)"><!-- --></A><H3>readObject</H3><PRE>private void <B>readObject</B>(java.io.ObjectInputStream stream) throws java.io.IOException, java.lang.ClassNotFoundException</PRE><DL><DD>Custom deserialization to reconstruct UURI instances from more compact Strings.<P><DD><DL><DT><B>Throws:</B><DD><CODE>java.io.IOException</CODE><DD><CODE>java.lang.ClassNotFoundException</CODE></DD></DL></DL><HR><A NAME="writeObject(java.io.ObjectOutputStream)"><!-- --></A><H3>writeObject</H3><PRE>private void <B>writeObject</B>(java.io.ObjectOutputStream stream) throws java.io.IOException</PRE><DL><DD>Custom serialization writing 'uuri' and 'via' as Strings, rather than the bloated full serialization of their object classes, and an empty alist as 'null'. Shrinks serialized form by 50% or more in short tests.<P><DD><DL><DT><B>Throws:</B><DD><CODE>java.io.IOException</CODE></DD></DL></DL><A NAME="serializedForm"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="1"><FONT SIZE="+2"><B>Serialized Fields</B></FONT></TH></TR></TABLE><H3>schedulingDirective</H3><PRE>int <B>schedulingDirective</B></PRE><DL><DL></DL></DL><HR><H3>isSeed</H3><PRE>boolean <B>isSeed</B></PRE><DL><DD>Seed status<P><DL></DL></DL><HR><H3>forceRevisit</H3><PRE>boolean <B>forceRevisit</B></PRE><DL><DL></DL></DL><HR><H3>pathFromSeed</H3><PRE>java.lang.String <B>pathFromSeed</B></PRE><DL><DD>String of letters indicating how this URI was reached from a seed. <pre> P precondition R redirection E embedded (as frame, src, link, codebase, etc.) X speculative embed (as from javascript, some alternate-format extractors L link</pre> For example LLLE (an embedded image on a page 3 links from seed).<P><DL></DL></DL><HR><H3>viaContext</H3><PRE>java.lang.CharSequence <B>viaContext</B></PRE><DL><DD>Context of URI's discovery, as per the 'context' in Link<P><DL></DL></DL><HR><H3>cachedCandidateURIString</H3><PRE>java.lang.String <B>cachedCandidateURIString</B></PRE><DL><DD>Cache of this candidate uuri as a string. Profiling shows us spending about 1-2% of total elapsed time in toString.<P><DL></DL></DL><HR><H3>classKey</H3><PRE>java.lang.String <B>classKey</B></PRE><DL><DD>Frontier/Scheduler lifecycle info. This is an identifier set by the Frontier for its purposes. Usually its the name of the Frontier queue this URI gets queued to. Values can be host + port or IP, etc.<P><DL></DL></DL><P><A NAME="org.archive.crawler.datamodel.Checkpoint"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableSubHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2"><B>Class <A HREF="org/archive/crawler/datamodel/Checkpoint.html" title="class in org.archive.crawler.datamodel">org.archive.crawler.datamodel.Checkpoint</A> extends java.lang.Object implements Serializable</B></FONT></TH></TR></TABLE><P><B>serialVersionUID: </B>5121498771788002844L<P><A NAME="serialized_methods"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="1"><FONT SIZE="+2"><B>Serialization Methods</B></FONT></TH></TR></TABLE><P><A NAME="readObject(java.io.ObjectInputStream)"><!-- --></A><H3>readObject</H3><PRE>private void <B>readObject</B>(java.io.ObjectInputStream s) throws java.io.IOException, java.lang.ClassNotFoundException</PRE><DL><DD><DL><DT><B>Throws:</B><DD><CODE>java.io.IOException</CODE><DD><CODE>java.lang.ClassNotFoundException</CODE></DD></DL></DL><A NAME="serializedForm"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="1"><FONT SIZE="+2"><B>Serialized Fields</B></FONT></TH></TR></TABLE><H3>directory</H3><PRE>java.io.File <B>directory</B></PRE><DL><DL></DL></DL><P><A NAME="org.archive.crawler.datamodel.CrawlHost"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableSubHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2"><B>Class <A HREF="org/archive/crawler/datamodel/CrawlHost.html" title="class in org.archive.crawler.datamodel">org.archive.crawler.datamodel.CrawlHost</A> extends java.lang.Object implements Serializable</B></FONT></TH></TR></TABLE><P><A NAME="serializedForm"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="1"><FONT SIZE="+2"><B>Serialized Fields</B></FONT></TH></TR></TABLE><H3>hostname</H3><PRE>java.lang.String <B>hostname</B></PRE><DL><DL></DL></DL><HR><H3>countryCode</H3><PRE>java.lang.String <B>countryCode</B></PRE><DL><DL></DL></DL><HR><H3>ip</H3><PRE>java.net.InetAddress <B>ip</B></PRE><DL><DL></DL></DL><HR><H3>ipFetched</H3><PRE>long <B>ipFetched</B></PRE><DL><DL></DL></DL><HR><H3>substats</H3><PRE><A HREF="org/archive/crawler/datamodel/CrawlSubstats.html" title="class in org.archive.crawler.datamodel">CrawlSubstats</A> <B>substats</B></PRE><DL><DL></DL></DL><HR><H3>ipTTL</H3><PRE>long <B>ipTTL</B></PRE><DL><DD>TTL gotten from dns record. From rfc2035: <pre> TTL a 32 bit unsigned integer that specifies the time interval (in seconds) that the resource record may be cached before it should be discarded. Zero values are interpreted to mean that the RR can only be used for the transaction in progress, and should not be cached. </pre><P><DL></DL></DL><HR><H3>earliestNextURIEmitTime</H3>
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -