
📄 fpmergeuriuniqfilter.html

</DD><DD><DL><DT><B>Parameters:</B><DD><CODE>receiver</CODE> - Object that will be passed items. Must implement HasUriReceiver interface.</DL></DD></DL><HR><A NAME="profileLog(java.lang.String)"><!-- --></A><H3>profileLog</H3><PRE>protected void <B>profileLog</B>(java.lang.String&nbsp;key)</PRE><DL><DD><DL></DL></DD><DD><DL></DL></DD></DL><HR><A NAME="add(java.lang.String, org.archive.crawler.datamodel.CandidateURI)"><!-- --></A><H3>add</H3><PRE>public void <B>add</B>(java.lang.String&nbsp;key,                <A HREF="../../../../org/archive/crawler/datamodel/CandidateURI.html" title="class in org.archive.crawler.datamodel">CandidateURI</A>&nbsp;value)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#add(java.lang.String, org.archive.crawler.datamodel.CandidateURI)">UriUniqFilter</A></CODE></B></DD><DD>Add given uri, if not already present.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#add(java.lang.String, org.archive.crawler.datamodel.CandidateURI)">add</A></CODE> in interface <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html" title="interface in org.archive.crawler.datamodel">UriUniqFilter</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>key</CODE> - Usually a canonicalized version of <code>value</code>. 
This is the key used doing lookups, forgets and insertions on the already included list.<DD><CODE>value</CODE> - item to add.</DL></DD></DL><HR><A NAME="pend(long, org.archive.crawler.datamodel.CandidateURI)"><!-- --></A><H3>pend</H3><PRE>protected void <B>pend</B>(long&nbsp;fp,                    <A HREF="../../../../org/archive/crawler/datamodel/CandidateURI.html" title="class in org.archive.crawler.datamodel">CandidateURI</A>&nbsp;value)</PRE><DL><DD>Place the given FP/CandidateURI pair into the pending set, awaiting a merge to determine if it's actually accepted.<P><DD><DL></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>fp</CODE> - long fingerprint<DD><CODE>value</CODE> - CandidateURI, or null if fp only needs merging (as when the CandidateURI was already forced in)</DL></DD></DL><HR><A NAME="createFp(java.lang.CharSequence)"><!-- --></A><H3>createFp</H3><PRE>public static long <B>createFp</B>(java.lang.CharSequence&nbsp;key)</PRE><DL><DD>Create a fingerprint from the given key<P><DD><DL></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>key</CODE> - CharSequence (URI) to fingerprint<DT><B>Returns:</B><DD>long fingerprint</DL></DD></DL><HR><A NAME="addNow(java.lang.String, org.archive.crawler.datamodel.CandidateURI)"><!-- --></A><H3>addNow</H3><PRE>public void <B>addNow</B>(java.lang.String&nbsp;key,                   <A HREF="../../../../org/archive/crawler/datamodel/CandidateURI.html" title="class in org.archive.crawler.datamodel">CandidateURI</A>&nbsp;value)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#addNow(java.lang.String, org.archive.crawler.datamodel.CandidateURI)">UriUniqFilter</A></CODE></B></DD><DD>Immediately add uri.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#addNow(java.lang.String, org.archive.crawler.datamodel.CandidateURI)">addNow</A></CODE> in interface <CODE><A 
HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html" title="interface in org.archive.crawler.datamodel">UriUniqFilter</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>key</CODE> - Usually a canonicalized version of <code>uri</code>. This is the key used doing lookups, forgets and insertions on the already included list.<DD><CODE>value</CODE> - item to add.</DL></DD></DL><HR><A NAME="addForce(java.lang.String, org.archive.crawler.datamodel.CandidateURI)"><!-- --></A><H3>addForce</H3><PRE>public void <B>addForce</B>(java.lang.String&nbsp;key,                     <A HREF="../../../../org/archive/crawler/datamodel/CandidateURI.html" title="class in org.archive.crawler.datamodel">CandidateURI</A>&nbsp;value)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#addForce(java.lang.String, org.archive.crawler.datamodel.CandidateURI)">UriUniqFilter</A></CODE></B></DD><DD>Add given uri, all the way through to underlying destination, even if already present. (Sometimes a URI must be fetched, or refetched, for example when DNS or robots info expires or the operator forces a refetch. A normal add() or addNow() would drop the URI without forwarding on once it is determined to already be in the filter.)<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#addForce(java.lang.String, org.archive.crawler.datamodel.CandidateURI)">addForce</A></CODE> in interface <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html" title="interface in org.archive.crawler.datamodel">UriUniqFilter</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>key</CODE> - Usually a canonicalized version of <code>uri</code>. 
This is the key used doing lookups, forgets and insertions on the already included list.<DD><CODE>value</CODE> - item to add.</DL></DD></DL><HR><A NAME="note(java.lang.String)"><!-- --></A><H3>note</H3><PRE>public void <B>note</B>(java.lang.String&nbsp;key)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#note(java.lang.String)">UriUniqFilter</A></CODE></B></DD><DD>Note item as seen, without passing through to receiver.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#note(java.lang.String)">note</A></CODE> in interface <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html" title="interface in org.archive.crawler.datamodel">UriUniqFilter</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>key</CODE> - Usually a canonicalized version of an <code>URI</code>. This is the key used doing lookups, forgets and insertions on the already included list.</DL></DD></DL><HR><A NAME="forget(java.lang.String, org.archive.crawler.datamodel.CandidateURI)"><!-- --></A><H3>forget</H3><PRE>public void <B>forget</B>(java.lang.String&nbsp;key,                   <A HREF="../../../../org/archive/crawler/datamodel/CandidateURI.html" title="class in org.archive.crawler.datamodel">CandidateURI</A>&nbsp;value)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#forget(java.lang.String, org.archive.crawler.datamodel.CandidateURI)">UriUniqFilter</A></CODE></B></DD><DD>Forget item was seen<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#forget(java.lang.String, org.archive.crawler.datamodel.CandidateURI)">forget</A></CODE> in interface <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html" title="interface in 
org.archive.crawler.datamodel">UriUniqFilter</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>key</CODE> - Usually a canonicalized version of an <code>URI</code>. This is the key used doing lookups, forgets and insertions on the already included list.<DD><CODE>value</CODE> - item to add.</DL></DD></DL><HR><A NAME="requestFlush()"><!-- --></A><H3>requestFlush</H3><PRE>public long <B>requestFlush</B>()</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#requestFlush()">UriUniqFilter</A></CODE></B></DD><DD>Request that any pending items be added/dropped. Implementors may ignore the request if a flush would be too expensive/too  soon.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#requestFlush()">requestFlush</A></CODE> in interface <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html" title="interface in org.archive.crawler.datamodel">UriUniqFilter</A></CODE></DL></DD><DD><DL><DT><B>Returns:</B><DD>Number added.</DL></DD></DL><HR><A NAME="flush()"><!-- --></A><H3>flush</H3><PRE>public long <B>flush</B>()</PRE><DL><DD>Perform a merge of all 'pending' items to the overall fingerprint list.  If the pending item is new, and has an associated CandidateURI, pass that URI along to the 'receiver' (frontier) for queueing.<P><DD><DL></DL></DD><DD><DL><DT><B>Returns:</B><DD>number of pending items actually added</DL></DD></DL><HR><A NAME="beginFpMerge()"><!-- --></A><H3>beginFpMerge</H3><PRE>protected abstract it.unimi.dsi.fastutil.longs.LongIterator <B>beginFpMerge</B>()</PRE><DL><DD>Begin merging pending candidates with complete list. 
Return an Iterator which will return all previously-known FPs in turn.<P><DD><DL></DL></DD><DD><DL><DT><B>Returns:</B><DD>Iterator over all previously-known FPs</DL></DD></DL><HR><A NAME="addNewFp(long)"><!-- --></A><H3>addNewFp</H3><PRE>protected abstract void <B>addNewFp</B>(long&nbsp;fp)</PRE><DL><DD>Add an FP (which may be an old or new FP) to the new complete list. Should only be called after beginFpMerge() and before finishFpMerge().<P><DD><DL></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>fp</CODE> - the FP to add</DL></DD></DL><HR><A NAME="finishFpMerge()"><!-- --></A><H3>finishFpMerge</H3><PRE>protected abstract void <B>finishFpMerge</B>()</PRE><DL><DD>Complete the merge of candidate and previously-known FPs (closing files/iterators as appropriate).<P><DD><DL></DL></DD><DD><DL></DL></DD></DL><HR><A NAME="close()"><!-- --></A><H3>close</H3><PRE>public void <B>close</B>()</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#close()">UriUniqFilter</A></CODE></B></DD><DD>Close down any allocated resources. 
Makes sense calling this when checkpointing.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#close()">close</A></CODE> in interface <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html" title="interface in org.archive.crawler.datamodel">UriUniqFilter</A></CODE></DL></DD><DD><DL></DL></DD></DL><HR><A NAME="setProfileLog(java.io.File)"><!-- --></A><H3>setProfileLog</H3><PRE>public void <B>setProfileLog</B>(java.io.File&nbsp;logfile)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#setProfileLog(java.io.File)">UriUniqFilter</A></CODE></B></DD><DD>Set a File to receive a log for replay profiling.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html#setProfileLog(java.io.File)">setProfileLog</A></CODE> in interface <CODE><A HREF="../../../../org/archive/crawler/datamodel/UriUniqFilter.html" title="interface in org.archive.crawler.datamodel">UriUniqFilter</A></CODE></DL></DD><DD><DL></DL></DD></DL><!-- ========= END OF CLASS DATA ========= --><HR><!-- ======= START OF BOTTOM NAVBAR ====== --><A NAME="navbar_bottom"><!-- --></A><A HREF="#skip-navbar_bottom" title="Skip navigation links"></A><TABLE BORDER="0" WIDTH="100%" CELLPADDING="1" CELLSPACING="0" SUMMARY=""><TR><TD COLSPAN=2 BGCOLOR="#EEEEFF" CLASS="NavBarCell1"><A NAME="navbar_bottom_firstrow"><!-- --></A><TABLE BORDER="0" CELLPADDING="0" CELLSPACING="3" SUMMARY="">  <TR ALIGN="center" VALIGN="top">  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../overview-summary.html"><FONT CLASS="NavBarFont1"><B>Overview</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="package-summary.html"><FONT CLASS="NavBarFont1"><B>Package</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#FFFFFF" CLASS="NavBarCell1Rev"> &nbsp;<FONT 
CLASS="NavBarFont1Rev"><B>Class</B></FONT>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="class-use/FPMergeUriUniqFilter.html"><FONT CLASS="NavBarFont1"><B>Use</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="package-tree.html"><FONT CLASS="NavBarFont1"><B>Tree</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../deprecated-list.html"><FONT CLASS="NavBarFont1"><B>Deprecated</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../index-all.html"><FONT CLASS="NavBarFont1"><B>Index</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../help-doc.html"><FONT CLASS="NavBarFont1"><B>Help</B></FONT></A>&nbsp;</TD>  </TR></TABLE></TD><TD ALIGN="right" VALIGN="top" ROWSPAN=3><EM></EM></TD></TR><TR><TD BGCOLOR="white" CLASS="NavBarCell2"><FONT SIZE="-2">&nbsp;<A HREF="../../../../org/archive/crawler/util/DiskFPMergeUriUniqFilter.DataFileLongIterator.html" title="class in org.archive.crawler.util"><B>PREV CLASS</B></A>&nbsp;&nbsp;<A HREF="../../../../org/archive/crawler/util/FPMergeUriUniqFilter.PendingItem.html" title="class in org.archive.crawler.util"><B>NEXT CLASS</B></A></FONT></TD><TD BGCOLOR="white" CLASS="NavBarCell2"><FONT SIZE="-2">  <A HREF="../../../../index.html?org/archive/crawler/util/FPMergeUriUniqFilter.html" target="_top"><B>FRAMES</B></A>  &nbsp;&nbsp;<A HREF="FPMergeUriUniqFilter.html" target="_top"><B>NO FRAMES</B></A>  &nbsp;&nbsp;<SCRIPT type="text/javascript">  <!--  if(window==top) {    document.writeln('<A HREF="../../../../allclasses-noframe.html"><B>All Classes</B></A>');  }  //--></SCRIPT><NOSCRIPT>  <A HREF="../../../../allclasses-noframe.html"><B>All Classes</B></A></NOSCRIPT></FONT></TD></TR><TR><TD VALIGN="top" CLASS="NavBarCell3"><FONT SIZE="-2">  SUMMARY:&nbsp;<A HREF="#nested_class_summary">NESTED</A>&nbsp;|&nbsp;<A HREF="#field_summary">FIELD</A>&nbsp;|&nbsp;<A 
HREF="#constructor_summary">CONSTR</A>&nbsp;|&nbsp;<A HREF="#method_summary">METHOD</A></FONT></TD><TD VALIGN="top" CLASS="NavBarCell3"><FONT SIZE="-2">DETAIL:&nbsp;<A HREF="#field_detail">FIELD</A>&nbsp;|&nbsp;<A HREF="#constructor_detail">CONSTR</A>&nbsp;|&nbsp;<A HREF="#method_detail">METHOD</A></FONT></TD></TR></TABLE><A NAME="skip-navbar_bottom"></A><!-- ======== END OF BOTTOM NAVBAR ======= --><HR>Copyright &copy; 2003-2006 Internet Archive. All Rights Reserved.</BODY></HTML>
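
The pend/flush descriptions above outline a merge-based duplicate filter: candidates are held as pending fingerprints, and a later merge against the sorted list of previously-known fingerprints decides which ones are new and should go to the receiver. The following is a minimal in-memory sketch of that idea, not the Heritrix implementation. The class name MergeFilterSketch, the String payload (standing in for CandidateURI), the Consumer receiver (standing in for HasUriReceiver), and the simple polynomial fingerprint are all assumptions made for illustration; the real createFp algorithm is not shown in this page.

```java
import java.util.Arrays;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Consumer;

/** Hypothetical illustration of a fingerprint-merge uniqueness filter. */
class MergeFilterSketch {
    // Sorted list of previously-known fingerprints (assumed to fit in memory here).
    private long[] known = new long[0];
    // Pending fingerprints in sorted order, each with its candidate item.
    private final TreeMap<Long, String> pending = new TreeMap<>();
    // Stand-in for the receiver (frontier) that gets newly accepted items.
    private final Consumer<String> receiver;

    MergeFilterSketch(Consumer<String> receiver) {
        this.receiver = receiver;
    }

    /** Stand-in fingerprint function; an assumption, not the library's createFp. */
    static long fingerprint(CharSequence key) {
        long h = 1125899906842597L;
        for (int i = 0; i < key.length(); i++) {
            h = 31 * h + key.charAt(i);
        }
        return h;
    }

    /** Queue an item; acceptance is decided only at the next flush. */
    void add(String key, String value) {
        pending.putIfAbsent(fingerprint(key), value);
    }

    /** Merge pending fingerprints into the known list; return how many were new. */
    long flush() {
        long[] merged = new long[known.length + pending.size()];
        int n = 0;      // next write position in merged
        int k = 0;      // next read position in known
        long added = 0;
        for (Map.Entry<Long, String> e : pending.entrySet()) {
            long fp = e.getKey();
            // Copy over all known fingerprints smaller than the pending one.
            while (k < known.length && known[k] < fp) {
                merged[n++] = known[k++];
            }
            if (k < known.length && known[k] == fp) {
                continue; // already known: drop; known[k] is copied on a later pass
            }
            merged[n++] = fp;
            added++;
            if (e.getValue() != null) {
                receiver.accept(e.getValue()); // forward only genuinely new items
            }
        }
        // Copy whatever known fingerprints remain.
        while (k < known.length) {
            merged[n++] = known[k++];
        }
        known = Arrays.copyOf(merged, n);
        pending.clear();
        return added;
    }
}
```

Because both the pending set and the known list are sorted, a single linear pass suffices per flush, which is presumably why the abstract beginFpMerge/addNewFp/finishFpMerge hooks expose the known fingerprints as an iterator and accept the rebuilt list one value at a time.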