persiststoreprocessor.html

<DD><B>Description copied from class: <CODE><A HREF="../../../../../org/archive/crawler/framework/Processor.html#innerProcess(org.archive.crawler.datamodel.CrawlURI)">Processor</A></CODE></B></DD><DD>Classes subclassing this one should override this method to perform their custom actions on the CrawlURI.<P><DD><DL><DT><B>Overrides:</B><DD><CODE><A HREF="../../../../../org/archive/crawler/framework/Processor.html#innerProcess(org.archive.crawler.datamodel.CrawlURI)">innerProcess</A></CODE> in class <CODE><A HREF="../../../../../org/archive/crawler/framework/Processor.html" title="class in org.archive.crawler.framework">Processor</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>curi</CODE> - The CrawlURI being processed.<DT><B>Throws:</B><DD><CODE>java.lang.InterruptedException</CODE></DL></DD></DL><HR><A NAME="crawlCheckpoint(java.io.File)"><!-- --></A><H3>crawlCheckpoint</H3><PRE>public void <B>crawlCheckpoint</B>(java.io.File&nbsp;checkpointDir)                     throws java.lang.Exception</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlCheckpoint(java.io.File)">CrawlStatusListener</A></CODE></B></DD><DD>Called by <A HREF="../../../../../org/archive/crawler/framework/CrawlController.html" title="class in org.archive.crawler.framework"><CODE>CrawlController</CODE></A> when checkpointing.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlCheckpoint(java.io.File)">crawlCheckpoint</A></CODE> in interface <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html" title="interface in org.archive.crawler.event">CrawlStatusListener</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>checkpointDir</CODE> - Checkpoint dir.  Write checkpoint state here.<DT><B>Throws:</B><DD><CODE>java.lang.Exception</CODE> - A fatal exception.  
Any exception that escapes this method is assumed fatal and terminates further checkpoint processing.</DL></DD></DL><HR><A NAME="crawlEnded(java.lang.String)"><!-- --></A><H3>crawlEnded</H3><PRE>public void <B>crawlEnded</B>(java.lang.String&nbsp;sExitMessage)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlEnded(java.lang.String)">CrawlStatusListener</A></CODE></B></DD><DD>Called when a CrawlController has ended a crawl and is about to exit.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlEnded(java.lang.String)">crawlEnded</A></CODE> in interface <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html" title="interface in org.archive.crawler.event">CrawlStatusListener</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>sExitMessage</CODE> - Type of exit. Should be one of the STATUS constants defined in CrawlJob.<DT><B>See Also:</B><DD><A HREF="../../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin"><CODE>CrawlJob</CODE></A></DL></DD></DL>
<HR><A NAME="crawlEnding(java.lang.String)"><!-- --></A><H3>crawlEnding</H3><PRE>public void <B>crawlEnding</B>(java.lang.String&nbsp;sExitMessage)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlEnding(java.lang.String)">CrawlStatusListener</A></CODE></B></DD><DD>Called when a CrawlController is ending a crawl (for any reason).<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlEnding(java.lang.String)">crawlEnding</A></CODE> in interface <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html" title="interface in org.archive.crawler.event">CrawlStatusListener</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>sExitMessage</CODE> - Type of exit. Should be one of the STATUS constants defined in CrawlJob.<DT><B>See Also:</B><DD><A HREF="../../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin"><CODE>CrawlJob</CODE></A></DL></DD></DL>
<HR><A NAME="crawlPaused(java.lang.String)"><!-- --></A><H3>crawlPaused</H3><PRE>public void <B>crawlPaused</B>(java.lang.String&nbsp;statusMessage)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlPaused(java.lang.String)">CrawlStatusListener</A></CODE></B></DD><DD>Called when a CrawlController is actually paused (all threads are idle).<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlPaused(java.lang.String)">crawlPaused</A></CODE> in interface <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html" title="interface in org.archive.crawler.event">CrawlStatusListener</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>statusMessage</CODE> - Should be <A HREF="../../../../../org/archive/crawler/admin/CrawlJob.html#STATUS_PAUSED"><CODE>CrawlJob.STATUS_PAUSED</CODE></A>. 
Passed for convenience.</DL></DD></DL><HR><A NAME="crawlPausing(java.lang.String)"><!-- --></A><H3>crawlPausing</H3><PRE>public void <B>crawlPausing</B>(java.lang.String&nbsp;statusMessage)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlPausing(java.lang.String)">CrawlStatusListener</A></CODE></B></DD><DD>Called when a CrawlController is going to be paused.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlPausing(java.lang.String)">crawlPausing</A></CODE> in interface <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html" title="interface in org.archive.crawler.event">CrawlStatusListener</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>statusMessage</CODE> - Should be <A HREF="../../../../../org/archive/crawler/admin/CrawlJob.html#STATUS_WAITING_FOR_PAUSE"><CODE>STATUS_WAITING_FOR_PAUSE</CODE></A>. 
Passed for convenience.</DL></DD></DL><HR><A NAME="crawlResuming(java.lang.String)"><!-- --></A><H3>crawlResuming</H3><PRE>public void <B>crawlResuming</B>(java.lang.String&nbsp;statusMessage)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlResuming(java.lang.String)">CrawlStatusListener</A></CODE></B></DD><DD>Called when a CrawlController is resuming a crawl that had been paused.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlResuming(java.lang.String)">crawlResuming</A></CODE> in interface <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html" title="interface in org.archive.crawler.event">CrawlStatusListener</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>statusMessage</CODE> - Should be <A HREF="../../../../../org/archive/crawler/admin/CrawlJob.html#STATUS_RUNNING"><CODE>CrawlJob.STATUS_RUNNING</CODE></A>. 
Passed for convenience</DL></DD></DL><HR><A NAME="crawlStarted(java.lang.String)"><!-- --></A><H3>crawlStarted</H3><PRE>public void <B>crawlStarted</B>(java.lang.String&nbsp;message)</PRE><DL><DD><B>Description copied from interface: <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlStarted(java.lang.String)">CrawlStatusListener</A></CODE></B></DD><DD>Called on crawl start.<P><DD><DL><DT><B>Specified by:</B><DD><CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html#crawlStarted(java.lang.String)">crawlStarted</A></CODE> in interface <CODE><A HREF="../../../../../org/archive/crawler/event/CrawlStatusListener.html" title="interface in org.archive.crawler.event">CrawlStatusListener</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>message</CODE> - Start message.</DL></DD></DL><!-- ========= END OF CLASS DATA ========= --><HR><!-- ======= START OF BOTTOM NAVBAR ====== --><A NAME="navbar_bottom"><!-- --></A><A HREF="#skip-navbar_bottom" title="Skip navigation links"></A><TABLE BORDER="0" WIDTH="100%" CELLPADDING="1" CELLSPACING="0" SUMMARY=""><TR><TD COLSPAN=2 BGCOLOR="#EEEEFF" CLASS="NavBarCell1"><A NAME="navbar_bottom_firstrow"><!-- --></A><TABLE BORDER="0" CELLPADDING="0" CELLSPACING="3" SUMMARY="">  <TR ALIGN="center" VALIGN="top">  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../../overview-summary.html"><FONT CLASS="NavBarFont1"><B>Overview</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="package-summary.html"><FONT CLASS="NavBarFont1"><B>Package</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#FFFFFF" CLASS="NavBarCell1Rev"> &nbsp;<FONT CLASS="NavBarFont1Rev"><B>Class</B></FONT>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="class-use/PersistStoreProcessor.html"><FONT CLASS="NavBarFont1"><B>Use</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="package-tree.html"><FONT 
CLASS="NavBarFont1"><B>Tree</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../../deprecated-list.html"><FONT CLASS="NavBarFont1"><B>Deprecated</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../../index-all.html"><FONT CLASS="NavBarFont1"><B>Index</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../../help-doc.html"><FONT CLASS="NavBarFont1"><B>Help</B></FONT></A>&nbsp;</TD>  </TR></TABLE></TD><TD ALIGN="right" VALIGN="top" ROWSPAN=3><EM></EM></TD></TR><TR><TD BGCOLOR="white" CLASS="NavBarCell2"><FONT SIZE="-2">&nbsp;<A HREF="../../../../../org/archive/crawler/processor/recrawl/PersistProcessor.html" title="class in org.archive.crawler.processor.recrawl"><B>PREV CLASS</B></A>&nbsp;&nbsp;NEXT CLASS</FONT></TD><TD BGCOLOR="white" CLASS="NavBarCell2"><FONT SIZE="-2">  <A HREF="../../../../../index.html?org/archive/crawler/processor/recrawl/PersistStoreProcessor.html" target="_top"><B>FRAMES</B></A>  &nbsp;&nbsp;<A HREF="PersistStoreProcessor.html" target="_top"><B>NO FRAMES</B></A>  &nbsp;&nbsp;<SCRIPT type="text/javascript">  <!--  if(window==top) {    document.writeln('<A HREF="../../../../../allclasses-noframe.html"><B>All Classes</B></A>');  }  //--></SCRIPT><NOSCRIPT>  <A HREF="../../../../../allclasses-noframe.html"><B>All Classes</B></A></NOSCRIPT></FONT></TD></TR><TR><TD VALIGN="top" CLASS="NavBarCell3"><FONT SIZE="-2">  SUMMARY:&nbsp;<A HREF="#nested_classes_inherited_from_class_org.archive.crawler.settings.ComplexType">NESTED</A>&nbsp;|&nbsp;<A HREF="#fields_inherited_from_class_org.archive.crawler.processor.recrawl.PersistOnlineProcessor">FIELD</A>&nbsp;|&nbsp;<A HREF="#constructor_summary">CONSTR</A>&nbsp;|&nbsp;<A HREF="#method_summary">METHOD</A></FONT></TD><TD VALIGN="top" CLASS="NavBarCell3"><FONT SIZE="-2">DETAIL:&nbsp;FIELD&nbsp;|&nbsp;<A HREF="#constructor_detail">CONSTR</A>&nbsp;|&nbsp;<A 
HREF="#method_detail">METHOD</A></FONT></TD></TR></TABLE><A NAME="skip-navbar_bottom"></A><!-- ======== END OF BOTTOM NAVBAR ======= --><HR>Copyright &copy; 2003-2007 Internet Archive. All Rights Reserved.</BODY></HTML>
