crawlerjournal.html
来自「网络爬虫开源代码」· HTML 代码 · 共 732 行 · 第 1/2 页
HTML
732 行
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd"><!--NewPage--><HTML><HEAD><!-- Generated by javadoc (build 1.5.0_07) on Sun May 06 17:59:52 GMT 2007 --><TITLE>CrawlerJournal (Heritrix 1.12.1)</TITLE><META NAME="keywords" CONTENT="org.archive.crawler.io.CrawlerJournal class"><LINK REL ="stylesheet" TYPE="text/css" HREF="../../../../stylesheet.css" TITLE="Style"><SCRIPT type="text/javascript">function windowTitle(){ parent.document.title="CrawlerJournal (Heritrix 1.12.1)";}</SCRIPT><NOSCRIPT></NOSCRIPT></HEAD><BODY BGCOLOR="white" onload="windowTitle();"><!-- ========= START OF TOP NAVBAR ======= --><A NAME="navbar_top"><!-- --></A><A HREF="#skip-navbar_top" title="Skip navigation links"></A><TABLE BORDER="0" WIDTH="100%" CELLPADDING="1" CELLSPACING="0" SUMMARY=""><TR><TD COLSPAN=2 BGCOLOR="#EEEEFF" CLASS="NavBarCell1"><A NAME="navbar_top_firstrow"><!-- --></A><TABLE BORDER="0" CELLPADDING="0" CELLSPACING="3" SUMMARY=""> <TR ALIGN="center" VALIGN="top"> <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1"> <A HREF="../../../../overview-summary.html"><FONT CLASS="NavBarFont1"><B>Overview</B></FONT></A> </TD> <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1"> <A HREF="package-summary.html"><FONT CLASS="NavBarFont1"><B>Package</B></FONT></A> </TD> <TD BGCOLOR="#FFFFFF" CLASS="NavBarCell1Rev"> <FONT CLASS="NavBarFont1Rev"><B>Class</B></FONT> </TD> <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1"> <A HREF="class-use/CrawlerJournal.html"><FONT CLASS="NavBarFont1"><B>Use</B></FONT></A> </TD> <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1"> <A HREF="package-tree.html"><FONT CLASS="NavBarFont1"><B>Tree</B></FONT></A> </TD> <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1"> <A HREF="../../../../deprecated-list.html"><FONT CLASS="NavBarFont1"><B>Deprecated</B></FONT></A> </TD> <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1"> <A HREF="../../../../index-all.html"><FONT CLASS="NavBarFont1"><B>Index</B></FONT></A> </TD> <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1"> <A HREF="../../../../help-doc.html"><FONT CLASS="NavBarFont1"><B>Help</B></FONT></A> </TD> </TR></TABLE></TD><TD ALIGN="right" VALIGN="top" ROWSPAN=3><EM></EM></TD></TR><TR><TD BGCOLOR="white" CLASS="NavBarCell2"><FONT SIZE="-2"> PREV CLASS <A HREF="../../../../org/archive/crawler/io/LocalErrorFormatter.html" title="class in org.archive.crawler.io"><B>NEXT CLASS</B></A></FONT></TD><TD BGCOLOR="white" CLASS="NavBarCell2"><FONT SIZE="-2"> <A HREF="../../../../index.html?org/archive/crawler/io/CrawlerJournal.html" target="_top"><B>FRAMES</B></A> <A HREF="CrawlerJournal.html" target="_top"><B>NO FRAMES</B></A> <SCRIPT type="text/javascript"> <!-- if(window==top) { document.writeln('<A HREF="../../../../allclasses-noframe.html"><B>All Classes</B></A>'); } //--></SCRIPT><NOSCRIPT> <A HREF="../../../../allclasses-noframe.html"><B>All Classes</B></A></NOSCRIPT></FONT></TD></TR><TR><TD VALIGN="top" CLASS="NavBarCell3"><FONT SIZE="-2"> SUMMARY: NESTED | <A HREF="#field_summary">FIELD</A> | <A HREF="#constructor_summary">CONSTR</A> | <A HREF="#method_summary">METHOD</A></FONT></TD><TD VALIGN="top" CLASS="NavBarCell3"><FONT SIZE="-2">DETAIL: <A HREF="#field_detail">FIELD</A> | <A HREF="#constructor_detail">CONSTR</A> | <A HREF="#method_detail">METHOD</A></FONT></TD></TR></TABLE><A NAME="skip-navbar_top"></A><!-- ========= END OF TOP NAVBAR ========= --><HR><!-- ======== START OF CLASS DATA ======== --><H2><FONT SIZE="-1">org.archive.crawler.io</FONT><BR>Class CrawlerJournal</H2><PRE>java.lang.Object <IMG SRC="../../../../resources/inherit.gif" ALT="extended by "><B>org.archive.crawler.io.CrawlerJournal</B></PRE><DL><DT><B>Direct Known Subclasses:</B> <DD><A HREF="../../../../org/archive/crawler/frontier/RecoveryJournal.html" title="class in org.archive.crawler.frontier">RecoveryJournal</A></DD></DL><HR><DL><DT><PRE>public class <B>CrawlerJournal</B><DT>extends java.lang.Object</DL></PRE><P>Utility class for a crawler journal/log that is compressed and rotates by serial number at checkpoints.<P><P><DL><DT><B>Author:</B></DT> <DD>gojomo</DD></DL><HR><P><!-- =========== FIELD SUMMARY =========== --><A NAME="field_summary"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2"><B>Field Summary</B></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected it.unimi.dsi.mg4j.util.MutableString</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#accumulatingBuffer">accumulatingBuffer</A></B></CODE><BR> Allocate a buffer for accumulating lines to write and reuse it.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>static java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#GZIP_SUFFIX">GZIP_SUFFIX</A></B></CODE><BR> suffix to recognize gzipped files</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected java.io.File</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#gzipFile">gzipFile</A></B></CODE><BR> File we're writing journal to.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected long</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#lines">lines</A></B></CODE><BR> line count</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>static java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#LOG_ERROR">LOG_ERROR</A></B></CODE><BR> prefix for error lines</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>static java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#LOG_TIMESTAMP">LOG_TIMESTAMP</A></B></CODE><BR> prefix for timestamp lines</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected java.io.Writer</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#out">out</A></B></CODE><BR> Stream on which we record frontier events.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected int</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#timestamp_interval">timestamp_interval</A></B></CODE><BR> number of lines between timestamps</TD></TR></TABLE> <!-- ======== CONSTRUCTOR SUMMARY ======== --><A NAME="constructor_summary"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2"><B>Constructor Summary</B></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#CrawlerJournal(java.io.File)">CrawlerJournal</A></B>(java.io.File file)</CODE><BR> Create a new crawler journal at the given location</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#CrawlerJournal(java.lang.String, java.lang.String)">CrawlerJournal</A></B>(java.lang.String path, java.lang.String filename)</CODE><BR> Create a new crawler journal at the given location</TD></TR></TABLE> <!-- ========== METHOD SUMMARY =========== --><A NAME="method_summary"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="2"><FONT SIZE="+2"><B>Method Summary</B></FONT></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#checkpoint(java.io.File)">checkpoint</A></B>(java.io.File checkpointDir)</CODE><BR> Handle a checkpoint by rotating the current log to a checkpoint-named file and starting a new log.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#close()">close</A></B>()</CODE><BR> Flush and close the underlying IO objects.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#considerTimestamp()">considerTimestamp</A></B>()</CODE><BR> Write a timestamp line if appropriate</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>static java.io.BufferedInputStream</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#getBufferedInput(java.io.File)">getBufferedInput</A></B>(java.io.File source)</CODE><BR> Get a BufferedInputStream on the recovery file given.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>static java.io.BufferedReader</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#getBufferedReader(java.io.File)">getBufferedReader</A></B>(java.io.File source)</CODE><BR> Get a BufferedReader on the crawler journal given</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected java.io.Writer</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#initialize(java.io.File)">initialize</A></B>(java.io.File f)</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#noteLine()">noteLine</A></B>()</CODE><BR> Count and note a line</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#seriousError(java.lang.String)">seriousError</A></B>(java.lang.String err)</CODE><BR> Note a serious error vioa a special log line</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#writeLine(it.unimi.dsi.mg4j.util.MutableString)">writeLine</A></B>(it.unimi.dsi.mg4j.util.MutableString mstring)</CODE><BR> Write a line.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#writeLine(java.lang.String)">writeLine</A></B>(java.lang.String string)</CODE><BR> Write a line</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#writeLine(java.lang.String, java.lang.String)">writeLine</A></B>(java.lang.String s1, java.lang.String s2)</CODE><BR> Write a line of two strings</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/io/CrawlerJournal.html#writeLine(java.lang.String, java.lang.String, java.lang.String)">writeLine</A></B>(java.lang.String s1, java.lang.String s2, java.lang.String s3)</CODE><BR> Write a line of three strings</TD></TR></TABLE> <A NAME="methods_inherited_from_class_java.lang.Object"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#EEEEFF" CLASS="TableSubHeadingColor"><TH ALIGN="left"><B>Methods inherited from class java.lang.Object</B></TH></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD><CODE>clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait</CODE></TD></TR></TABLE> <P><!-- ============ FIELD DETAIL =========== --><A NAME="field_detail"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="1"><FONT SIZE="+2"><B>Field Detail</B></FONT></TH></TR></TABLE><A NAME="LOG_ERROR"><!-- --></A><H3>LOG_ERROR</H3><PRE>public static final java.lang.String <B>LOG_ERROR</B></PRE><DL><DD>prefix for error lines<P><DL><DT><B>See Also:</B><DD><A HREF="../../../../constant-values.html#org.archive.crawler.io.CrawlerJournal.LOG_ERROR">Constant Field Values</A></DL></DL><HR><A NAME="LOG_TIMESTAMP"><!-- --></A><H3>LOG_TIMESTAMP</H3><PRE>public static final java.lang.String <B>LOG_TIMESTAMP</B></PRE><DL><DD>prefix for timestamp lines<P><DL><DT><B>See Also:</B><DD><A HREF="../../../../constant-values.html#org.archive.crawler.io.CrawlerJournal.LOG_TIMESTAMP">Constant Field Values</A></DL></DL><HR>
⌨️ 快捷键说明
复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?