📄 crawljobhandler.html
字号:
<TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#crawlStarted(java.lang.String)">crawlStarted</A></B>(java.lang.String message)</CODE><BR> Called on crawl start.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected <A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#createNewJob(java.io.File, java.lang.String, java.lang.String, java.lang.String, int)">createNewJob</A></B>(java.io.File orderFile, java.lang.String name, java.lang.String description, java.lang.String seeds, int priority)</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected <A HREF="../../../../org/archive/crawler/settings/XMLSettingsHandler.html" title="class in org.archive.crawler.settings">XMLSettingsHandler</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#createSettingsHandler(java.io.File, java.lang.String, java.lang.String, java.lang.String, java.io.File, org.archive.crawler.admin.CrawlJobErrorHandler, java.lang.String, java.lang.String)">createSettingsHandler</A></B>(java.io.File orderFile, java.lang.String name, java.lang.String description, java.lang.String seeds, java.io.File newSettingsDir, <A HREF="../../../../org/archive/crawler/admin/CrawlJobErrorHandler.html" title="class in org.archive.crawler.admin">CrawlJobErrorHandler</A> errorHandler, java.lang.String filename, java.lang.String seedfile)</CODE><BR> Creates a new settings handler based on an existing job.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#deleteJob(java.lang.String)">deleteJob</A></B>(java.lang.String jobUID)</CODE><BR> The specified job will be removed from the pending queue or aborted if currently running.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#deleteProfile(org.archive.crawler.admin.CrawlJob)">deleteProfile</A></B>(<A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A> cj)</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> long</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#deleteURIsFromPending(java.lang.String)">deleteURIsFromPending</A></B>(java.lang.String regexpr)</CODE><BR> Delete any URI from the frontier of the current (paused) job that match the specified regular expression.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#discardNewJob()">discardNewJob</A></B>()</CODE><BR> Discard the handler's 'new job'.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#doFlush()">doFlush</A></B>()</CODE><BR> If its a HostQueuesFrontier, needs to be flushed for the queued.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>static <A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#ensureNewJobWritten(org.archive.crawler.admin.CrawlJob, java.lang.String, java.lang.String)">ensureNewJobWritten</A></B>(<A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A> newJob, java.lang.String metaname, java.lang.String description)</CODE><BR> Ensure order file with new name/desc is written.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.util.List</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getCompletedJobs()">getCompletedJobs</A></B>()</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> <A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getCurrentJob()">getCurrentJob</A></B>()</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> <A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getDefaultProfile()">getDefaultProfile</A></B>()</CODE><BR> Returns the default profile.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> <A HREF="../../../../org/archive/crawler/framework/FrontierMarker.html" title="interface in org.archive.crawler.framework">FrontierMarker</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getInitialMarker(java.lang.String, boolean)">getInitialMarker</A></B>(java.lang.String regexpr, boolean inCacheOnly)</CODE><BR> Returns a URIFrontierMarker for the current, paused, job.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> <A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getJob(java.lang.String)">getJob</A></B>(java.lang.String jobUID)</CODE><BR> Return a job with the given UID.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> <A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getNewJob()">getNewJob</A></B>()</CODE><BR> Get the handler's 'new job'</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getNextJobUID()">getNextJobUID</A></B>()</CODE><BR> Returns a unique job ID.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.util.List</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getPendingJobs()">getPendingJobs</A></B>()</CODE><BR> A List of all pending jobs</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.util.ArrayList</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getPendingURIsList(org.archive.crawler.framework.FrontierMarker, int, boolean)">getPendingURIsList</A></B>(<A HREF="../../../../org/archive/crawler/framework/FrontierMarker.html" title="interface in org.archive.crawler.framework">FrontierMarker</A> marker, int numberOfMatches, boolean verbose)</CODE><BR> Returns the frontiers URI list based on the provided marker.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.util.List</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getProfiles()">getProfiles</A></B>()</CODE><BR> Returns a List of all known profiles.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected java.io.File</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#getStateJobFile(java.io.File)">getStateJobFile</A></B>(java.io.File jobDir)</CODE><BR> Find the state.job file in the job directory.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#importUri(java.lang.String, boolean, boolean)">importUri</A></B>(java.lang.String uri, boolean forceFetch, boolean isSeed)</CODE><BR> Schedule a uri.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#importUri(java.lang.String, boolean, boolean, boolean)">importUri</A></B>(java.lang.String str, boolean forceFetch, boolean isSeed, boolean isFlush)</CODE><BR> Schedule a uri.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected int</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#importUris(java.io.InputStream, java.lang.String, boolean)">importUris</A></B>(java.io.InputStream is, java.lang.String style, boolean forceRevisit)</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#importUris(java.lang.String, java.lang.String, boolean)">importUris</A></B>(java.lang.String fileOrUrl, java.lang.String style, boolean forceRevisit)</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> java.lang.String</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#importUris(java.lang.String, java.lang.String, java.lang.String)">importUris</A></B>(java.lang.String file, java.lang.String style, java.lang.String force)</CODE><BR> </TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> boolean</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#isCrawling()">isCrawling</A></B>()</CODE><BR> Is a crawl job being crawled?</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> boolean</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#isRunning()">isRunning</A></B>()</CODE><BR> Is the crawler accepting crawl jobs to run?</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#kickUpdate()">kickUpdate</A></B>()</CODE><BR> Forward a 'kick' update to current job if any.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected void</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#loadJob(java.io.File)">loadJob</A></B>(java.io.File job)</CODE><BR> Loads a job given a specific job file.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>static java.util.ArrayList</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#loadOptions(java.lang.String)">loadOptions</A></B>(java.lang.String file)</CODE><BR> Loads options from a file.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE>protected boolean</CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#loadProfile(java.io.File)">loadProfile</A></B>(java.io.File profile)</CODE><BR> Load one profile.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> <A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#newJob(org.archive.crawler.admin.CrawlJob, java.lang.String, java.lang.String, java.lang.String, java.lang.String, int)">newJob</A></B>(<A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A> baseOn, java.lang.String recovery, java.lang.String name, java.lang.String description, java.lang.String seeds, int priority)</CODE><BR> Creates a new job.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> <A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#newJob(java.io.File, java.lang.String, java.lang.String, java.lang.String)">newJob</A></B>(java.io.File orderFile, java.lang.String name, java.lang.String description, java.lang.String seeds)</CODE><BR> Creates a new job.</TD></TR><TR BGCOLOR="white" CLASS="TableRowColor"><TD ALIGN="right" VALIGN="top" WIDTH="1%"><FONT SIZE="-1"><CODE> <A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A></CODE></FONT></TD><TD><CODE><B><A HREF="../../../../org/archive/crawler/admin/CrawlJobHandler.html#newProfile(org.archive.crawler.admin.CrawlJob, java.lang.String, java.lang.String, java.lang.String)">newProfile</A></B>(<A HREF="../../../../org/archive/crawler/admin/CrawlJob.html" title="class in org.archive.crawler.admin">CrawlJob</A> baseOn, java.lang.String name, java.lang.String description, java.lang.String seeds)</CODE><BR>
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -