📄 fetchftp.html

📁 An open-source web crawler
<A NAME="method_detail"><!-- --></A><TABLE BORDER="1" WIDTH="100%" CELLPADDING="3" CELLSPACING="0" SUMMARY=""><TR BGCOLOR="#CCCCFF" CLASS="TableHeadingColor"><TH ALIGN="left" COLSPAN="1"><FONT SIZE="+2"><B>Method Detail</B></FONT></TH></TR></TABLE><A NAME="innerProcess(org.archive.crawler.datamodel.CrawlURI)"><!-- --></A><H3>innerProcess</H3><PRE>public void <B>innerProcess</B>(<A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)                  throws java.lang.InterruptedException</PRE><DL><DD>Processes the given URI.  If the given URI is not an FTP URI, then this method does nothing.  Otherwise an attempt is made to connect to the FTP server.  <p>If the connection is successful, an attempt will be made to CD to  the path specified in the URI.  If the remote CD command succeeds,  then it is assumed that the URI represents a directory.  If the CD command fails, then it is assumed that the URI represents a file.  <p>For directories, the directory listing will be fetched using the FTP LIST command, and saved to the HttpRecorder.  If the <code>extract.from.dirs</code> attribute is set to true, then the files in the fetched list will be added to the curi as extracted FTP links.  (It was easier to do that here, rather than writing a separate FTPExtractor.)  <p>For files, the file will be fetched using the FTP RETR command, and saved to the HttpRecorder.  <p>All file transfers (including directory listings) occur using Binary mode transfer.  
Also, the local passive transfer mode is always used, to play well with firewalls.<P><DD><DL><DT><B>Overrides:</B><DD><CODE><A HREF="../../../../org/archive/crawler/framework/Processor.html#innerProcess(org.archive.crawler.datamodel.CrawlURI)">innerProcess</A></CODE> in class <CODE><A HREF="../../../../org/archive/crawler/framework/Processor.html" title="class in org.archive.crawler.framework">Processor</A></CODE></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>curi</CODE> - the curi to process<DT><B>Throws:</B><DD><CODE>java.lang.InterruptedException</CODE> - if the thread is interrupted during processing</DL></DD></DL><HR><A NAME="getExtractFromDirs(org.archive.crawler.datamodel.CrawlURI)"><!-- --></A><H3>getExtractFromDirs</H3><PRE>public boolean <B>getExtractFromDirs</B>(<A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</PRE><DL><DD>Returns the <code>extract.from.dirs</code> attribute for this <code>FetchFTP</code> and the given curi.<P><DD><DL></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>curi</CODE> - the curi whose attribute to return<DT><B>Returns:</B><DD>that curi's <code>extract.from.dirs</code></DL></DD></DL><HR><A NAME="getExtractParent(org.archive.crawler.datamodel.CrawlURI)"><!-- --></A><H3>getExtractParent</H3><PRE>public boolean <B>getExtractParent</B>(<A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</PRE><DL><DD>Returns the <code>extract-parent</code> attribute for this <code>FetchFTP</code> and the given curi.<P><DD><DL></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>curi</CODE> - the curi whose attribute to return<DT><B>Returns:</B><DD>that curi's <code>extract-parent</code></DL></DD></DL><HR><A NAME="getTimeout(org.archive.crawler.datamodel.CrawlURI)"><!-- --></A><H3>getTimeout</H3><PRE>public int <B>getTimeout</B>(<A 
HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</PRE><DL><DD>Returns the <code>timeout-seconds</code> attribute for this <code>FetchFTP</code> and the given curi.<P><DD><DL></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>curi</CODE> - the curi whose attribute to return<DT><B>Returns:</B><DD>that curi's <code>timeout-seconds</code></DL></DD></DL><HR><A NAME="getMaxLength(org.archive.crawler.datamodel.CrawlURI)"><!-- --></A><H3>getMaxLength</H3><PRE>public long <B>getMaxLength</B>(<A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</PRE><DL><DD>Returns the <code>max-length-bytes</code> attribute for this <code>FetchFTP</code> and the given curi.<P><DD><DL></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>curi</CODE> - the curi whose attribute to return<DT><B>Returns:</B><DD>that curi's <code>max-length-bytes</code></DL></DD></DL><HR><A NAME="getFetchBandwidth(org.archive.crawler.datamodel.CrawlURI)"><!-- --></A><H3>getFetchBandwidth</H3><PRE>public int <B>getFetchBandwidth</B>(<A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</PRE><DL><DD>Returns the <code>fetch-bandwidth</code> attribute for this <code>FetchFTP</code> and the given curi.<P><DD><DL></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>curi</CODE> - the curi whose attribute to return<DT><B>Returns:</B><DD>that curi's <code>fetch-bandwidth</code></DL></DD></DL><HR><A NAME="determinePassword(org.archive.crawler.datamodel.CrawlURI)"><!-- --></A><H3>determinePassword</H3><PRE>public java.lang.String <B>determinePassword</B>(<A HREF="../../../../org/archive/crawler/datamodel/CrawlURI.html" title="class in org.archive.crawler.datamodel">CrawlURI</A>&nbsp;curi)</PRE><DL><DD>Determines the password for the given URI.  
If the URI itself contains a password, then that password is returned.  Otherwise the settings system is probed for the <code>password</code> attribute, and the value for that attribute is returned.<P><DD><DL></DL></DD><DD><DL><DT><B>Parameters:</B><DD><CODE>curi</CODE> - the curi whose password to return<DT><B>Returns:</B><DD>that password</DL></DD></DL><!-- ========= END OF CLASS DATA ========= --><HR><!-- ======= START OF BOTTOM NAVBAR ====== --><A NAME="navbar_bottom"><!-- --></A><A HREF="#skip-navbar_bottom" title="Skip navigation links"></A><TABLE BORDER="0" WIDTH="100%" CELLPADDING="1" CELLSPACING="0" SUMMARY=""><TR><TD COLSPAN=2 BGCOLOR="#EEEEFF" CLASS="NavBarCell1"><A NAME="navbar_bottom_firstrow"><!-- --></A><TABLE BORDER="0" CELLPADDING="0" CELLSPACING="3" SUMMARY="">  <TR ALIGN="center" VALIGN="top">  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../overview-summary.html"><FONT CLASS="NavBarFont1"><B>Overview</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="package-summary.html"><FONT CLASS="NavBarFont1"><B>Package</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#FFFFFF" CLASS="NavBarCell1Rev"> &nbsp;<FONT CLASS="NavBarFont1Rev"><B>Class</B></FONT>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="class-use/FetchFTP.html"><FONT CLASS="NavBarFont1"><B>Use</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="package-tree.html"><FONT CLASS="NavBarFont1"><B>Tree</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../deprecated-list.html"><FONT CLASS="NavBarFont1"><B>Deprecated</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../index-all.html"><FONT CLASS="NavBarFont1"><B>Index</B></FONT></A>&nbsp;</TD>  <TD BGCOLOR="#EEEEFF" CLASS="NavBarCell1">    <A HREF="../../../../help-doc.html"><FONT CLASS="NavBarFont1"><B>Help</B></FONT></A>&nbsp;</TD>  </TR></TABLE></TD><TD ALIGN="right" 
VALIGN="top" ROWSPAN=3><EM></EM></TD></TR><TR><TD BGCOLOR="white" CLASS="NavBarCell2"><FONT SIZE="-2">&nbsp;<A HREF="../../../../org/archive/crawler/fetcher/FetchDNS.html" title="class in org.archive.crawler.fetcher"><B>PREV CLASS</B></A>&nbsp;&nbsp;<A HREF="../../../../org/archive/crawler/fetcher/FetchHTTP.html" title="class in org.archive.crawler.fetcher"><B>NEXT CLASS</B></A></FONT></TD><TD BGCOLOR="white" CLASS="NavBarCell2"><FONT SIZE="-2">  <A HREF="../../../../index.html?org/archive/crawler/fetcher/FetchFTP.html" target="_top"><B>FRAMES</B></A>  &nbsp;&nbsp;<A HREF="FetchFTP.html" target="_top"><B>NO FRAMES</B></A>  &nbsp;&nbsp;<SCRIPT type="text/javascript">  <!--  if(window==top) {    document.writeln('<A HREF="../../../../allclasses-noframe.html"><B>All Classes</B></A>');  }  //--></SCRIPT><NOSCRIPT>  <A HREF="../../../../allclasses-noframe.html"><B>All Classes</B></A></NOSCRIPT></FONT></TD></TR><TR><TD VALIGN="top" CLASS="NavBarCell3"><FONT SIZE="-2">  SUMMARY:&nbsp;<A HREF="#nested_classes_inherited_from_class_org.archive.crawler.settings.ComplexType">NESTED</A>&nbsp;|&nbsp;<A HREF="#field_summary">FIELD</A>&nbsp;|&nbsp;<A HREF="#constructor_summary">CONSTR</A>&nbsp;|&nbsp;<A HREF="#method_summary">METHOD</A></FONT></TD><TD VALIGN="top" CLASS="NavBarCell3"><FONT SIZE="-2">DETAIL:&nbsp;<A HREF="#field_detail">FIELD</A>&nbsp;|&nbsp;<A HREF="#constructor_detail">CONSTR</A>&nbsp;|&nbsp;<A HREF="#method_detail">METHOD</A></FONT></TD></TR></TABLE><A NAME="skip-navbar_bottom"></A><!-- ======== END OF BOTTOM NAVBAR ======= --><HR>Copyright &copy; 2003-2006 Internet Archive. All Rights Reserved.</BODY></HTML>
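
The decision logic described for innerProcess and determinePassword can be sketched in plain Java. This is only an illustration under stated assumptions, not the FetchFTP implementation: the class FtpFetchSketch, the methods isFtp and chooseCommand, and the fallbackPassword parameter (standing in for the password attribute read from the settings system) are all hypothetical names, and no FTP connection is made. The result of the remote CD command is passed in as a boolean.

```java
import java.net.URI;

// Hypothetical sketch of the decisions the Javadoc describes; not Heritrix code.
class FtpFetchSketch {

    /** The FTP retrieval command that would be issued for a URI. */
    enum Command { LIST, RETR }

    /** Non-FTP URIs are ignored by the processor, so check the scheme first. */
    static boolean isFtp(URI uri) {
        return "ftp".equalsIgnoreCase(uri.getScheme());
    }

    /**
     * Returns the password embedded in the URI ("user:password@host") if there
     * is one; otherwise returns the fallback, which stands in for the
     * configured password attribute (an assumption of this sketch).
     */
    static String determinePassword(URI uri, String fallbackPassword) {
        String userInfo = uri.getUserInfo();
        if (userInfo != null) {
            int colon = userInfo.indexOf(':');
            if (colon >= 0) {
                return userInfo.substring(colon + 1);
            }
        }
        return fallbackPassword;
    }

    /**
     * A successful CD means the path is treated as a directory and listed
     * with LIST; a failed CD means it is treated as a file and fetched
     * with RETR.
     */
    static Command chooseCommand(boolean cdSucceeded) {
        return cdSucceeded ? Command.LIST : Command.RETR;
    }

    public static void main(String[] args) {
        URI withPassword = URI.create("ftp://alice:secret@ftp.example.org/pub/");
        URI anonymous = URI.create("ftp://ftp.example.org/pub/readme.txt");

        System.out.println(isFtp(withPassword));
        System.out.println(determinePassword(withPassword, "guest"));
        System.out.println(determinePassword(anonymous, "guest"));
        System.out.println(chooseCommand(true));
        System.out.println(chooseCommand(false));
    }
}
```

Keeping the CD outcome as an input keeps the sketch independent of any particular FTP client library; a real fetcher would obtain that value from the server's reply to the CD command.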
