📄 crawljob.jsp

📁 A crawler, best used in combination with Lucene
💻 JSP
📖 Page 1 of 3
            </tr>
            <tr>
                <td>&nbsp;</td>
            </tr>
        </table>

        <table cellspacing="0" width=750>
            <tr class="headerrow">
                <th width=70>
                    TLD
                </th>
                <th width=100>
                    Hosts
                </th>
                <th>
                    Documents
                </th>
                <th>
                    Data
                </th>
            </tr>
            <%
                alt = true;
                scd = summary.getReverseSortedCopy(summary.getTldDistribution());
                for (Iterator i = scd.keySet().iterator(); i.hasNext();) {
                    Object key = i.next();
                    LongWrapper lw = (LongWrapper)scd.get(key);
                    long count = lw == null ? 0 : lw.longValue;
                    long displaybarwidth = 0;
                    double per = ((double)count) / summary.getTotalTldDocuments();

                    long hostsPerTld = summary.getHostsPerTld((String)key);
                    double perHost = ((double)hostsPerTld) / summary.getTotalHosts();

                    if (summary.getTotalTldDocuments() > 1) {
                        displaybarwidth = (long)(per * MAX_BAR_WIDTH);
                    }
                    if (displaybarwidth == 0) {
                        displaybarwidth = 1;
                    }

                    String percent = percentFormat.format(100 * per);
                    String percentHost = percentFormat.format(100 * perHost);
            %>
                    <tr <%=alt?"bgcolor=#EEEEFF":""%>>
                        <td nowrap>
                            <a style="text-decoration: none;" href="<%=request.getContextPath()%>/logs.jsp?job=<%=cjob.getUID()%>&log=crawl.log&mode=regexpr&regexpr=^[^ ].*<%=(String)key%>&grep=true"><%=(String)key%></a>&nbsp;
                        </td>
                        <td>
                            <%= hostsPerTld %> &nbsp; <span class=percent>(<%= percentHost %>%)</span>
                        </td>
                        <td nowrap>
                            <img src="<%=request.getContextPath()%>/images/blue.jpg" height="10" width="<%=displaybarwidth%>"> <%=count%> &nbsp;
                            <span class=percent>(<%= percent %>%)</span>
                        </td>
                        <td align="right" nowrap>
                            <%=ArchiveUtils.formatBytesForDisplay(summary.getBytesPerTld((String)key))%>&nbsp;
                        </td>
                    </tr>
            <%
                    alt = !alt;
                }
            %>
            <tr class="totalrow">
                <td><b>Total</b></td>
                <td><%= summary.getTotalHosts() %></td>
                <td><%= summary.getTotalTldDocuments() %> &nbsp; </td>
                <td align="right" nowrap>
                    <%= ArchiveUtils.formatBytesForDisplay(summary.getTotalTldSize()) %>&nbsp;
                </td>
            </tr>
        </table>

        <p>
            <br>
        </p>
        <table width=750>
            <tr>
                <td valign="center" ><img
                src="<%=request.getContextPath()%>/images/blue.jpg" height="1" width="40"></td>
                <td align="center"><i>DNS</i></td>
                <td valign="center" ><img
                src="<%=request.getContextPath()%>/images/blue.jpg" height="1" width="660"></td>
            </tr>
        </table>

        <table cellspacing="0" width=750>
            <tr class="headerrow">
                <th>
                    Status code
                </th>
                <th width="200" colspan="2">
                    Documents
                </th>
            </tr>
            <%
                alt = true;
                scd =
                    summary.getReverseSortedCopy(summary.getDnsStatusCodeDistribution());
                for (Iterator i = scd.keySet().iterator(); i.hasNext();) {
                    Object key = i.next();
                    long count = ((LongWrapper)scd.get(key)).longValue;
                    long displaybarwidth = 0;
                    long barwidthadjust = 3;
                    double per = ((double)count) / summary.getTotalDnsStatusCodeDocuments();
                    if (summary.getTotalDnsStatusCodeDocuments() / barwidthadjust > 0) {
                        displaybarwidth = (long)(per * MAX_BAR_WIDTH);
                    }
                    if (displaybarwidth == 0) {
                        displaybarwidth = 1;
                    }

                    String percent = percentFormat.format(100 * per);
            %>
                    <tr <%=alt?"bgcolor=#EEEEFF":""%>>
                        <td nowrap>
                            <a style="text-decoration: none;" href="<%=request.getContextPath()%>/logs.jsp?job=<%=cjob.getUID()%>&log=crawl.log&mode=regexpr&regexpr=^.{24}\s*<%=(String)key%>&grep=true">
                                <%=CrawlURI.fetchStatusCodesToString(Integer.parseInt((String)key))%>
                            </a>&nbsp;
                        </td>
                        <td colspan="2" nowrap>
                            <img src="<%=request.getContextPath()%>/images/blue.jpg" height="10" width="<%=displaybarwidth%>"> <%=count%> &nbsp;
                            <span class=percent>(<%= percent %>%)</span>
                        </td>
                    </tr>
            <%
                    alt = !alt;
                }
            %>
            <tr class="totalrow">
                <td><b>Total:</b></td>
                <td><%= summary.getTotalDnsStatusCodeDocuments() %> &nbsp; </td>
                <td>&nbsp;</td>
            </tr>
            <tr>
                <td>&nbsp;</td>
            </tr>
            <tr class="headerrow">
                <th width="100">
                    MIME type
                </th>
                <th width="200">
                    Documents
                </th>
                <th>
                    Data
                </th>
            </tr>
            <%
                alt = true;
                fd = summary.getReverseSortedCopy(summary.getDnsMimeDistribution());
                for (Iterator i = fd.keySet().iterator(); i.hasNext();) {
                    Object key = i.next();
                    long count = ((LongWrapper)fd.get(key)).longValue;
                    long displaybarwidth = 0;
                    double per = ((double)count) / summary.getTotalDnsMimeTypeDocuments();
                    if (summary.getTotalMimeTypeDocuments() / 6 > 0) {
                        displaybarwidth = (long)(per * MAX_BAR_WIDTH);
                    }
                    if (displaybarwidth == 0) {
                        displaybarwidth = 1;
                    }

                    String percent = percentFormat.format(100 * per);
            %>
                    <tr <%=alt?"bgcolor=#EEEEFF":""%>>
                        <td nowrap>
                            <a style="text-decoration: none;" href="<%=request.getContextPath()%>/logs.jsp?job=<%=cjob.getUID()%>&log=crawl.log&mode=regexpr&regexpr=^[^ ].*<%=(String)key%>&grep=true"><%=key%></a>&nbsp;&nbsp;
                        </td>
                        <td nowrap>
                            <img src="<%=request.getContextPath()%>/images/blue.jpg" height="10" width="<%=displaybarwidth%>"> <%=count%> &nbsp;
                            <span class=percent>(<%= percent %>%)</span>
                        </td>
                        <td align="right" nowrap>
                            <%=ArchiveUtils.formatBytesForDisplay(summary.getBytesPerMimeType((String)key))%>&nbsp;
                        </td>
                    </tr>
            <%
                    alt = !alt;
                }
            %>
            <tr class="totalrow">
                <td><b>Total</b></td>
                <td>
                    <%= summary.getTotalDnsMimeTypeDocuments() %> &nbsp;
                </td>
                <td align="right" nowrap>
                    <%= ArchiveUtils.formatBytesForDisplay(summary.getTotalDnsMimeSize()) %>&nbsp;
                </td>
            </tr>

            <tr>
                <td>&nbsp;</td>
            </tr>

            <tr class="headerrow">
                <th>
                    Hosts
                </th>
                <th>
                    Documents
                </th>
                <th>
                    Data
                </th>
            </tr>
            <%
                alt = true;
                hd = summary.getReverseSortedCopy(summary.getHostsDnsDistribution());
                for (Iterator i = hd.keySet().iterator(); i.hasNext();) {
                    Object key = i.next();
                    LongWrapper lw = (LongWrapper)hd.get(key);
                    long count = lw == null ?
                        0 : lw.longValue;
                    long displaybarwidth = 0;
                    double per = ((double)count) / summary.getTotalHostDnsDocuments();
                    if (summary.getTotalHostDnsDocuments() > 1) {
                        displaybarwidth = (long)(per * MAX_BAR_WIDTH);
                    }
                    if (displaybarwidth == 0) {
                        displaybarwidth = 1;
                    }

                    String percent = percentFormat.format(100 * per);
            %>
                    <tr <%=alt?"bgcolor=#EEEEFF":""%>>
                        <td nowrap>
                            <a style="text-decoration: none;" href="<%=request.getContextPath()%>/logs.jsp?job=<%=cjob.getUID()%>&log=crawl.log&mode=regexpr&regexpr=^[^ ].*<%=(String)key%>&grep=true"><%=(String)key%></a>
                        </td>
                        <td nowrap>
                            <img src="<%=request.getContextPath()%>/images/blue.jpg" height="10" width="<%=displaybarwidth%>"> <%=count%> &nbsp;
                            <span class=percent>(<%= percent %>%)</span>
                        </td>
                        <td align="right" nowrap>
                            <%=ArchiveUtils.formatBytesForDisplay(summary.getBytesPerHost((String)key))%>&nbsp;
                        </td>
                    </tr>
            <%
                    alt = !alt;
                }
            %>
            <tr class="totalrow">
                <td><b>Total</b></td>
                <td><%= summary.getTotalDnsHostDocuments() %> &nbsp; </td>
                <td align="right" nowrap>
                    <%= ArchiveUtils.formatBytesForDisplay(summary.getTotalDnsHostSize()) %>&nbsp;
                </td>
            </tr>
            <tr>
                <td>&nbsp;</td>
            </tr>
        </table>
<%
    } else if (stats == null) {
        out.println("<b>No statistics associated with job.</b><p><b>Job status:</b> " + cjob.getStatus());
        if (cjob.getErrorMessage() != null) {
            out.println("<p><pre><font color='red'>"+cjob.getErrorMessage()+"</font></pre>");
        }
    } else {
%>
        <table border="0">
            <tr>
                <td valign="top">
                    <table border="0" cellspacing="0" cellpadding="0" >
                        <tr>
                            <td>
                                <b>Job name:</b>&nbsp;
                            </td>
                            <td>
                                <%=cjob.getJobName()%>
                            </td>
                        </tr>
                        <tr>
                            <td>
                                <b>Status:</b>&nbsp;
                            </td>
                            <td>
                                <%=cjob.getStatus()%>
                            </td>
                        </tr>
                        <tr>
                            <td>
                                <b>Time:</b>&nbsp;
                            </td>
                            <td>
                                <%
                                    long time = (stats.getCrawlerTotalElapsedTime())/1000;
                                    if(time>3600)
                                    {
                                        //got hours.
                                        out.println(time/3600 + " h., ");
                                        time = time % 3600;
                                    }

                                    if(time > 60)
