1、锁定某个主题抓取; 2、能够产生日志文本文件,格式为:时间戳(timestamp)、URL; 3、抓取某一URL时最多允许建立2个连接(注意:本地作网页解析的线程数则不限) 4、遵守文明蜘蛛规则:必须分析robots.txt文件和meta tag有无限制;一个线程抓完一个网页后要sleep 2秒钟; 5、能对HTML网页进行解析,提取出链接URL,能判别提取的URL是否已处理过,不重复解析已crawl过的网页; 6、能够对spider/crawler程序的一些基本参数进行设置,包括:抓取深度(depth)、种子URL等; 7、使用User-agent向服务器表明自己的身份; 8、产生抓取统计信息:包括抓取速度、抓取完成所需时间、抓取网页总数;重要变量和所有类、方法加注释; 9、请遵守编程规范,如类、方法、文件等的命名规范, 10、可选:GUI图形用户界面、web界面,通过界面管理spider/crawler,包括启停、URL增删等
标签: 日志
上传时间: 2013-12-22
上传用户:wang5829
This practical, roll-up-your-sleeves guide distills years of ingenious XML hacking into a complete set of tips, tricks, and tools for web developers, system administrators, and programmers who want to leverage the untapped power of XML. If you want more than the average XML user--to explore and experiment, discover clever shortcuts, and show off just a little (and have fun in the process)--this invaluable book is a must-have.
标签: roll-up-your-sleeves practical ingenious distills
上传时间: 2015-11-01
上传用户:lili123
Student status management system is development two aspects that typical information management system, IMS( MIS), its development includes primarily the background database creates with support and the front end applies the procedure.Creates to rise to the former request the consistency of data is strong with the integrity, the library that the safeness of data like.But request the latter very much to apply the procedure function complete, easy usage etc. characteristics.
标签: management development information Student
上传时间: 2015-11-01
上传用户:1101055045
自从微软推出16位的Windows操作系统起,此后每种版本的Windows操作系统都非常依赖于动态链接库(DLL)中的函数和数据,实际上Windows操作系统中几乎所有的内容都由DLL以一种或另外一种形式代表着,例如显示的字体和图标存储在GDI DLL中、显示Windows桌面和处理用户的输入所需要的代码被存储在一个User DLL中、Windows编程所需要的大量的API函数也被包含在Kernel DLL中。
上传时间: 2013-12-14
上传用户:zhuimenghuadie
<%@ LANGUAGE="VBSCRIPT" %> <!--#include file="conn.asp" --> <% ProductClass_2=request("ProductClass_2") set rs=server.createobject("adodb.recordset") sqltext="select * from Product" if request("Product_Name")<>"" then sqltext=sqltext &" where Product_Name like %"& request("Product_Name") &"% " else sqltext=sqltext &" where Product_Name like %"& "" &"% " end if if request("Product_Class")<>"" then sqltext=sqltext &" and Class_1 like %"& request("Product_Class") &"% " end if
标签: ProductClass lt LANGUAGE VBSCRIPT
上传时间: 2013-11-25
上传用户:wl9454
<%@ LANGUAGE="VBSCRIPT" %> <!--#include file="conn.asp" --> <% Id=Session("LoginSuccess") ProductList = Session("ProductList") Products = Split(Request("cpbm"), ", ") For I=0 To UBound(Products) PutToShopBag Products(I), ProductList Next Session("ProductList") = ProductList ProductList = Session("ProductList") If Len(ProductList) =0 Then Response.Redirect "Nothing.asp" response.end end if
标签: lt LANGUAGE VBSCRIPT Session
上传时间: 2013-12-09
上传用户:huql11633
<%@ LANGUAGE="VBSCRIPT" %> <!--#include file="conn.asp" --> <% Id=Session("LoginSuccess") ProductList = Session("ProductList") Products = Split(Request("cpbm"), ", ") For I=0 To UBound(Products) PutToShopBag Products(I), ProductList Next Session("ProductList") = ProductList ProductList = Session("ProductList") If Len(ProductList) =0 Then Response.Redirect "Nothing.asp" response.end end if
标签: lt LANGUAGE VBSCRIPT Session
上传时间: 2014-08-18
上传用户:小草123
A translator that converts Qt Designer UI files into SWT java classes. Use the power of Qt Designer to generate your user interface and then automatically generate SWT java source code.
标签: Designer translator converts classes
上传时间: 2014-10-30
上传用户:zhanditian
design LP,HP,B S digital Butterworth and Chebyshev filter. All array has been specified internally,so user only need to input f1,f2,f3,f4,fs(in hz), alpha1,alpha2(in db) and iband (to specify the type of to design). This program output hk(z)=bk(z)/ak(z),k=1,2,..., ksection and the freq.
标签: Butterworth internally Chebyshev specified
上传时间: 2015-11-08
上传用户:253189838
利用VB开发通信程序主要的方法有两种,一是利用VB本身提供的控件(CONTRALS),另一种方法是利用WINDOWS API应用程序接口,WINDOWS API 主要提供了三个动态连接库KERNEL.EXE、USER.EXE、GDI.EXE供开发人员调用
标签: 通信程序
上传时间: 2015-11-09
上传用户:qazxsw