searchengineguide.com.script~2

来自「垂直搜索的网络爬虫」· SCRIPT~2 代码 · 共 17 行

SCRIPT~2
17
字号
FIND_NODE	... <font face="Verdana, Arial, Helvetica, sans-serif" size="+1">	[next][END_OF_DOCUMENT]
STORE_TEXT	title	-1	-1	[next][END_OF_DOCUMENT]
FIND_NODE	... font face="Verdana, Arial, Helvetica, sans-serif" size="-2" color="#666666"	[next][END_OF_DOCUMENT]
SAVE_TEXT	$meta	-1	-1	[next][END_OF_DOCUMENT]
STORE_NODE	meta	$meta	[next][END_OF_DOCUMENT]
REGEXP	$meta	/(.+)\s+-\s+(.+)/	[next][END_OF_DOCUMENT]
STORE_ISODATE	iso-date	MMM dd',' yyyy	$2	[next][END_OF_DOCUMENT]
FIND_NODE	... </font>	[next][END_OF_DOCUMENT]
SAVE_POS	$content_start	[next][END_OF_DOCUMENT]
FIND_NODE	...  <td width="145" valign="top">	[GOTO_TASK	12][next]
FIND_NODE	...  <hr>	[next][END_OF_DOCUMENT]
SAVE_POS	$content_end	[next][END_OF_DOCUMENT]
STORE_TEXT	content	$content_start	$content_end	[next][END_OF_DOCUMENT]
STORE_LINKS	$content_start	$content_end	[next][END_OF_DOCUMENT]
END_OF_ARTICLE		[next][END_OF_DOCUMENT]
END_OF_ARTICLE		[next][END_OF_DOCUMENT]

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?