htmlcleanup2.pl

From "harvest is a robot that downloads HTML web pages" · PL code · 35 lines

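#!/usr/local/bin/perl
# things this script does:
#    1. removes consecutive <p>'s

$sawp = 0;    # true while the previous line was a lone <p> tag

while (<>) {
    # On each new input file: keep the original as FILE.2.bak and
    # send all further output to a fresh file with the original name.
    if ($ARGV ne $oldargv) {
        rename($ARGV, $ARGV . '.2.bak');
        open(ARGVOUT, ">$ARGV") or die "cannot write $ARGV: $!";
        select(ARGVOUT);
        $oldargv = $ARGV;
        $sawp = 0;    # do not carry <p> state over from the previous file
    }
    chomp;    # chomp, not chop: a last line without a newline stays intact
    if (/^\<[pP]\>$/) {
        # Print only the first <p> of a consecutive run.
        if (!$sawp) {
            print "<p>\n";
        }
        $sawp = 1;
    }
    else {
        $sawp = 0;
        print "$_\n";
    }
}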
