htmlcleanup2.pl
来自「harvest是一个下载html网页得机器人」· PL 代码 · 共 35 行
PL
35 行
#!/usr/local/bin/perl# things this script does:# 1. removes consecutive <p>'s$sawp=0;$sawnonp=0;while(<>) { if($ARGV ne $oldargv) { rename($ARGV, $ARGV . '.2.bak'); open(ARGVOUT, ">$ARGV"); select(ARGVOUT); $oldargv = $ARGV; } chop; if(/^\<[pP]\>$/) { if(!$sawp) { print "<p>\n"; } $sawp=1; } else { $sawp=0; print "$_\n"; }}
⌨️ 快捷键说明
复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?