urlpredicate.java
From "A Web crawler (robot" · Java code · 48 lines
/*
 * WebSPHINX web crawling toolkit
 * Copyright (C) 1998,1999 Carnegie Mellon University
 *
 * This library is free software; you can redistribute it
 * and/or modify it under the terms of the GNU Library
 * General Public License as published by the Free Software
 * Foundation, version 2.
 *
 * WebSPHINX homepage: http://www.cs.cmu.edu/~rcm/websphinx/
 */
package websphinx.workbench;

import websphinx.*;

public class URLPredicate implements LinkPredicate, PagePredicate {
    Pattern pattern;

    public URLPredicate (Pattern pattern) {
        this.pattern = pattern;
    }
    public boolean equals (Object object) {
        if (! (object instanceof URLPredicate))
            return false;
        URLPredicate p = (URLPredicate)object;
        return p.pattern.equals (pattern);
    }

    public Pattern getPattern () {
        return pattern;
    }

    public void connected (Crawler crawler) {}
    public void disconnected (Crawler crawler) {}

    public boolean shouldVisit (Link link) {
        return pattern.found (link.getURL().toString());
    }
    public boolean shouldActOn (Page page) {
        try {
            return pattern.found (page.getOrigin().getURL().toString());
        } catch (NullPointerException e) {
            return false;
        }
    }
}