htmlpaser.java

From "Jaoso News Article Publishing System 0.9.1final, architecture: Struts" · Java code · 43 lines

package jaoso.framework.util;

import java.util.ArrayList;
import java.util.regex.Matcher;
import java.util.regex.Pattern;


/**
 * Utility for extracting image URLs from HTML fragments.
 *
 * @author 边缘孤客 edgeloner@yahoo.com.cn
 * @since 2004-11-27
 */
public class HtmlPaser {
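    // Matches an <img> tag and captures its src value in group 1. The value may be
    // quoted or unquoted and must end in a dot plus a 3-4 letter extension (e.g. .jpg).
    private static final String REGEX_IMG = "<\\s*img\\s+[^>]*src\\s*=\\s*[\"']?([^\"'>\\s]+\\.[a-zA-Z]{3,4})[\"']?[^>]*>";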

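    /**
     * Collects the src values of all img tags in the given HTML,
     * skipping images already stored under the local upload directory.
     *
     * @param html HTML text to scan; must not be null
     * @return the matching URLs in document order (empty if none)
     */
    public static String[] getImgUrls(String html) {
        Pattern p = Pattern.compile(REGEX_IMG, Pattern.CASE_INSENSITIVE);
        Matcher m = p.matcher(html);
        ArrayList<String> tmp = new ArrayList<String>();

        while (m.find()) {
            String match = m.group(1);

            if (notLocal(match)) {
                tmp.add(match);
            }
        }

        // Copy into a correctly typed array; no cast or subList needed.
        return tmp.toArray(new String[0]);
    }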

    /**
     * @param match the src value of an img tag
     * @return true if the URL does not point into the local "pages/upload" directory
     */
    private static boolean notLocal(String match) {
        return !match.contains("pages/upload");
    }
}
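
The following is a minimal usage sketch, not part of the original file. It copies the same regular expression and the "pages/upload" filter into a standalone class so that it runs on its own. The class name `ImgUrlDemo`, the method name `remoteImgUrls`, and the sample HTML are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical demo class; it mirrors HtmlPaser.getImgUrls but returns a List.
class ImgUrlDemo {
    // Same pattern as REGEX_IMG in HtmlPaser, repeated here to keep the sketch self-contained.
    private static final Pattern IMG = Pattern.compile(
        "<\\s*img\\s+[^>]*src\\s*=\\s*[\"']?([^\"'>\\s]+\\.[a-zA-Z]{3,4})[\"']?[^>]*>",
        Pattern.CASE_INSENSITIVE);

    // Returns the src of every img tag whose URL is not under "pages/upload".
    public static List<String> remoteImgUrls(String html) {
        List<String> urls = new ArrayList<>();
        Matcher m = IMG.matcher(html);
        while (m.find()) {
            String src = m.group(1);
            if (!src.contains("pages/upload")) {
                urls.add(src);
            }
        }
        return urls;
    }

    public static void main(String[] args) {
        // Sample input: double-quoted, single-quoted upper-case, and unquoted src values.
        String html = "<p><img src=\"http://example.com/a.jpg\" alt=\"a\">"
            + "<IMG SRC='pages/upload/b.png'>"
            + "<img class=\"x\" src=http://example.com/c.gif></p>";
        // prints [http://example.com/a.jpg, http://example.com/c.gif]
        System.out.println(remoteImgUrls(html));
    }
}
```

The second image is dropped because its path contains "pages/upload", which the original code treats as already local. Note that the pattern only accepts src values ending in a 3-4 letter extension, so URLs with query strings or no extension are not returned.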
