⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 filtfrontback.java

📁 用来为垂直搜索引擎抓取数据的采集系统
💻 JAVA
字号:
/*
 * *****************************************************
 * Copyright (c) 2005 IIM Lab. All  Rights Reserved.
 * Created by xuehao at 2005-10-12
 * Contact: zxuehao@mail.ustc.edu.cn
 * *****************************************************
 */

package org.indigo.filters;

/**
 * 此类实现了Filter接口,主要是去掉成对的><标签。
 * @author wbz
 *
 */
public class FiltFrontBack implements Filter
{
    private String itsFront=null, itsBack=null;

    public FiltFrontBack()
    {
        itsFront = "<";
        itsBack = ">";
    }
    public FiltFrontBack( String front, String back )
    {
        if( front==null )
            itsFront = "<";
        else
        	itsFront = front;
        itsBack = back;
    }
    public String filt(String str)
    {
        int i=-1,sz=0;
        boolean bIgnore=false;
        char chs[],ch;

        str = str.trim();
        StringBuffer buf=new StringBuffer();
        i = 0;
        chs = str.toCharArray();
        sz = str.length();
        for( i=0; i<sz; i++ )
        {
            ch = chs[i];
            if( ch=='<' )
            {
                bIgnore = true;
            }
            if( !bIgnore )
            {
                buf.append( ch );
            }
            if( ch=='>' )
            {
                bIgnore = false;
            }
        }
        str = buf.toString();
        return str;
    }

}

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -