⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 regexpsubstfilter.java

📁 java编写的OCR软件
💻 JAVA
字号:
package de.spieleck.app.jacson.filter;

import org.apache.oro.text.regex.*;

import de.spieleck.config.ConfigNode;
import de.spieleck.config.ConfigVerify.Acceptor;

import de.spieleck.app.jacson.JacsonException;
import de.spieleck.app.jacson.JacsonConfigException;
import de.spieleck.app.jacson.JacsonRegistry;
import de.spieleck.app.jacson.util.RegExpUtil;
import de.spieleck.app.jacson.util.ConfigUtil;

/**
 * Filter to perform regexp based subsitiution (interpolated $i)
 * on a chunk.
 * @author fsn
 */
public class RegExpSubstFilter
    extends FilterBase
    implements Acceptor
{
    public final static String NSUBST_NODE = "numsubst";
    public final static String SUBST_NODE = "substitution";
    public final static String DEBUG_NODE = "debug";

    /** oro-Regexp-Pattern */
    protected Pattern pattern;

    /** Subsitution-Pattern */
    protected Substitution subst;

    /** limit the number of Substitutions*/
    protected int numSubs;

    protected RegExpUtil rutil;

    protected boolean debug;

    public RegExpSubstFilter() { }

    public void init(ConfigNode config, JacsonRegistry registry)
        throws JacsonConfigException
    {
        rutil = registry.getRegExpUtil();
        pattern = rutil.obtainPattern("RegExpSubstFilter", config, registry);
        String h = config.getString(SUBST_NODE, "");
        subst = new Perl5Substitution(h);
        numSubs = config.getInt(NSUBST_NODE, 0);
        if ( numSubs <= 0 )
            numSubs = Util.SUBSTITUTE_ALL;
        debug = config.getBoolean(DEBUG_NODE, false);
        if ( debug )
            System.err.println("-- "+NSUBST_NODE+"="+numSubs+" <"+pattern+">");
        ConfigUtil.verify(config, this);
    }

    public boolean accept(ConfigNode node)
    {
        String name = node.getName();
        return NSUBST_NODE.equals(name)
            || SUBST_NODE.equals(name)
            || DEBUG_NODE.equals(name)
            || rutil.accept(node);
    }

    public void putChunk(String chunk)
        throws JacsonException
    {
        if ( chunk == null )
            drain.putChunk(null);
        else
        {
            PatternMatcher matcher = rutil.getMatcher();
            String res = Util.substitute(matcher, pattern, subst, chunk, numSubs);
            if ( debug )
            {
                System.err.println("<<"+chunk);
                System.err.println("=="+matcher.contains(chunk, pattern));
                System.err.println(">>"+res);
            }
            drain.putChunk(res);
        }
    }
}
//
//    Jacson - Text Filtering with Java.
//    Copyright (C) 2002 Frank S. Nestel (nestefan -at- users.sourceforge.net)
//
//    This library is free software; you can redistribute it and/or
//    modify it under the terms of the GNU Lesser General Public
//    License as published by the Free Software Foundation; either
//    version 2.1 of the License, or (at your option) any later version.
//
//    This library is distributed in the hope that it will be useful,
//    but WITHOUT ANY WARRANTY; without even the implied warranty of
//    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
//    Lesser General Public License for more details.
//
//    You should have received a copy of the GNU Lesser General Public
//    License along with this library; if not, write to the Free Software
//    Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA  02111-1307  USA
//

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -