⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 regexpgroupselect.java

📁 java编写的OCR软件
💻 JAVA
字号:
package de.spieleck.app.jacson.select;

import org.apache.oro.text.regex.*;

import de.spieleck.config.ConfigNode;
import de.spieleck.config.ConfigVerify.Acceptor;

import de.spieleck.app.jacson.JacsonSelect;
import de.spieleck.app.jacson.JacsonRegistry;
import de.spieleck.app.jacson.SelectionResult;
import de.spieleck.app.jacson.JacsonConfigException;
import de.spieleck.app.jacson.util.RegExpUtil;
import de.spieleck.app.jacson.util.ConfigUtil;

/**
 * Select all the groups in the matching regexp.
 * @author fsn
 */
public class RegExpGroupSelect
    implements JacsonSelect, Acceptor
{
    /** oro-Regexp-Pattern */
    protected Pattern pattern;

    protected RegExpUtil rutil;

    protected String curChunk;

    protected SimpleResult result = new SimpleResult();

    protected MatchResult match = null;

    protected int matchCount = 0;

    protected PatternMatcherInput pmi;

    public RegExpGroupSelect() { }

    public void init(ConfigNode config, JacsonRegistry registry)
        throws JacsonConfigException
    {
        rutil = registry.getRegExpUtil();
        pattern = rutil.obtainPattern("RegExpExtract", config, registry);
        ConfigUtil.verify(config, this);
    }

    public boolean accept(ConfigNode node)
    {
        return rutil.accept(node);
    }

    public void setChunk(String chunk)
    {
        curChunk = chunk;
        match = null;
        pmi = new PatternMatcherInput(curChunk);
    }

    public SelectionResult getNextSelection()
    {
        if ( curChunk == null )
            return null;
        else
        {
            while ( true )
            {
                if ( match == null )
                {
                    PatternMatcher matcher = rutil.getMatcher();
                    if ( !matcher.contains(pmi, pattern) )
                        return null;
                    match = matcher.getMatch();
                    matchCount = 1;
                }
                int n = match.groups();
                while ( matchCount < n )
                {
                    int start = match.beginOffset(matchCount);
                    int end   = match.endOffset(matchCount);
                    String grp   = match.group(matchCount);
                    matchCount++;
                    if ( grp != null )
                    {
                        result.set(start, end, grp);
                        return result;
                    }
                }
                match = null;
            }
        }    
    }
}
//
//    Jacson - Text Filtering with Java.
//    Copyright (C) 2002 Frank S. Nestel (nestefan -at- users.sourceforge.net)
//
//    This library is free software; you can redistribute it and/or
//    modify it under the terms of the GNU Lesser General Public
//    License as published by the Free Software Foundation; either
//    version 2.1 of the License, or (at your option) any later version.
//
//    This library is distributed in the hope that it will be useful,
//    but WITHOUT ANY WARRANTY; without even the implied warranty of
//    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
//    Lesser General Public License for more details.
//
//    You should have received a copy of the GNU Lesser General Public
//    License along with this library; if not, write to the Free Software
//    Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA  02111-1307  USA
//

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -