📄 util.java

📁 jakarta-oro-2.0.8 正则表达式引擎源代码
💻 JAVA
📖 第 1 页 / 共 2 页
字号:
12 下一页
/* * $Id: Util.java,v 1.15 2003/11/07 20:16:25 dfs Exp $ * * ==================================================================== * The Apache Software License, Version 1.1 * * Copyright (c) 2000-2002 The Apache Software Foundation.  All rights * reserved. * * Redistribution and use in source and binary forms, with or without * modification, are permitted provided that the following conditions * are met: * * 1. Redistributions of source code must retain the above copyright *    notice, this list of conditions and the following disclaimer. * * 2. Redistributions in binary form must reproduce the above copyright *    notice, this list of conditions and the following disclaimer in *    the documentation and/or other materials provided with the *    distribution. * * 3. The end-user documentation included with the redistribution, *    if any, must include the following acknowledgment: *       "This product includes software developed by the *        Apache Software Foundation (http://www.apache.org/)." *    Alternately, this acknowledgment may appear in the software itself, *    if and wherever such third-party acknowledgments normally appear. * * 4. The names "Apache" and "Apache Software Foundation", "Jakarta-Oro"  *    must not be used to endorse or promote products derived from this *    software without prior written permission. For written *    permission, please contact apache@apache.org. * * 5. Products derived from this software may not be called "Apache"  *    or "Jakarta-Oro", nor may "Apache" or "Jakarta-Oro" appear in their  *    name, without prior written permission of the Apache Software Foundation. * * THIS SOFTWARE IS PROVIDED ``AS IS'' AND ANY EXPRESSED OR IMPLIED * WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES * OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE * DISCLAIMED.  IN NO EVENT SHALL THE APACHE SOFTWARE FOUNDATION OR * ITS CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, * SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT * LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF * USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND * ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, * OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT * OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF * SUCH DAMAGE. * ==================================================================== * * This software consists of voluntary contributions made by many * individuals on behalf of the Apache Software Foundation.  For more * information on the Apache Software Foundation, please see * <http://www.apache.org/>. */package org.apache.oro.text.regex;import java.util.*;/** * The Util class is a holder for useful static utility methods that can * be generically applied to Pattern and PatternMatcher instances. * This class cannot and is not meant to be instantiated. * The Util class currently contains versions of the split() and substitute() * methods inspired by Perl's split function and <b>s</b> operation * respectively, although they are implemented in such a way as not to * rely on the Perl5 implementations of the OROMatcher packages regular * expression interfaces.  They may operate on any interface implementations * conforming to the OROMatcher API specification for the PatternMatcher, * Pattern, and MatchResult interfaces. Future versions of the class may * include additional utility methods. * <p> * A grep method is not included for two reasons: * <ol> *     <li> The details of reading a line at a time from an input stream *          differ in JDK 1.0.2 and JDK 1.1, making it difficult to *          retain compatibility across both Java releases. *     <li> Grep style processing is trivial for the programmer to implement *          in a while loop.  Rarely does anyone want to retrieve all *          occurences of a pattern and then process them.  More often a *          programmer will retrieve pattern matches and process them as they *          are retrieved, which is more efficient than storing them all in a *          Vector and then accessing them. * </ol> * * @version @version@ * @since 1.0 * @see Pattern * @see PatternMatcher */public final class Util {  /**   * A constant passed to the {@link #substitute substitute()}   * methods indicating that all occurrences of a pattern should be    * substituted.   */  public static final int SUBSTITUTE_ALL = -1;  /**   * A constant passed to the {@link #split split()} methods   * indicating that all occurrences of a pattern should be used to   * split a string.   */  public static final int SPLIT_ALL = 0;  /**   * The default destructor for the Util class.  It is made private   * to prevent the instantiation of the class.   */  private Util() { }  /**   * Splits up a <code>String</code> instance and stores results as a   * <code>List</code> of substrings numbering no more than a specified   * limit.  The string is split with a regular expression as the delimiter.    * The <b>limit</b> parameter essentially says to split the   * string only on at most the first <b>limit - 1</b> number of pattern   * occurences.   * <p>   * This method is inspired by the Perl split() function and behaves    * identically to it when used in conjunction with the Perl5Matcher and   * Perl5Pattern classes except for the following difference:   * <ul><p>   * In Perl, if the split expression contains parentheses, the split()   * method creates additional list elements from each of the matching   * subgroups in the pattern.  In other words:   * <ul><p>   * <code>split(list, "/([,-])/", "8-12,15,18", Util.SPLIT_ALL)</code></ul>   * <p> produces the list containing:   * <ul><p><code> { "8", "-", "12", ",", "15", ",", "18" } </code> </ul>   * <p> The OROMatcher split method does not follow this behavior.  The   * following list would be produced by OROMatcher:   * <ul><p><code> { "8", "12",  "15", "18" } </code> </ul>   * <p> To obtain the Perl behavior, use   * {@link org.apache.oro.text.perl.Perl5Util#split}.   * </ul>   * <p>   * @param results A Collection to which the split results are appended.   *         After the method returns, it contains the substrings of the input   *         that occur between the regular expression delimiter occurences.   *         The input will not be split into any more substrings than the   *         specified <code>limit</code>.  A way of thinking of this is that   *         only the first <code>limit - 1</code> matches of the delimiting   *         regular expression will be used to split the input.   * @param matcher The regular expression matcher to execute the split.   * @param pattern The regular expression to use as a split delimiter.   * @param input   The <code>String</code> to split.   * @param limit  The limit on the number of resulting split elements.   *               Values <= 0 produce the same behavior as using the   *               <b>SPLIT_ALL</b> constant which causes the limit to be    *               ignored and splits to be performed on all occurrences of   *               the pattern.  You should use the <b>SPLIT_ALL</b> constant   *               to achieve this behavior instead of relying on the default   *               behavior associated with non-positive limit values.   * @since 2.0   */  public static void split(Collection results, PatternMatcher matcher,			   Pattern pattern, String input, int limit)  {    int beginOffset;    MatchResult currentResult;    PatternMatcherInput pinput;    pinput = new PatternMatcherInput(input);    beginOffset = 0;    while(--limit != 0 && matcher.contains(pinput, pattern)) {      currentResult = matcher.getMatch();      results.add(input.substring(beginOffset,				  currentResult.beginOffset(0)));      beginOffset = currentResult.endOffset(0);    }    results.add(input.substring(beginOffset, input.length()));  }  /**   * Splits up a <code>String</code> instance and stores results as a   * <code>Collection</code> of all its substrings using a regular expression   * as the delimiter.   * This method is inspired by the Perl split() function and behaves    * identically to it when used in conjunction with the Perl5Matcher and   * Perl5Pattern classes except for the following difference:   * <p>   * <ul>   * In Perl, if the split expression contains parentheses, the split()   * method creates additional list elements from each of the matching   * subgroups in the pattern.  In other words:   * <ul><p><code>split(list, "/([,-])/", "8-12,15,18")</code></ul>   * <p> produces the list containing:    * <ul><p><code> { "8", "-", "12", ",", "15", ",", "18" } </code> </ul>   * <p> The OROMatcher split method does not follow this behavior.  The   * following list would be produced by OROMatcher:   * <ul><p><code> { "8", "12",  "15", "18" } </code> </ul>   * <p> To obtain the Perl behavior, use   * {@link org.apache.oro.text.perl.Perl5Util#split}.   * </ul>   * <p>   * This method is identical to calling:   * <blockquote><pre>   * split(matcher, pattern, input, Util.SPLIT_ALL);   * </pre></blockquote>   * <p>   * @param results A <code>Collection</code> to which all the substrings of   *         the input that occur between the regular expression delimiter   *         occurences are appended.   * @param matcher The regular expression matcher to execute the split.   * @param pattern The regular expression to use as a split delimiter.   * @param input   The <code>String</code> to split.   * @since 2.0   */  public static void split(Collection results,  PatternMatcher matcher,			   Pattern pattern, String input)  {    split(results, matcher, pattern, input, SPLIT_ALL);  }  /**   * Splits up a <code>String</code> instance into strings contained in a   * <code>Vector</code> of size not greater than a specified limit.  The   * string is split with a regular expression as the delimiter.    * The <b>limit</b> parameter essentially says to split the   * string only on at most the first <b>limit - 1</b> number of pattern   * occurences.   * <p>   * This method is inspired by the Perl split() function and behaves    * identically to it when used in conjunction with the Perl5Matcher and   * Perl5Pattern classes except for the following difference:
12 下一页
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -