parseline.html

来自「java类库详细讲解」· HTML 代码 · 共 150 行

HTML
150
字号
<HTML>
<HEAD>
<META http-equiv="Content-Type" content="text/html; charset=UTF-8">
<TITLE>Parsing Character-Separated Data with a Regular Expression
(Java Developers Almanac Example)
</TITLE>
<META CONTENT="Patrick Chan" NAME="AUTHOR">
<META CONTENT="Code Examples from The Java Developers Almanac 1.4" NAME="DESCRIPTION">
<META CONTENT="Addison-Wesley/Patrick Chan" NAME="OWNER">
<META CONTENT="3/20/02" NAME="revision">
<STYLE TYPE="text/css">
<!--     BODY CODE  {font-family: Courier, Monospace;           font-size: 11pt}    TABLE, BODY          {font-family: Verdana, Arial, Helvetica, sans-serif;           font-size: 10pt}    PRE   {font-family: Courier, Monospace;           font-size: 10pt}    H3    {font-family: Verdana, Arial, Helvetica, sans-serif;           font-size: 11pt}    A.eglink {text-decoration: none}    A:hover.eglink {text-decoration: underline}    -->
</STYLE>
</HEAD>
<BODY>
<TABLE CELLSPACING="0" CELLPADDING="0" BORDER="0">
<TR>
<TD rowspan="3"><A HREF="/?l=ex"><IMG BORDER="0" ALIGN="BOTTOM" HSPACE="10" SRC="/egs/almanac14a.jpg"></A></TD><TD VALIGN="top"><font face="Times" size="6"><b>The Java Developers Almanac 1.4</b></font>
<br>
        Order this book from <a href="/cgi-bin/scripts/redirect.pl?l=ex&url=http://www.amazon.com/exec/obidos/ASIN/0201752808/xeo">Amazon</a>.
    </TD>
</TR>
<TR>
<TD align="right" valign="bottom">
<FORM method="get" action="/cgi-bin/search/find.pl">
<INPUT size="25" name="words" type="text"><INPUT value="Search" type="submit">
</FORM>
</TD>
</TR>
</TABLE>
<HR color="#6666cc">
<DIV ALIGN="LEFT">
<A HREF="/">Home</A>
    &gt;
    <A HREF="../index.html">List of Packages</A>
    &gt;
    <B><A HREF="../java.util.regex/pkg.html">java.util.regex</A></B><font color="#666666" SIZE="-2">
        &nbsp;[26 examples]
        </font>
        &gt;
        <B><A HREF="../java.util.regex/pkg.html#Tokenizing">Tokenizing</A></B><font color="#666666" SIZE="-2">
            &nbsp;[2 examples]
            </font>
</DIV><P>
  <h3>
    e431.  
    Parsing Character-Separated Data with a Regular Expression</h3>

A line from a flat-file is typically formatted using a separator
character to separate the fields.  If the separator is simply a comma,
tab, or single character, the <code>StringTokenizer</code> class can be used to
parse the line into fields.  If the separator is more complex (e.g., a
space after a comma), a regular expression is needed.
<code>String.split()</code> conveniently parses a line using a regular
expression to specify the separator.

<P> <code>String.split()</code> returns only the nondelimiter strings.  To
obtain the delimiter strings, see <a href="../java.util.regex/Tokenize.html" class="eglink"><font size="-1"><b>e432</b> Parsing a String into Tokens Using a Regular Expression</font></a>.
 
<P> Note: The <code>StringTokenizer</code> does not conveniently handle
empty fields properly.  For example, given the line <code>a,,b</code>, rather
than return three fields (the second being empty), the
<code>StringTokenizer</code> returns two fields, discarding the empty field.
<code>String.split()</code> properly handles empty fields.


<pre>
    // Parse a comma-separated string
    String inputStr = <font color="#0066ff"><i>"a,,b"</i></font>;
    String patternStr = <font color="#0066ff"><i>","</i></font>;
    String[] fields = inputStr.split(patternStr);
    // ["a", "", "b"]
    
    // Parse a line whose separator is a comma followed by a space
    inputStr = <font color="#0066ff"><i>"a, b, c,d"</i></font>;
    patternStr = <font color="#0066ff"><i>", "</i></font>;
    fields = inputStr.split(patternStr, -1);
    // ["a", "b", "c,d"]
    
    // Parse a line with and's and or's
    inputStr = <font color="#0066ff"><i>"a, b, and c"</i></font>;
    patternStr = <font color="#0066ff"><i>"[, ]+(and|or)*[, ]*"</i></font>;
    fields = inputStr.split(patternStr, -1);
    // ["a", "b", "c"]
</pre>
<P><table width="600" CELLSPACING="0" CELLPADDING="2" BORDER="0">
<tr>
<td bgcolor="#6666cc" align="center"><font color="#ffffff">
            &nbsp;Related Examples
        </font></td>
</tr>
</table>


e432. <a class="eglink" href="Tokenize.html?l=rel">
    Parsing a String into Tokens Using a Regular Expression
</a>
<br>


<table width="600" CELLSPACING="0" CELLPADDING="2" BORDER="0">
<tr>
<td align="left">
<br>
        See also: 
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Flags">
    Flags
</a>&nbsp;&nbsp;

<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Groups">
    Groups
</a>&nbsp;&nbsp;

<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Lines">
    Lines
</a>&nbsp;&nbsp;

<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Paragraphs">
    Paragraphs
</a>&nbsp;&nbsp;

<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Searching%20and%20Replacing">
    Searching and Replacing
</a>&nbsp;&nbsp;

</td>
</tr>
</table>

<br>

<br>
<FONT FACE="Verdana, Arial, Helvetica, sans-serif" SIZE="0">
&copy; 2002 Addison-Wesley.
</FONT>
</BODY>
</HTML>

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?