📄 parseline.htm
字号:
<HTML>
<HEAD>
<META http-equiv="Content-Type" content="text/html; charset=UTF-8">
<TITLE>Parsing Character-Separated Data with a Regular Expression (Java Developers Almanac Example)
</TITLE>
<META CONTENT="Patrick Chan" NAME="AUTHOR">
<META CONTENT="Code Examples from The Java Developers Almanac 1.4" NAME="DESCRIPTION">
<META CONTENT="Addison-Wesley/Patrick Chan" NAME="OWNER">
<META CONTENT="3/20/02" NAME="revision">
<META CONTENT="no-cache" HTTP-EQUIV="Pragma">
<LINK href="/almanac.css" media="screen" type="text/css" rel="stylesheet">
</HEAD>
<BODY>
<TABLE CELLSPACING="0" CELLPADDING="0" BORDER="0">
<TR>
<TD></TD>
</TR>
</TABLE>
<br>
<TABLE CELLSPACING="0" CELLPADDING="0" BORDER="0">
<TR>
<TD></TD>
</TR>
<TR>
<TD rowspan="3"><A HREF="/?l=ex"><IMG BORDER="0" ALIGN="BOTTOM" HSPACE="10" SRC="/egs/almanac14a.jpg"></A></TD><TD VALIGN="top">
<h1>The Java Developers Almanac 1.4</h1>
<br>
Order this book from <a href="/cgi-bin/scripts/redirect.pl?l=ex&url=http://www.amazon.com/exec/obidos/ASIN/0201752808/xeo">Amazon</a>.
</TD>
</TR>
<TR>
<TD align="right" valign="bottom">
<FORM method="get" action="/cgi-bin/search/find.pl">
<INPUT size="25" name="words" type="text"><INPUT value="Search" type="submit">
</FORM>
</TD>
</TR>
</TABLE>
<HR color="#6666cc">
<TABLE CELLSPACING="0" CELLPADDING="0" BORDER="0">
<TR>
<TD valign="top"><script type="text/javascript">
<!--
google_ad_client = "pub-6001183370374757";
google_ad_width = 120;
google_ad_height = 600;
google_ad_format = "120x600_as";
google_ad_channel = "4777242811";
google_ad_type = "text_image";
google_color_border = "FFFFFF";
google_color_bg = "FFFFFF";
google_color_link = "6666CC";
google_color_url = "6666CC";
google_color_text = "000000";
//--></script><script src="http://pagead2.googlesyndication.com/pagead/show_ads.js" type="text/javascript"></script></TD><TD> </TD><TD valign="top">
<DIV ALIGN="LEFT">
<A HREF="/">Home</A>
>
<A HREF="../index.html">List of Packages</A>
>
<A HREF="../java.util.regex/pkg.html">java.util.regex</A><font color="#666666" class="xsmall-font">
[26 examples]
</font>
>
<B><A HREF="../java.util.regex/pkg.html#Tokenizing">Tokenizing</A></B><font color="#666666" class="xsmall-font">
[2 examples]
</font>
</DIV><P>
<h3>e431. Parsing Character-Separated Data with a Regular Expression</h3>
A line from a flat-file is typically formatted using a separator
character to separate the fields. If the separator is simply a comma,
tab, or single character, the <code>StringTokenizer</code> class can be used to
parse the line into fields. If the separator is more complex (e.g., a
space after a comma), a regular expression is needed.
<code>String.split()</code> conveniently parses a line using a regular
expression to specify the separator.
<P> <code>String.split()</code> returns only the nondelimiter strings. To
obtain the delimiter strings, see <a href="../java.util.regex/Tokenize.html" class="eglink"><b>e432</b> Parsing a String into Tokens Using a Regular Expression</a>.
<P> Note: The <code>StringTokenizer</code> does not conveniently handle
empty fields properly. For example, given the line <code>a,,b</code>, rather
than return three fields (the second being empty), the
<code>StringTokenizer</code> returns two fields, discarding the empty field.
<code>String.split()</code> properly handles empty fields.
<pre> // Parse a comma-separated string
String inputStr = <font color="#0066ff"><i>"a,,b"</i></font>;
String patternStr = <font color="#0066ff"><i>","</i></font>;
String[] fields = inputStr.split(patternStr);
// ["a", "", "b"]
// Parse a line whose separator is a comma followed by a space
inputStr = <font color="#0066ff"><i>"a, b, c,d"</i></font>;
patternStr = <font color="#0066ff"><i>", "</i></font>;
fields = inputStr.split(patternStr, -1);
// ["a", "b", "c,d"]
// Parse a line with and's and or's
inputStr = <font color="#0066ff"><i>"a, b, and c"</i></font>;
patternStr = <font color="#0066ff"><i>"[, ]+(and|or)*[, ]*"</i></font>;
fields = inputStr.split(patternStr, -1);
// ["a", "b", "c"]
</pre>
<P><table width="600" CELLSPACING="0" CELLPADDING="2" BORDER="0">
<tr>
<td bgcolor="#6666cc" align="center"><font color="#ffffff">
<b>Related Examples</b></font></td>
</tr>
</table>
e432. <a class="eglink" href="Tokenize.html?l=rel">
Parsing a String into Tokens Using a Regular Expression
</a>
<br>
<table width="600" CELLSPACING="0" CELLPADDING="2" BORDER="0">
<tr>
<td align="left">
<br>
See also:
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Flags">
Flags
</a>
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Groups">
Groups
</a>
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Lines">
Lines
</a>
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Paragraphs">
Paragraphs
</a>
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Searching%20and%20Replacing">
Searching and Replacing
</a>
</td>
</tr>
</table>
<br>
<br>
<FONT class="xsmall-font">
© 2002 Addison-Wesley.
</FONT></TD><TD> </TD><TD valign="top"><A href="http://compositesw.com/devzone?ref=javaalmanac"><IMG alt="Click Here" height="600" width="120" border="0" src="/csw_oad_120x600_final.gif"></A></TD>
</TR>
</TABLE>
</BODY>
<HEAD>
<META http-equiv="Content-Type" content="text/html; charset=UTF-8">
<META CONTENT="NO-CACHE" HTTP-EQUIV="PRAGMA">
</HEAD>
</HTML>
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -