⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 ch27_14.htm

📁 the unix power tools
💻 HTM
字号:
<HTML><!--Distributed by F --><HEAD><TITLE>[Chapter 27] 27.14 Compound Searches </TITLE><METANAME="DC.title"CONTENT="UNIX Power Tools"><METANAME="DC.creator"CONTENT="Jerry Peek, Tim O'Reilly &amp; Mike Loukides"><METANAME="DC.publisher"CONTENT="O'Reilly &amp; Associates, Inc."><METANAME="DC.date"CONTENT="1998-08-04T21:44:31Z"><METANAME="DC.type"CONTENT="Text.Monograph"><METANAME="DC.format"CONTENT="text/html"SCHEME="MIME"><METANAME="DC.source"CONTENT="1-56592-260-3"SCHEME="ISBN"><METANAME="DC.language"CONTENT="en-US"><METANAME="generator"CONTENT="Jade 1.1/O'Reilly DocBook 3.0 to HTML 4.0"><LINKREV="made"HREF="mailto:online-books@oreilly.com"TITLE="Online Books Comments"><LINKREL="up"HREF="ch27_01.htm"TITLE="27. Searching Through Files"><LINKREL="prev"HREF="ch27_13.htm"TITLE="27.13 More grep-like Programs Written in Perl "><LINKREL="next"HREF="ch27_15.htm"TITLE="27.15 Narrowing a Search Quickly "></HEAD><BODYBGCOLOR="#FFFFFF"TEXT="#000000"><DIVCLASS="htmlnav"><H1><IMGSRC="gifs/smbanner.gif"ALT="UNIX Power Tools"USEMAP="#srchmap"BORDER="0"></H1><MAPNAME="srchmap"><AREASHAPE="RECT"COORDS="0,0,466,58"HREF="index.htm"ALT="UNIX Power Tools"><AREASHAPE="RECT"COORDS="467,0,514,18"HREF="jobjects/fsearch.htm"ALT="Search this book"></MAP><TABLEWIDTH="515"BORDER="0"CELLSPACING="0"CELLPADDING="0"><TR><TDALIGN="LEFT"VALIGN="TOP"WIDTH="172"><ACLASS="SECT1"HREF="ch27_13.htm"TITLE="27.13 More grep-like Programs Written in Perl "><IMGSRC="gifs/txtpreva.gif"SRC="gifs/txtpreva.gif"ALT="Previous: 27.13 More grep-like Programs Written in Perl "BORDER="0"></A></TD><TDALIGN="CENTER"VALIGN="TOP"WIDTH="171"><B><FONTFACE="ARIEL,HELVETICA,HELV,SANSERIF"SIZE="-1">Chapter 27<BR>Searching Through Files</FONT></B></TD><TDALIGN="RIGHT"VALIGN="TOP"WIDTH="172"><ACLASS="SECT1"HREF="ch27_15.htm"TITLE="27.15 Narrowing a Search Quickly "><IMGSRC="gifs/txtnexta.gif"SRC="gifs/txtnexta.gif"ALT="Next: 27.15 Narrowing a Search Quickly "BORDER="0"></A></TD></TR></TABLE>&nbsp;<HRALIGN="LEFT"WIDTH="515"TITLE="footer"></DIV><DIVCLASS="SECT1"><H2CLASS="sect1"><ACLASS="title"NAME="UPT-ART-8003">27.14 Compound Searches </A></H2><PCLASS="para"><ACLASS="indexterm"NAME="AUTOID-30632"></A><ACLASS="indexterm"NAME="AUTOID-30636"></A><ACLASS="indexterm"NAME="AUTOID-30638"></A><ACLASS="indexterm"NAME="AUTOID-30642"></A>You may recall that you can search for lines containing &quot;this&quot;<EMCLASS="emphasis">or</EM> &quot;that&quot; using the<SPANCLASS="link"><EMCLASS="emphasis">egrep</EM> (<ACLASS="linkend"HREF="ch27_05.htm"TITLE="Extended Searching for Text with egrep ">27.5</A>)</SPAN><CODECLASS="literal">|</CODE> metacharacter:</P><PCLASS="para"><BLOCKQUOTECLASS="screen"><PRECLASS="screen">egrep 'this|that' <CODECLASS="replaceable"><I>files</I></CODE></PRE></BLOCKQUOTE></P><PCLASS="para">But how do you grep for &quot;this&quot; <EMCLASS="emphasis">and</EM> &quot;that&quot;?Conventional regular expressions don't support an <EMCLASS="emphasis">and</EM> operatorbecause it breaks the rule that patterns match one consecutive stringof text.Well,<SPANCLASS="link"><EMCLASS="emphasis">agrep</EM> (<ACLASS="linkend"HREF="ch28_09.htm"TITLE="ex Scripts Built by diff ">28.9</A>)</SPAN>is one version of <EMCLASS="emphasis">grep</EM> that breaks allthe rules.If you're lucky enough to have it installed, just use:</P><PCLASS="para"><BLOCKQUOTECLASS="screen"><PRECLASS="screen">agrep 'cat;dog;bird' <CODECLASS="replaceable"><I>files</I></CODE></PRE></BLOCKQUOTE></P><PCLASS="para">If you don't have <EMCLASS="emphasis">agrep</EM>, a common technique is to filter the textthrough several <EMCLASS="emphasis">grep</EM>s so that only lines containing all thekeywords make it through the pipeline intact:</P><PCLASS="para"><BLOCKQUOTECLASS="screen"><PRECLASS="screen">grep cat <CODECLASS="replaceable"><I>files</I></CODE> | grep dog | grep bird</PRE></BLOCKQUOTE></P><PCLASS="para">But can it be done in one command?The closest you can come with <EMCLASS="emphasis">grep</EM> is this idea:</P><PCLASS="para"><BLOCKQUOTECLASS="screen"><PRECLASS="screen">grep 'cat.*dog.*bird' <CODECLASS="replaceable"><I>files</I></CODE></PRE></BLOCKQUOTE></P><PCLASS="para">which has two limitations&nbsp;- thewords must appear in the given order, and they cannot overlap.(The first limitation can be overcome using <CODECLASS="literal">egrep 'cat.*dog|dog.*cat'</CODE>,but this trick is not really scalable to more than two terms.)</P><PCLASS="para">As usual, the problem can also be solved by moving beyond the grep familyto the more powerful tools.Here is how to do a line-by-line <EMCLASS="emphasis">and</EM>search using <EMCLASS="emphasis">sed</EM>, <EMCLASS="emphasis">awk</EM>, or <EMCLASS="emphasis">perl</EM>:[2]</P><BLOCKQUOTECLASS="footnote"><PCLASS="para">[2] Some versions of <EMCLASS="emphasis">nawk</EM> require an explicit <CODECLASS="literal">$0~</CODE>in front of each pattern.</P></BLOCKQUOTE><PCLASS="para"><BLOCKQUOTECLASS="screen"><PRECLASS="screen">sed '/cat/!d; /dog/!d; /bird/!d' <CODECLASS="replaceable"><I>files</I></CODE>awk '/cat/ &amp;&amp; /dog/ &amp;&amp; /bird/' <CODECLASS="replaceable"><I>files</I></CODE>perl -ne 'print if /cat/ &amp;&amp; /dog/ &amp;&amp; /bird/' <CODECLASS="replaceable"><I>files</I></CODE></PRE></BLOCKQUOTE></P><PCLASS="para">Okay, but what if you want to find where all the words occur in the same<EMCLASS="emphasis">paragraph</EM>?Just turn on paragraph mode by setting <CODECLASS="literal">RS=&quot;&quot;</CODE> in <EMCLASS="emphasis">awk</EM>or by giving the <EMCLASS="emphasis">-00</EM> option to <EMCLASS="emphasis">perl</EM>:</P><PCLASS="para"><BLOCKQUOTECLASS="screen"><PRECLASS="screen">awk '/cat/ &amp;&amp; /dog/ &amp;&amp; /bird/ {print $0 ORS}' RS= <CODECLASS="replaceable"><I>files</I></CODE>perl -n00e 'print &quot;$_\n&quot; if /cat/ &amp;&amp; /dog/ &amp;&amp; /bird/' <CODECLASS="replaceable"><I>files</I></CODE></PRE></BLOCKQUOTE></P><PCLASS="para">And if you just want a list of the <CODECLASS="replaceable"><I>files</I></CODE> that contain all thewords anywhere in them?Well, <EMCLASS="emphasis">perl</EM> can easily slurp in entire filesif you have the memory and you use the <EMCLASS="emphasis">-0</EM> option to set the recordseparator to something that won't occur in the file (like <EMCLASS="emphasis">NUL</EM>):</P><PCLASS="para"><BLOCKQUOTECLASS="screen"><PRECLASS="screen">perl -ln0e 'print $ARGV if /cat/ &amp;&amp; /dog/ &amp;&amp; /bird/' <CODECLASS="replaceable"><I>files</I></CODE></PRE></BLOCKQUOTE></P><PCLASS="para">(Notice that as the problem gets harder, the less powerful commands dropout.)</P><PCLASS="para">The grep filter technique shown above also works on this problem.Just add a<SPANCLASS="link"><EMCLASS="emphasis">-l</EM> option (<ACLASS="linkend"HREF="ch15_07.htm"TITLE="Getting a List of Matching Files with grep -l ">15.7</A>)</SPAN>and the<SPANCLASS="link"><EMCLASS="emphasis">xargs</EM> command (<ACLASS="linkend"HREF="ch09_21.htm"TITLE="Handle Too-Long Command Lines with xargs ">9.21</A>)</SPAN>to make itpass filenames through the pipeline rather than text lines:</P><PCLASS="para"><BLOCKQUOTECLASS="screen"><PRECLASS="screen">grep -l cat <CODECLASS="replaceable"><I>files</I></CODE> | xargs grep -l dog | xargs grep -l bird</PRE></BLOCKQUOTE></P><PCLASS="para">(<EMCLASS="emphasis">xargs</EM> is basically glue used when one program produces outputthat's needed by another program as command-line arguments.)</P><DIVCLASS="sect1info"><PCLASS="SECT1INFO">- <SPANCLASS="authorinitials">GU</SPAN></P></DIV></DIV><DIVCLASS="htmlnav"><P></P><HRALIGN="LEFT"WIDTH="515"TITLE="footer"><TABLEWIDTH="515"BORDER="0"CELLSPACING="0"CELLPADDING="0"><TR><TDALIGN="LEFT"VALIGN="TOP"WIDTH="172"><ACLASS="SECT1"HREF="ch27_13.htm"TITLE="27.13 More grep-like Programs Written in Perl "><IMGSRC="gifs/txtpreva.gif"SRC="gifs/txtpreva.gif"ALT="Previous: 27.13 More grep-like Programs Written in Perl "BORDER="0"></A></TD><TDALIGN="CENTER"VALIGN="TOP"WIDTH="171"><ACLASS="book"HREF="index.htm"TITLE="UNIX Power Tools"><IMGSRC="gifs/txthome.gif"SRC="gifs/txthome.gif"ALT="UNIX Power Tools"BORDER="0"></A></TD><TDALIGN="RIGHT"VALIGN="TOP"WIDTH="172"><ACLASS="SECT1"HREF="ch27_15.htm"TITLE="27.15 Narrowing a Search Quickly "><IMGSRC="gifs/txtnexta.gif"SRC="gifs/txtnexta.gif"ALT="Next: 27.15 Narrowing a Search Quickly "BORDER="0"></A></TD></TR><TR><TDALIGN="LEFT"VALIGN="TOP"WIDTH="172">27.13 More grep-like Programs Written in Perl </TD><TDALIGN="CENTER"VALIGN="TOP"WIDTH="171"><ACLASS="index"HREF="index/idx_0.htm"TITLE="Book Index"><IMGSRC="gifs/index.gif"SRC="gifs/index.gif"ALT="Book Index"BORDER="0"></A></TD><TDALIGN="RIGHT"VALIGN="TOP"WIDTH="172">27.15 Narrowing a Search Quickly </TD></TR></TABLE><HRALIGN="LEFT"WIDTH="515"TITLE="footer"><IMGSRC="gifs/smnavbar.gif"SRC="gifs/smnavbar.gif"USEMAP="#map"BORDER="0"ALT="The UNIX CD Bookshelf Navigation"><MAPNAME="map"><AREASHAPE="RECT"COORDS="0,0,73,21"HREF="../index.htm"ALT="The UNIX CD Bookshelf"><AREASHAPE="RECT"COORDS="74,0,163,21"HREF="index.htm"ALT="UNIX Power Tools"><AREASHAPE="RECT"COORDS="164,0,257,21"HREF="../unixnut/index.htm"ALT="UNIX in a Nutshell"><AREASHAPE="RECT"COORDS="258,0,321,21"HREF="../vi/index.htm"ALT="Learning the vi Editor"><AREASHAPE="RECT"COORDS="322,0,378,21"HREF="../sedawk/index.htm"ALT="sed &amp; awk"><AREASHAPE="RECT"COORDS="379,0,438,21"HREF="../ksh/index.htm"ALT="Learning the Korn Shell"><AREASHAPE="RECT"COORDS="439,0,514,21"HREF="../lrnunix/index.htm"ALT="Learning the UNIX Operating System"></MAP></DIV></BODY></HTML>

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -