\input texinfo   @c -*-texinfo-*-
@c %**start of header
@setfilename wget.info
@include version.texi
@set UPDATED May 2003
@settitle GNU Wget @value{VERSION} Manual
@c Disable the monstrous rectangles beside overfull hbox-es.
@finalout
@c Use `odd' to print double-sided.
@setchapternewpage on
@c %**end of header

@iftex
@c Remove this if you don't use A4 paper.
@afourpaper
@end iftex

@c Title for man page.  The weird way texi2pod.pl is written requires
@c the preceding @set.
@set Wget Wget
@c man title Wget The non-interactive network downloader.

@dircategory Network Applications
@direntry
* Wget: (wget).         The non-interactive network downloader.
@end direntry

@ifnottex
This file documents the GNU Wget utility for downloading network
data.

@c man begin COPYRIGHT
Copyright @copyright{} 1996, 1997, 1998, 2000, 2001, 2002, 2003 Free
Software Foundation, Inc.

Permission is granted to make and distribute verbatim copies of
this manual provided the copyright notice and this permission notice
are preserved on all copies.

@ignore
Permission is granted to process this file through TeX and print the
results, provided the printed document carries a copying permission
notice identical to this one except for the removal of this paragraph
(this paragraph not being relevant to the printed manual).
@end ignore
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License, Version 1.1 or
any later version published by the Free Software Foundation; with the
Invariant Sections being ``GNU General Public License'' and ``GNU Free
Documentation License'', with no Front-Cover Texts, and with no
Back-Cover Texts.
A copy of the license is included in the section
entitled ``GNU Free Documentation License''.
@c man end
@end ifnottex

@titlepage
@title GNU Wget @value{VERSION}
@subtitle The non-interactive download utility
@subtitle Updated for Wget @value{VERSION}, @value{UPDATED}
@author by Hrvoje Nik@v{s}i@'{c} and the developers
@ignore
@c man begin AUTHOR
Originally written by Hrvoje Niksic <hniksic@xemacs.org>.
@c man end
@c man begin SEEALSO
GNU Info entry for @file{wget}.
@c man end
@end ignore

@page
@vskip 0pt plus 1filll
Copyright @copyright{} 1996, 1997, 1998, 2000, 2001, 2003 Free Software
Foundation, Inc.

Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License, Version 1.1 or
any later version published by the Free Software Foundation; with the
Invariant Sections being ``GNU General Public License'' and ``GNU Free
Documentation License'', with no Front-Cover Texts, and with no
Back-Cover Texts.  A copy of the license is included in the section
entitled ``GNU Free Documentation License''.
@end titlepage

@ifnottex
@node Top, Overview, (dir), (dir)
@top Wget @value{VERSION}

This manual documents version @value{VERSION} of GNU Wget, the freely
available utility for network downloads.

Copyright @copyright{} 1996, 1997, 1998, 2000, 2001, 2003 Free Software
Foundation, Inc.

@menu
* Overview::            Features of Wget.
* Invoking::            Wget command-line arguments.
* Recursive Retrieval:: Description of recursive retrieval.
* Following Links::     The available methods of chasing links.
* Time-Stamping::       Mirroring according to time-stamps.
* Startup File::        Wget's initialization file.
* Examples::            Examples of usage.
* Various::             The stuff that doesn't fit anywhere else.
* Appendices::          Some useful references.
* Copying::             You may give out copies of Wget and of this manual.
* Concept Index::       Topics covered by this manual.
@end menu
@end ifnottex

@node Overview, Invoking, Top, Top
@chapter Overview
@cindex overview
@cindex features

@c man begin DESCRIPTION
GNU Wget is a free utility
for non-interactive download of files from
the Web.  It supports @sc{http}, @sc{https}, and @sc{ftp} protocols, as
well as retrieval through @sc{http} proxies.

@c man end
This chapter is a partial overview of Wget's features.

@itemize @bullet
@item
@c man begin DESCRIPTION
Wget is non-interactive, meaning that it can work in the background,
while the user is not logged on.  This allows you to start a retrieval
and disconnect from the system, letting Wget finish the work.  By
contrast, most Web browsers require the user's constant presence,
which can be a great hindrance when transferring a lot of data.
@c man end

@sp 1
@item
@ignore
@c man begin DESCRIPTION

@c man end
@end ignore
@c man begin DESCRIPTION
Wget can follow links in @sc{html} and @sc{xhtml} pages and create local
versions of remote web sites, fully recreating the directory structure of
the original site.  This is sometimes referred to as ``recursive
downloading.''  While doing that, Wget respects the Robot Exclusion
Standard (@file{/robots.txt}).  Wget can be instructed to convert the
links in downloaded @sc{html} files to the local files for offline
viewing.
@c man end

@sp 1
@item
File name wildcard matching and recursive mirroring of directories are
available when retrieving via @sc{ftp}.  Wget can read the time-stamp
information given by both @sc{http} and @sc{ftp} servers, and store it
locally.  Thus Wget can see if the remote file has changed since the last
retrieval, and automatically retrieve the new version if it has.  This
makes Wget suitable for mirroring of @sc{ftp} sites, as well as home
pages.

@sp 1
@item
@ignore
@c man begin DESCRIPTION

@c man end
@end ignore
@c man begin DESCRIPTION
Wget has been designed for robustness over slow or unstable network
connections; if a download fails due to a network problem, it will
keep retrying until the whole file has been retrieved.
If the server
supports regetting, it will instruct the server to continue the
download from where it left off.
@c man end

@sp 1
@item
Wget supports proxy servers, which can lighten the network load, speed
up retrieval and provide access behind firewalls.  However, if you are
behind a firewall that requires that you use a socks style gateway, you
can get the socks library and build Wget with support for socks.  Wget
also supports the passive @sc{ftp} downloading as an option.

@sp 1
@item
Built-in features offer mechanisms to tune which links you wish to follow
(@pxref{Following Links}).

@sp 1
@item
The retrieval is conveniently traced with printing dots, each dot
representing a fixed amount of data received (1KB by default).  These
representations can be customized to your preferences.

@sp 1
@item
Most of the features are fully configurable, either through command line
options, or via the initialization file @file{.wgetrc} (@pxref{Startup
File}).  Wget allows you to define @dfn{global} startup files
(@file{/usr/local/etc/wgetrc} by default) for site settings.

@ignore
@c man begin FILES
@table @samp
@item /usr/local/etc/wgetrc
Default location of the @dfn{global} startup file.

@item .wgetrc
User startup file.
@end table
@c man end
@end ignore

@sp 1
@item
Finally, GNU Wget is free software.  This means that everyone may use
it, redistribute it and/or modify it under the terms of the GNU General
Public License, as published by the Free Software Foundation
(@pxref{Copying}).
@end itemize

@node Invoking, Recursive Retrieval, Overview, Top
@chapter Invoking
@cindex invoking
@cindex command line
@cindex arguments
@cindex nohup

By default, Wget is very simple to invoke.  The basic syntax is:

@example
@c man begin SYNOPSIS
wget [@var{option}]@dots{} [@var{URL}]@dots{}
@c man end
@end example

Wget will simply download all the @sc{url}s specified on the command
line.  @var{URL} is a @dfn{Uniform Resource Locator}, as defined below.

However, you may wish to change some of the default parameters of
Wget.
You can do it two ways: permanently, adding the appropriate
command to @file{.wgetrc} (@pxref{Startup File}), or specifying it on
the command line.

@menu
* URL Format::
* Option Syntax::
* Basic Startup Options::
* Logging and Input File Options::
* Download Options::
* Directory Options::
* HTTP Options::
* FTP Options::
* Recursive Retrieval Options::
* Recursive Accept/Reject Options::
@end menu

@node URL Format, Option Syntax, Invoking, Invoking
@section URL Format
@cindex URL
@cindex URL syntax

@dfn{URL} is an acronym for Uniform Resource Locator.  A uniform
resource locator is a compact string representation for a resource
available via the Internet.  Wget recognizes the @sc{url} syntax as per
@sc{rfc1738}.  This is the most widely used form (square brackets denote
optional parts):

@example
http://host[:port]/directory/file
ftp://host[:port]/directory/file
@end example

You can also encode your username and password within a @sc{url}:

@example
ftp://user:password@@host/path
http://user:password@@host/path
@end example

Either @var{user} or @var{password}, or both, may be left out.  If you
leave out either the @sc{http} username or password, no authentication
will be sent.  If you leave out the @sc{ftp} username, @samp{anonymous}
will be used.  If you leave out the @sc{ftp} password, your email
address will be supplied as a default password.@footnote{If you have a
@file{.netrc} file in your home directory, password will also be
searched for there.}

@strong{Important Note}: if you specify a password-containing @sc{url}
on the command line, the username and password will be plainly visible
to all users on the system, by way of @code{ps}.  On multi-user systems,
this is a big security risk.  To work around it, use @code{wget -i -}
and feed the @sc{url}s to Wget's standard input, each on a separate
line, terminated by @kbd{C-d}.

You can encode unsafe characters in a @sc{url} as @samp{%xy}, @code{xy}
being the hexadecimal representation of the character's @sc{ascii}
value.
Some common unsafe characters include @samp{%} (quoted as
@samp{%25}), @samp{:} (quoted as @samp{%3A}), and @samp{@@} (quoted as
@samp{%40}).  Refer to @sc{rfc1738} for a comprehensive list of unsafe
characters.

Wget also supports the @code{type} feature for @sc{ftp} @sc{url}s.  By
default, @sc{ftp} documents are retrieved in the binary mode (type
@samp{i}), which means that they are downloaded unchanged.  Another
useful mode is the @samp{a} (@dfn{ASCII}) mode, which converts the line
delimiters between the different operating systems, and is thus useful
for text files.  Here is an example:

@example
ftp://host/directory/file;type=a
@end example

Two alternative variants of @sc{url} specification are also supported,
because of historical (hysterical?) reasons and their widespread use.

@sc{ftp}-only syntax (supported by @code{NcFTP}):
@example
host:/dir/file
@end example

@sc{http}-only syntax (introduced by @code{Netscape}):
@example
host[:port]/dir/file
@end example

These two alternative forms are deprecated, and may cease being
supported in the future.

If you do not understand the difference between these notations, or do
not know which one to use, just use the plain ordinary format you use
with your favorite browser, like @code{Lynx} or @code{Netscape}.
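
The @samp{%xy} quoting described earlier in this section can be sketched
in the shell.  This is only an illustration and not something Wget
provides: the variable names and the sample password are made up, and
the pipeline assumes the standard @code{od}, @code{tr} and @code{sed}
utilities are available.  For simplicity it quotes every byte, including
safe ones such as letters and digits; that is more than strictly
necessary, but each escape should still decode to the original
character.

@example
# Hypothetical password containing characters that are unsafe in a URL.
password='50%:off'

# Dump the bytes as hex, strip the whitespace od adds, upper-case the
# hex digits, then put a percent sign in front of each two-digit pair.
encoded=$(printf '%s' "$password" |
  od -An -tx1 |
  tr -d ' \n' |
  tr 'abcdef' 'ABCDEF' |
  sed 's/../%&/g')

# prints %35%30%25%3A%6F%66%66
printf '%s\n' "$encoded"
@end example

The quoted string could then stand in for @var{password} in a
@sc{url} of the @samp{user:password} form shown earlier.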