⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 html-hj.dtd

📁 harvest是一个下载html网页得机器人
💻 DTD
📖 第 1 页 / 共 3 页
字号:
<!--        html-hj.dtd        Document Type Definition for the HyperText Markup Language (HTML	DTD) for Sun Microsystems Corp. HotJava environment	        Date: 24 Jul 95	Revisions made by: Mark Gaither (markg@hal.com)        Original Author: Dave Raggett <dsr@hplb.hpl.hp.com>	I have taken the HTML 3.0 DTD dated 16 Mar 95 and added the APP	element introduced by Sun Microsystems Corp. to accomodate their	Java/HotJava WWW application environment.--><!ENTITY % HTML.Version        "-//Sun Micorsystems Corp.//DTD HotJava HTML//EN"        -- Typical usage:            <!DOCTYPE HTML PUBLIC "-//Sun Micorsystems Corp.//DTD HotJava HTML//EN">            <html>            ...            </html>        --        ><!--================== Flags for Marked Sections ==========================--><!ENTITY % HTML.Recommended "IGNORE"        -- Certain features of the language are necessary for compatibility           with widespread usage, but they may compromise the structural           integrity of a document. This feature test entity enables           a more prescriptive document type definition that eliminates           the above features.        --><![ %HTML.Recommended [        <!ENTITY % HTML.Deprecated "IGNORE">]]><!ENTITY % HTML.Deprecated "INCLUDE"        -- Certain features of the language are necessary for compatibility           with earlier versions of the specification, but they tend           to be used an implemented inconsistently, and their use is           deprecated. This feature test entity enables a document type           definition that eliminates these features.        -->        <!ENTITY % HTML.Obsoleted "IGNORE"        -- The XMP, LISTING and PLAINTEXT tags are incompatible with SGML           and derive from very early versions of HTML. They require non-           standard parsers and will cause problems for processing           documents with standard SGML tools.        --><!--================== Imported Names =====================================--><!ENTITY % Content-Type "CDATA"        -- meaning a MIME content type, as per RFC1521        --><!ENTITY % HTTP-Method "GET | POST"        -- as per HTTP specification        --><!ENTITY % URI "CDATA"        -- The term URI means a CDATA attribute           whose value is a Uniform Resource Identifier,           as defined by        "Uniform Resource Identifiers" by Tim Berners-Lee        aka http://info.cern.ch/hypertext/WWW/Addressing/URL/URI_Overview.html        aka RFC 1630        Note that CDATA attributes are limited by the LITLEN        capacity (1024 in the current version of html.decl),        so that URIs in HTML have a bounded length.        --><!ENTITY % REAL "CDATA" -- real numbers (not in SGML) --><!ENTITY % SHAPE "CDATA"     -- Shape of hotzone in image.       All coordinates are assumed to be numbers in the range 0 to 1       and interpreted as fractional width/height and measured from       the top left corner of the associated image.       The attribute value is a string taking one of the following forms:              		"default"       		       Used to define a default link for the figure background.            "circle x, y, r"        (x, y) define the center and r the radius.            "rect x, y, w, h"       (x, y) defines upper left, and w and h the width and height.            "polygon x1, y1, x2, y2, ..."       Given n pairs of x, y coordinates, the polygon is closed by a       line linking the n'th point to the first. Intersecting polygons       use the non-zero winding number rule to determine if a point lies       inside the polygon.I    --><!-- 3.0 Parameter Entities --><!ENTITY % heading "H1|H2|H3|H4|H5|H6"><![ %HTML.Obsoleted [    <!ENTITY % preformatted "PRE | XMP | LISTING">]]><![ %HTML.Deprecated [    <!ENTITY % list "UL | OL | DIR | MENU">    <!ENTITY % blockquote "BLOCKQUOTE | BQ">]]><!ENTITY % list "UL | OL"><!ENTITY % blockquote "BQ"><!ENTITY % preformatted "PRE"><!-- The CLASS attribute is used to subclass HTML elements for     rendering purposes, when used with style sheets, e.g. DSSSL lite --><!ENTITY % attrs  -- common attributes for elements --        'id       ID      #IMPLIED -- as target for hrefs (link ends) --         lang     CDATA   "en.us"  -- ISO language, country code --         class    NAMES   #IMPLIED -- for subclassing elements --'><!-- SGML standard forces different NAMES for all attribute values     in the same element, regardless of the attribute name! As a result     CDATA is used for CLEAR attribute to avoid clash with ALIGN attribute.--><!--When text flows around a figure or table in the margin, you sometimes wantto start an element like a header, paragraph or list below the figure ratherthan alongside it. The CLEAR attribute allows you to move down unconditionally:	clear=left          move down until left margin is clear	clear=right         move down until right margin is clear	clear=all           move down until both margins are clearAlternatively, you can decide to place the element alongside the figure justso long as there is enough room. The minimum width needed is specified as:	clear="40 en"       move down until there is at least 40 en units free	clear="100 pixels"  move down until there is at least 100 pixels free	The style sheet (or browser defaults) may provide default minimum widths foreach class of block-like elements.--><!ENTITY % needs -- Attributes for controlling text flow. Used in headers                    and other elements to guarantee sufficient room --        'clear  CDATA #IMPLIED'><!--    The following attribute may be included where ever a URL can be given:    md      message digest e.g. md="md5:jV2OfH+nnXHU8bnkPAad/mSQlTDZ"            where the digest is base64 encoded and preceded by a prefix            denoting the algorithm (in this case MD5).--><!ENTITY % url.link -- Attributes associated with URL based links --        "md     CDATA  #IMPLIED  -- message digest for linked object --"><!--================ Character mnemonic entities ==========================--><!-- The HTML list of Latin-1 entities includes the full range     of characters in widely available Latin-1 fonts, and as such     is a mixture of ISOlat1 and other ISO publishing symbols --><!ENTITY % HTMLlat1 PUBLIC  "-//IETF//ENTITIES Added Latin 1 for HTML//EN">%HTMLlat1;<!--================ Entities for special symbols =========================--><!ENTITY emsp   SDATA "[emsp  ]" -- em space --><!ENTITY ensp   SDATA "[ensp  ]" -- en space (1/2-em) --><!ENTITY mdash  SDATA "[ndash ]" -- em dash --><!ENTITY ndash  SDATA "[ndash ]" -- en dash (1/2-em) --><!ENTITY nbsp   SDATA "[nbsp  ]" -- non breaking space --><!ENTITY shy    SDATA "[shy   ]" -- soft hyphen --><!ENTITY copy   SDATA "[copy  ]" -- copyright sign --><!ENTITY trade  SDATA "[trade ]" -- trade mark sign --><!ENTITY reg    SDATA "[reg   ]" -- registered sign --><!--================ Entities for standard icons ==========================--><!-- a range of standard icons such as &folder; for use     in speeding up display of directory listings etc. --><!ENTITY % HTMLicons PUBLIC  "-//IETF//ENTITIES icons for HTML//EN">%HTMLicons;<!--================ Entities for math symbols ============================--><!-- ISO subset chosen for use with the widely available Adobe math font --><!ENTITY % HTMLmath PUBLIC  "-//IETF//ENTITIES Math and Greek for HTML//EN">%HTMLmath;<!--=================== Text Markup =======================================--><!ENTITY % font " U | S | TT | I | BIG | SMALL"><!ENTITY % phrase "EM | STRONG | CODE | SAMP | KBD | VAR | CITE"><!ENTITY % misc "Q | LANG | AU | DFN | PERSON | ACRONYM | ABBREV | INS | DEL"><!ENTITY % special "TAB | MATH | A | IMG | BR"><!ENTITY % notmath "%font | %phrase | %special | %misc"><!ENTITY % text "#PCDATA | SUB | SUP | B | %notmath"><!ENTITY % pre.exclusion "TAB|MATH|IMG|BIG|SMALL|SUB|SUP"><!ELEMENT (%font|B|%phrase|%misc) - - (%text)+><!ATTLIST (%font|B|%phrase|%misc) %attrs;><!-- Subscripts and superscripts. The ALIGN attribute is only used for math --><!ELEMENT (SUB|SUP) - - (%text)+><!ATTLIST (SUB|SUP)        %attrs;        align (left|center|right) #IMPLIED        ><!-- Forced line break --><!ELEMENT BR    - O EMPTY><!ATTLIST BR        %attrs;        %needs; -- for control of text flow --        ><!-- Named left, center and right tab stops (independent of '\t' char) --><!ELEMENT TAB - O EMPTY><!ATTLIST TAB        id       ID       #IMPLIED  -- defines named tab stop --        indent   NUMBER   0         -- en units before new tab stop --        to       IDREF    #IMPLIED  -- jump to named tab stop --        align   (left|center|right|decimal) left        dp      CDATA     #IMPLIED  -- decimal point e.g. dp="," --        ><!--================== Link Markup ========================================--><!--     With HTML 3.0 you can use ID attributes on most elements for named     link ends. The use of the NAME attribute on anchors is deprecated.     Do we want to support arbitrary elements for link starts? This would     involve adding HREF and related attributes to most elements.--><![ %HTML.Deprecated [    <!ENTITY % linkName "name CDATA #IMPLIED -- named link end --">]]><!ENTITY % linkName ""><!ENTITY % ToolBar "home|toc|index|glossary|copyright|                    up|previous|next|help|bookmark"        -- LINK RELationship values which are used to create toolbar           buttons or menu items for navigation, where toc stands           for table of contents and bookmark provides for an open           ended set of links, i.e. you can use multiple bookmarks           for key entry points. Use the optional TITLE attribute           to override default names.        --><!ENTITY % linkType "NAME"        -- A definitive list will be specified at a later date.           They are used                a) by stylesheets to control how collections of                   html nodes are rendered into printed documents                b) for document specific toolbars/menus when used                   with the LINK element in document head:                    "home|toc|index|glossary|copyright|                           up|previous|next|help|bookmark"                   where toc stands for table of contents and                   bookmark provides for an open ended set of links,                   i.e. you can use several bookmarks for key entry                   points. Use the optional TITLE attribute to                   override default names.                c) for hypertext paths or guided tours,                   with REL=NODE and REL=PATH.                 d) to make a link to a style sheet, e.g. rel=stylesheet                   (used only with the LINK element).                                   e) to make a link to a separate banner, e.g. rel=banner                   (used only with the LINK element).        --><!ENTITY % linkExtraAttributes -- URN moved to %url.link --       "rel %linkType #IMPLIED -- forward relationship type --        rev %linkType #IMPLIED -- reversed relationship type                                  to referent data --        title   CDATA #IMPLIED -- advisory only --        methods NAMES #IMPLIED -- supported public methods of the object:                                        TEXTSEARCH, GET, HEAD, ... --        "><![ %HTML.Deprecated [    <!ENTITY % A.content "(%heading|%text)+">]]><!ENTITY % A.content   "(%text)+"><!ELEMENT A     - - %A.content -(A)><!ATTLIST A        %attrs;        href   %URI;  #IMPLIED        %url.link;    -- standard link attributes --        %linkName;     -- name attribute is deprecated; use ID instead --        shape %SHAPE; #IMPLIED -- for shaped hotzones in FIGs --        %linkExtraAttributes;        ><!--=================== Images ============================================--><!-- Desired widths are used for negotiating image size     with the module responsible for painting the image.     align=left or right cause image to float to margin     and for subsequent text to wrap around image --><!ELEMENT IMG    - O EMPTY --  Embedded image --><!ATTLIST IMG        %attrs;        src  %URI;  #REQUIRED  -- URI of image to embed --        %url.link;             -- standard link attributes --        alt CDATA   #IMPLIED   -- for display in place of image --        align  (top|middle|bottom|left|right) top -- relative to baseline        --         width  NUMBER #IMPLIED -- desired width in en's or pixels --        height NUMBER #IMPLIED -- desired height in en's or pixels --        units  (en|pixels) pixels -- units for width and height --        ismap (ismap) #IMPLIED -- pass clicks to server --        ><!--=================== Paragraphs=========================================--><!ELEMENT P     - O (%text)+><!ATTLIST P        %attrs;        align  (left|center|right|justify) #IMPLIED        %needs; -- for control of text flow --        nowrap (nowrap) #IMPLIED -- disable wordwrap --        ><!--=================== Headings, Titles, Sections ========================--><!ELEMENT HR    - O EMPTY -- customizable horizontal rule --><!ATTLIST HR        %attrs;        src     %URI;   #IMPLIED -- URI of custom rule graphic --        %url.link;               -- standard link attributes --        %needs; -- for control of text flow --        ><!--  Headers can be numbered, although this is a matter for style sheets.  The style sheet controls the numbering style:     a) whether the parent numbering is inherited, e.g.  5.i.c        where 5 is the current sequence number for H1 headers, and        1 is the number for H2 headers and 3 for H3 headers.     b) what style is used for current sequence number        e.g. arabic, upperalpha, loweralpha, upperroman, lowerroman        or a numbering scheme appropriate for the current language  The skip attribute is used to skip over sequence numbers for items  which have been left out of the list, e.g. skip=3 advances the  sequence number past 3 omitted items. The seqnum sets the sequence  number to a specified value. Note that the style sheet may take  advantage of the sequence number for higher level headers.  The dingbat or src attributes may be used to specify a bullet like  image to be placed adjacent to the header. Defining this in the  header element simplifies the document markup and avoids the need  to use the clear or needs attribute in the following element to  prevent it flowing around this image.--><!ELEMENT ( %heading )  - -  (%text;)+><!ATTLIST ( %heading )        %attrs;        align  (left|center|right|justify) #IMPLIED        %needs; -- for control of text flow --        seqnum   NUMBER   #IMPLIED  -- starting sequence number --        skip     NUMBER   0         -- skip seq nums for missing items --        dingbat  ENTITY   #IMPLIED  -- dingbat entity from HTMLicons --        src      (%URI;)  #IMPLIED  -- bullet defined by graphic --        %url.link;                  -- standard link attributes --        nowrap   (nowrap)  #IMPLIED -- disable wordwrap --        ><!ELEMENT TITLE - -  (#PCDATA)          -- The TITLE element is not considered part of the flow of text.             It should be displayed, for example as the page header or             window title.          -->

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -