📄 htmlplus.dtd

📁 harvest是一个下载html网页得机器人
💻 DTD
📖 第 1 页 / 共 3 页
字号:
12 3 下一页
<!SGML  "ISO 8879:1986"--  Document Type Definition for the HyperText Markup Language Plus   for use with the World Wide Web application (HTML+ DTD).  This DTD is designed for use with SGML authoring tools and  it is expected that most browsers will tolerate omissions,  in particular, missing <DIVn> elements and <P> start tags  following headers, which can easily be inferred from the context.     The HTML+ DTD which is structured as an HTML core plus a     number of additional modules which can be included by an     entity definition in a document's <!DOCTYPE> element.     You can include specific features in your document     using the DOCTYPE declaration at the start, e.g.     <!DOCTYPE htmlplus [        <!ENTITY % HTML.tables "INCLUDE">        <!ENTITY % HTML.forms "INCLUDE">     ]>     This spec also allows for authors to extend the DTD and     to define how any new elements are rendered in terms of     existing ones. This should be used with caution.     I would like to acknowledge the influence of the TEI DTDs     which proved very helpful in restructuring the DTD.     Dave Raggett <dsr@hplb.hpl.hp.com> 5th April 1994       Changes:     Added align attribute to headers with same values     and meaning as for paragraphs.     Added prologue and epilogue elements and fixed bug     in definition of %paras with HTML.emph.     Added entity names for standard icons c/o Bert Bos.     This are enabled with the HTML.icons switch.     Added PRE to content model for LI and DD, and changed     DL to require at least one DT per DD. Math is now     part of %text; but can't be nested.     Tried to permit an optional epilogue following last     division, but sgmls gave: Content model is ambiguous     Added support for graphical menus to SELECT/OPTION.     Liberalised content model for headers to %text;     Added optional NOFOLD attribute to P element to     inhibit SGML folding of whitespace.     Dropped the LIT element and changed the TAB element     over to the LaTeX model. Added NOWRAP to P element.     Replaced TH/TD's align=numeric with ALIGNON="." etc.          Made anchor NAME attribute conditional on HTML.obsolete     Added HTML.obsolete for obsoleted HTML elements     Added SIG attribute to A and LINK for specifying     a digital signature to attest that a linked document     is unchanged.     SRC attribute added to NOTE to allow authors to override     the default icon chosen on the basis of the ROLE attribute.     Switched REL/REV back to CDATA to avoid trouble with duplicate     name or name token error. Using an explicit list of relationship     types would force us to drop REV.     Added top/bottom alignment attribute to CAPTION     REL and REV domains now defined by parameter entities     SRC attribute added to LINK to allow images to be used     in document specific toolbar     Baseline attribute added to FIG and IMG to give precise     control of vertical position relative to baseline.     Revised comments for <A> and <LINK> to remove references to     obsoleted timing attributes and to add a list of "standard"     relationship types.     CHANGED element now uses more meaningful attribute names.     STATE dropped in favor of INPUT with TYPE="hidden". Forms now     support a link to a script for client-side execution of constraints.     The element definition for DIV1..DIV6 and P no longer permit the     start tag to be omitted. This was regrettably forced by a     restriction in the SGML standard. Browsers *must* be capable of     inferring them when missing,in order to cope with legacy documents.     Fixed some errors in earlier version to do with start tags and     definitions of parameter entities. The content model for FIG has     been changed to use a P element for text to avoid an SGML parsing     problem with a line break before the caption.--CHARSET        BASESET "ISO 646:1983//CHARSET                 International Reference Version (IRV)//ESC 2/5 4/0"        DESCSET 0   9   UNUSED                9   2   9                11  2   UNUSED                13  1   13                14 18   UNUSED                32 95   32               127  1   UNUSED        BASESET "ISO Registration Number 100//CHARSET                 ECMA-94 Right Part of Latin Alphabet Nr. 1//ESC 2/13 4/1"        DESCSET 128  32  UNUSED                160  95  32                255   1  UNUSEDCAPACITY        SGMLREF                TOTALCAP        150000                GRPCAP          150000SCOPE   DOCUMENTSYNTAX        SHUNCHAR CONTROLS  0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18                           19 20 21 22 23 24 25 26 27 28 29 30 31 127 255        BASESET "ISO 646:1983//CHARSET                 International Reference Version (IRV)//ESC 2/5 4/0"        DESCSET 0 128 0        FUNCTION RE         13                 RS         10                 SPACE      32                 TAB SEPCHAR 9        NAMING   LCNMSTRT ""                 UCNMSTRT ""                 LCNMCHAR ".-"                 UCNMCHAR ".-"                 NAMECASE GENERAL YES                          ENTITY  NO        DELIM    GENERAL  SGMLREF                 SHORTREF SGMLREF        NAMES    SGMLREF        QUANTITY SGMLREF                 NAMELEN  32                 TAGLVL   100                 LITLEN   1024                 GRPGTCNT 150                 GRPCNT   64FEATURES  MINIMIZE    DATATAG  NO    OMITTAG  YES    RANK     NO    SHORTTAG NO  LINK    SIMPLE   NO    IMPLICIT NO    EXPLICIT NO  OTHER    CONCUR  NO    SUBDOC  NO    FORMAL  YES  APPINFO NONE><!DOCTYPE HTMLPLUS [<!-- DTD for HTML+ Markup minimisation should be avoided, otherwise the default <!SGML>declaration is fine. Browsers should be forgiving of markup errors,while authoring tools *should* enforce compliance with the DTD.Common Attributes:id      This attribute allows authors to name elements such as headers         and paragraphs as potential destinations for links. Note that         links don't specify points, but rather extended objects.charset This allows authors to switch to a different char set for        quotations or list etc. This is particularly useful for oriental        languages which need two byte character codes, e.g. see RFC 1468        "Japanese Character Encoding for Internet Messages"ENTITY DECLARATIONS <!ENTITY % foo "X | Y | Z"> is a macro definition for parameters and in  subsequent statements, the string "%foo;" is expanded to "X | Y | Z" Various classes of SGML text types:  CDATA  text which doesn't include markup or entity references  RCDATA text with entity references but no markup  PCDATA text occurring in a context in which markup and entity references      may occur.--><!-- Core HTML+ DTD omits following features --><!ENTITY % HTML.math "IGNORE"><!ENTITY % HTML.tables "IGNORE"><!ENTITY % HTML.figures "IGNORE"><!ENTITY % HTML.emph "IGNORE"><!ENTITY % HTML.forms "IGNORE"><!ENTITY % HTML.obsolete "IGNORE"><!ENTITY % HTML.icons "IGNORE"><!ENTITY % cextra "" -- for character-like elements --><!ENTITY % pextra "" -- for paragraph-like elements --><!-- %cextra; and %pextra are designed to allow document specific     extensions to the HTML+ DTD, e.g.      <!DOCTYPE htmlplus [        <!ENTITY % cextra "|PROPNAME">        <!ELEMENT PROPNAME - - CDATA>      ]>     Use the RENDER element to specify how the browser should     display new elements in terms of existing ones, e.g.      <RENDER tag="PROPNAME" style="I">--><!ENTITY % URL "CDATA" -- a URL or URN designating a hypertext node --><!-- Browsers should render the following types of emphasis     distinctly when the obvious rendering is impractical     I = italic, B = bold, U = underline, S = strikethru,     TT = teletype font, SUP = superscript, SUB = subscript     REV = reverse video for highlighting hit areas in the result of a query     Q = inline quote (render according to local conventions)--><!ENTITY % emph1 "I|B|U|TT|CITE|EM|STRONG|KBD|VAR|DFN|CODE|SAMP"><!ENTITY % emph2 "S|Q|PERSON|ACRONYM|ABBREV|CMD|ARG|REMOVED|ADDED|REV"><!ENTITY % emph3 "SUP|SUB|CHANGED|TAB|HIDE"><![ %HTML.emph [ <!ENTITY % emph  "%emph1;|%emph2;|%emph3;"> ]]><!ENTITY % emph  "%emph1;"><![ %HTML.emph [ <!ENTITY % misc1 "|RENDER|FOOTNOTE|MARGIN"> ]]><!ENTITY % misc1 ""><![ %HTML.forms [ <!ENTITY % misc2 "|INPUT|TEXTAREA|SELECT"> ]]><!ENTITY % misc2 ""><!ENTITY % misc "BR %misc1 %misc2; %cextra;"><![ %HTML.math [ <!ENTITY % math "|MATH"> ]]><!ENTITY % math ""><![ %HTML.figures [ <!ENTITY % text "#PCDATA|A|IMG|FIG|%emph;|%misc;"> ]]><!ENTITY % text "#PCDATA|A|IMG|%emph;|%misc; %math;"><!ENTITY % paras "P|PRE %pextra;"><!ENTITY % lists "UL|OL|DL"><![ %HTML.emph [ <!ENTITY % block1 "NOTE|QUOTE|ABSTRACT|ADDRESS|HR"> ]]><!ENTITY % block1 "ADDRESS|HR"><![ %HTML.tables [ <!ENTITY % block2 "|TABLE"> ]]><!ENTITY % block2 ""><![ %HTML.forms [ <!ENTITY % block3 "|FORM"> ]]><!ENTITY % block3 ""><![ %HTML.obsolete [ <!ENTITY % block4 "|MENU|DIR|BLOCKQUOTE"> ]]><!ENTITY % block4 ""><!ENTITY % block "%block1; %block2; %block3; %block4;"><![ %HTML.emph [<!ENTITY % setup1 "& RENDER*"> ]]><!ENTITY % setup1 ""><!ENTITY % setup "(TITLE? & ISINDEX? & BASE? & META* & LINK* %setup1;)"><!ENTITY % main "%block;|%lists;|%paras;"><!ENTITY % divisions "DIV6*, DIV5*, DIV4*, DIV3*, DIV2*, DIV1*"><!-- these entities are used to simplify element definitions --><!ENTITY % heading "H1|H2|H3|H4|H5|H6"> <!ENTITY % table "P|%heading;|%lists;"><!ENTITY % formulae "BOX|ARRAY|ROOT|%text;"><![ %HTML.obsolete [<!ENTITY % anchorname "name NMTOKEN #IMPLIED"> ]]><!ENTITY % anchorname ""><![ %HTML.forms [ <!ENTITY % fields "text|password|checkbox|radio|submit|reset|int|                        float|date|url|hidden|range|scribble|audio">]]><!-- Core DTD includes basic Latin-1 entities --><!ENTITY % ISOlat1 PUBLIC "ISO 8879-1986//ENTITIES Added Latin 1//EN">%ISOlat1;<!-- additional entities normally found in Latin-1 char sets--><!ENTITY % ISOnum PUBLIC "ISO 8879-1986//ENTITIES Numeric and Special Graphic//EN">%ISOnum;<!-- diacritical marks normally found in Latin-1 char sets--><!ENTITY % ISOdia PUBLIC "ISO 8879-1986//ENTITIES Diacritical Marks//EN">%ISOdia;<!-- misc. from ISO Publishing entities --><!ENTITY ndash  SDATA "[ndash ]"--=en dash--><!ENTITY mdash  SDATA "[ndash ]"--=em dash--><!ENTITY ensp   SDATA "[ensp  ]"--=en space (1/2-em)--><!ENTITY emsp   SDATA "[emsp  ]"--=em space--><!ENTITY hellip SDATA "[hellip]"--=ellipsis (horizontal)--><!ENTITY vellip SDATA "[vellip]"--=ellipsis (vertical)--><!-- standard ISO/WWW icons courtesy of Bert Bos and Kevin Hughes, see         gopher://gopher.let.rug.nl/00/ftp/pub/Bert/WWWicn.940218     These can be used in place of default symbols for list items or as    part of hypertext links, and save time needed to download images.    Browsers can define them in terms of library images or as URL/URNs.--><![ %HTML.icons [  <!ENTITY ftp SDATA "ftp" -- ftp server -->  <!ENTITY gopher SDATA "gopher" -- gopher server -->  <!ENTITY telnet SDATA "telnet" -- telnet connection -->  <!ENTITY archive SDATA "archive" -- archive server -->  <!ENTITY filing.cabinet SDATA "filing.cabinet" -- filing cabinet -->  <!ENTITY folder SDATA "folder" -- folder or directory -->  <!ENTITY fixed.disk SDATA "fixed.disk" -- fixed media drive -->  <!ENTITY disk.drive SDATA "disk.drive" -- removeable media drive -->  <!ENTITY document SDATA "document" -- unspecified document type -->  <!ENTITY unknown.document SDATA "unknown.document" -- unrecognised document type -->  <!ENTITY text.document SDATA "text.document" -- text/plain, text.html etc. -->  <!ENTITY binary.document SDATA "binary.document" -- binary data -->  <!ENTITY binhex.document SDATA "binhex.document" -- binhex format -->  <!ENTITY audio SDATA "audio" -- audio sequence -->  <!ENTITY film SDATA "film" -- film or animation, such as an MPEG movie -->  <!ENTITY image SDATA "image" -- photograph, drawing or graphic of any kind -->  <!ENTITY map SDATA "map" -- geographical or a schematic map -->  <!ENTITY form SDATA "form" -- fill-out form -->  <!ENTITY mail SDATA "mail" -- email messages -->
12 3 下一页
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -