
📄 sgmls.txt

       ity set.  In some environments, higher values may be supported
       for the SUBDOC parameter.

       Documents that do not use optional features are also supported.
       For example, if FORMAL NO is specified in the SGML declaration,
       public identifiers will not be required to be valid formal
       public identifiers.

       Certain parts of the concrete syntax may be changed:

              The shunned character numbers can be changed.

              Eight bit characters can be assigned to LCNMSTRT,
              UCNMSTRT, LCNMCHAR and UCNMCHAR.

              Uppercase substitution can be performed or not performed
              both for entity names and for other names.

              Either short reference delimiters assigned by the
              reference delimiter set or no short reference delimiters
              are supported.

              The reserved names can be changed.

SGMLS(1)                                                 SGMLS(1)

              The quantity set can be increased within certain limits
              subject to there being sufficient memory available.  The
              upper limit on NAMELEN is 239.  The upper limits on
              ATTCNT, ATTSPLEN, BSEQLEN, ENTLVL, LITLEN, PILEN, TAGLEN,
              and TAGLVL are more than thirty times greater than the
              reference limits.  The upper limit on GRPCNT, GRPGTCNT,
              and GRPLVL is 253.  NORMSEP cannot be changed.  DTAGLEN
              and DTEMPLEN are irrelevant since sgmls does not support
              the DATATAG feature.

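The quantity limits stated above can be checked before a document is handed to sgmls. The following is only a minimal sketch: the function name check_quantities and the dictionary layout are hypothetical, and only the limits that the text gives as exact numbers are checked (the "more than thirty times" limits are not, since no exact value is stated).

```python
# Upper limits quoted in the text above; other quantities are not checked.
UPPER_LIMITS = {"NAMELEN": 239, "GRPCNT": 253, "GRPGTCNT": 253, "GRPLVL": 253}

# NORMSEP cannot be changed according to the text.
FIXED = {"NORMSEP"}


def check_quantities(requested):
    """Return a list of problems for a dict of quantity name -> requested value.

    Hypothetical helper: it flags any attempt to set a fixed quantity and
    any value above a stated upper limit.  Names it does not know about
    are accepted silently.
    """
    problems = []
    for name, value in requested.items():
        if name in FIXED:
            problems.append(f"{name} cannot be changed")
        elif name in UPPER_LIMITS and value > UPPER_LIMITS[name]:
            problems.append(
                f"{name}={value} exceeds the upper limit {UPPER_LIMITS[name]}"
            )
    return problems


if __name__ == "__main__":
    for problem in check_quantities({"NAMELEN": 240, "GRPCNT": 253, "NORMSEP": 3}):
        print(problem)
```

DTAGLEN and DTEMPLEN are deliberately left out of the check, since the text describes them as irrelevant.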
   SGML declaration
       The SGML declaration may be omitted; the following declaration
       will be implied:

                            <!SGML "ISO 8879:1986"
                                    CHARSET
       BASESET  "ISO 646-1983//CHARSET
                 International Reference Version (IRV)//ESC 2/5 4/0"
       DESCSET    0  9 UNUSED
                  9  2  9
                 11  2 UNUSED
                 13  1 13
                 14 18 UNUSED
                 32 95 32
                127  1 UNUSED
       CAPACITY PUBLIC  "ISO 8879:1986//CAPACITY Reference//EN"
       SCOPE    DOCUMENT
       SYNTAX   PUBLIC  "ISO 8879:1986//SYNTAX Reference//EN"
                                   FEATURES
       MINIMIZE DATATAG NO OMITTAG  YES          RANK     NO  SHORTTAG YES
       LINK     SIMPLE  NO IMPLICIT NO           EXPLICIT NO
       OTHER    CONCUR  NO SUBDOC   YES 99999999 FORMAL   YES
                                 APPINFO NONE>

       with the exception that characters 128 through 254 will be
       assigned to DATACHAR.

       Sgmls identifies base character sets using the designating
       sequence in the public identifier.  The following designating
       sequences are recognized:

       Designating       ISO           Minimum    Number
       Escape            Registration  Character  of          Description
       Sequence          Number        Number     Characters
       ------------------------------------------------------------------------------------
       ESC 2/5 4/0       -             0          128         full set of ISO 646 IRV
       ESC 2/8 4/0       2             33         94          G0 set of ISO 646 IRV
       ESC 2/8 4/2       6             33         94          G0 set of ASCII
       ESC 2/13 4/1      100           32         96          G1 set of ISO 8859-1
       ESC 2/1 4/0       1             0          32          C0 set of ISO 646
       ESC 2/2 4/3       77            0          32          C1 set of ISO 6429
       ESC 2/5 2/15 3/0  -             0          256         the system character set

       When one of the G0 sets is used as a base set, the characters
       SPACE and DELETE are treated as occurring at positions 32 and
       127 respectively; although these characters are not part of the
       character sets designated by the escape sequences, this mimics
       the behaviour of ISO 2022 with respect to these code positions.

   Output format
       The output is a series of lines.  Lines can be arbitrarily long.
       Each line consists of an initial command character and one or
       more arguments.  Arguments are separated by a single space, but
       when a command takes a fixed number of arguments the last
       argument can contain spaces.  There is no space between the
       command character and the first argument.  Arguments can contain
       the following escape sequences.

       \\     A \.
       \n     A record end character.
       \|     Internal SDATA entities are bracketed by these.
       \nnn   The character whose code is nnn octal.

       A record start character will be represented by \012.  Most
       applications will need to ignore \012 and translate \n into
       newline.

       The possible command characters and arguments are as follows:

       (gi    The start of an element whose generic identifier is gi.
              Any attributes for this element will have been specified
              with A commands.

       )gi    The end of an element whose generic identifier is gi.

       -data  Data.

       &name  A reference to an external data entity name; name will
              have been defined using an E command.

       ?pi    A processing instruction with data pi.

       Aname val
              The next element to start has an attribute name with
              value val which takes one of the following forms:

              IMPLIED
                     The value of the attribute is implied.

              CDATA data
                     The attribute is character data.  This is used for
                     attributes whose declared value is CDATA.

              NOTATION nname
                     The attribute is a notation name; nname will have
                     been defined using an N command.  This is used for
                     attributes whose declared value is NOTATION.

              ENTITY name...
                     The attribute is a list of general entity names.
                     Each entity name will have been defined using an
                     I, E or S command.  This is used for attributes
                     whose declared value is ENTITY or ENTITIES.

              TOKEN token...
                     The attribute is a list of tokens.  This is used
                     for attributes whose declared value is anything
                     else.

       Dename name val
              This is the same as the A command, except that it
              specifies a data attribute for an external entity named
              ename.  Any D commands will come after the E command that
              defines the entity to which they apply, but before any &
              or A commands that reference the entity.

       Nnname Define a notation nname.  This command will be preceded
              by a p command if the notation was declared with a public
              identifier, and by an s command if the notation was
              declared with a system identifier.  A notation will only
              be defined if it is to be referenced in an E command or
              in an A command for an attribute with a declared value of
              NOTATION.

       Eename typ nname
              Define an external data entity named ename with type typ
              (CDATA, NDATA or SDATA) and notation nname.  This command
              will be preceded by one or more f commands giving the
              filenames generated by the entity manager from the system
              and public identifiers, by a p command if a public
              identifier was declared for the entity, and by an s
              command if a system identifier was declared for the
              entity.  nname will have been defined using an N command.
              Data attributes may be specified for the entity using D
              commands.  An external data entity will only be defined
              if it is to be referenced in an & command or in an A
              command for an attribute whose declared value is ENTITY
              or ENTITIES.

       Iename typ text
              Define an internal data entity named ename with type typ
              (CDATA or SDATA) and entity text text.  An internal data
              entity will only be defined if it is referenced in an A
              command for an attribute whose declared value is ENTITY
              or ENTITIES.

       Sename Define a subdocument entity named ename.  This command
              will be preceded by one or more f commands giving the
              filenames generated by the entity manager from the system
              and public identifiers, by a p command if a public
              identifier was declared for the entity, and by an s
              command if a system identifier was declared for the
              entity.  A subdocument entity will only be defined if it
              is referenced in a { command or in an A command for an
              attribute whose declared value is ENTITY or ENTITIES.

       ssysid This command applies to the next E, S or N command and
              specifies the associated system identifier.

       ppubid This command applies to the next E, S or N command and
              specifies the associated public identifier.

       ffilename
              This command applies to the next E or S command and
              specifies an associated filename.  There will be more
              than one f command for a single E or S command if the
              system identifier used a colon.

       {ename The start of the SGML subdocument entity ename; ename
              will have been defined using an S command.

       }ename The end of the SGML subdocument entity ename.

       Llineno file
       Llineno
              Set the current line number and filename.  The filename
              argument will be omitted if only the line number has
              changed.
              This will be output only if the -l option has been given.

       #text  An APPINFO parameter of text was specified in the SGML
              declaration.  This is not strictly part of the ESIS, but
              a structure-controlled application is permitted to act on
              it.  No # command will be output if APPINFO NONE was
              specified.  A # command will occur at most once, and may
              be preceded only by a single L command.

       C      This command indicates that the document was a conforming
              SGML document.  If this command is output, it will be the
              last command.  An SGML document is not conforming if it
              references a subdocument entity that is not conforming.

BUGS
       Some non-SGML characters in literals are counted as two
       characters for the purposes of quantity and capacity
       calculations.

SEE ALSO
       The SGML Handbook, Charles F. Goldfarb
       ISO 8879 (Standard Generalized Markup Language), International
       Organization for Standardization

ORIGIN
       ARCSGML was written by Charles F. Goldfarb.

       Sgmls was derived from ARCSGML by James Clark (jjc@jclark.com),
       to whom bugs should be reported.
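As an illustration of the line format described under "Output format", a consumer of sgmls output might split each line into its command character and arguments and then decode the escape sequences. The following is a minimal sketch, not part of sgmls: the names unescape, parse_line and sdata_mark are hypothetical, the sample lines are invented for the demonstration, and \nnn codes are assumed to map directly to Unicode code points.

```python
import re

# One escape: a backslash followed by \, n, | or exactly three octal digits.
_ESCAPE = re.compile(r"\\(\\|n|\||[0-7]{3})")


def unescape(arg, sdata_mark=""):
    """Decode the escape sequences of one argument.

    \\012 (record start) is dropped and \\n (record end) becomes a newline,
    as the text suggests most applications will want.  sdata_mark is a
    hypothetical parameter: the string that replaces each \\| bracket.
    """
    def replace(match):
        token = match.group(1)
        if token == "\\":
            return "\\"
        if token == "n":
            return "\n"
        if token == "|":
            return sdata_mark
        if token == "012":
            return ""  # record start: ignored
        # Assumption: the octal code is used directly as a code point.
        return chr(int(token, 8))

    return _ESCAPE.sub(replace, arg)


def parse_line(line):
    """Split one output line into a tuple starting with the command character."""
    line = line.rstrip("\n")
    command, rest = line[:1], line[1:]
    if command == "A":
        # Aname val: val is IMPLIED, or a keyword followed by data that
        # may itself contain spaces (it is the last argument).
        parts = rest.split(" ", 2)
        value = unescape(parts[2]) if len(parts) > 2 else None
        return ("A", parts[0], parts[1], value)
    if command == "-":
        return ("-", unescape(rest))
    # All other commands are returned with their raw argument string.
    return (command, rest)


if __name__ == "__main__":
    sample = ["ATITLE CDATA Hello world", "(DOC", "-one\\ntwo", ")DOC", "C"]
    for line in sample:
        print(parse_line(line))
```

Only the A and - commands are decoded here; a fuller reader would also need to associate the s, p and f commands with the E, S or N command that follows them.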
