📄 catdoc.ps
字号:
(catdoc)108 218.4 Q F0(doesn')2.509 E 2.509(ta)-.18 G .009(ttempt to e)-2.509 F .009(xtract formatting information other than tables from MS-W)-.15 F .01(ord document, so dif-)-.8 F .754(ferent output modes means mainly that dif)108 230.4 R .754(ferent charachers should be escaped and dif)-.25 F .754(ferent w)-.25 F.754(ays used to)-.1 F(represent characters, missing from output charset. See CHARA)108 242.4Q(CTER SUBSTITUTION belo)-.4 E(w)-.25 E F2(catdoc)108 271.2 Q F0 .531(uses internal)3.031 F F2(unicode)3.031 E F0 .531(\(4\) representation of te)B .531(xt, so it is able to con)-.15 F -.15(ve)-.4 G .531(rt te).15 F .532(xts when charset in source)-.15 F(document doesn')108 283.2 Q 2.5(tm)-.18 G(atch charset on tar)-2.5 E(get system.)-.18 E(See CHARA)5 E(CTER SETS belo)-.4 E -.65(w.)-.25 G.427(If no \214le names supplied,)108 300 R F2(catdoc)2.927 E F0 .427(processes its standard input unless it is terminal. It is unlik)2.927 F.426(ely that some-)-.1 F .786(body could type W)108 312 R .786(ord document from k)-.8 F -.15(ey)-.1 G .786(board, so if).15 F F2(catdoc)3.286 E F0(in)3.286 E -.2(vo)-.4 G -.1(ke).2 G 3.286(dw).1 G.786(ithout ar)-3.286 F .787(guments and stdin is not)-.18 F .953(redirected, it prints brief usage message and e)108 324 R 3.453(xits. Processing)-.15 F .952(of standard input \(e)3.453 F -.15(ve)-.25G 3.452(na).15 G .952(mong other \214les\))-3.452 F(can be forced using dash '-' as \214le name.)108 336 Q .308(By def)108352.8 R(ault,)-.1 E F2(catdoc)2.808 E F0 .309(wraps lines which are mor\e than 72 chars long and separates paragraphs by blank lines.)2.808 F1.015(This beha)108 364.8 R -.2(vo)-.2 G 1.015(ir can be turned of by).2F F2(-w)3.515 E F0 1.015(switch. In)3.515 F F3(wide)3.515 E F0(mode)3.515 E F2 1.015(catdoc prints each paragraph as one long)3.515 F(line, suitable f)108 376.8 Q(or import into)-.25 E F0 -.1(wo)2.5 G(rd processors which perform w).1 E(ord wrapping theirselv)-.1 E(es.)-.15 E F1(OPTIONS)72 417.6 Q F2(-a)108 429.6 Q F0 2.5(-s)31.67 G(hortcut for -f ascii. Produces ASCII te)-2.5 E(xt as output.)-.15 E(Separates table columns with T)5 E(AB)-.93 E F2(-b)108 446.4 Q F0 2.655(-p)31.11 G .155(rocess brok)-2.655 F .155(en MS-W)-.1 F .155(ord \214le. Normally)-.8 F(,)-.65 E F2 .156(catdoc checks if \214rst 8 bytes)2.655 F F0 .156(of \214le is Microsoft OLE)2.656 F .209(signature. If so, it processes\ \214le, otherwise it just copies it to stdin. It is intended to use)148458.4 R F2(catdoc)2.709 E F0(as)2.708 E(\214lter for vie)148 470.4 Q(wing all \214les with)-.25 E F3(.doc)2.5 E F0 -.15(ex)2.5 G(tension.).15 E F2(-d)108 487.2 Q F3 -.15(ch)C(ar).15 E(set)-.1 E F0 3.745(-s)148499.2 S 1.245(peci\214es destination charset name. Charset \214le has f\ormat described in CHARA)-3.745 F 1.245(CTER SETS)-.4 F(belo)148 511.2 Q5.666(wa)-.25 G 3.166(nd should ha)-5.666 F -.15(ve)-.2 G F2(.txt)5.816E F0 -.15(ex)5.666 G 5.666(tension and).15 F 3.166(reside in)5.666 F F23.166(catdoc library dir)5.666 F 3.165(ectory \(normally)-.18 F(/usr/local/lib/catdoc\).)148 523.2 Q(-f)108 540 Q F3(format)1.97 E F03.081(-s)148 552 S .582(peci\214es output format as described in CHARA)-3.081 F .582(CTER SUBSTITUTION belo)-.4 F -.65(w.)-.25 G F2(catdoc)6.232 E F0(comes)3.082 E(with tw)148 564 Q 2.5(oo)-.1 G(utput formats - ascii and te)-2.5 E(x. Y)-.15 E(ou can add your o)-1.1E(wn if you wish.)-.25 E F2(-l)108 580.8 Q F0(Causes)33.89 E F2(catdoc)2.5 E F0(to list names of a)2.5 E -.25(va)-.2 G(ilable charsets to the stdout and e).25 E(xit successfully)-.15 E(.)-.65 E F2(-m)108 597.6 Q F3(number).36 E F0(Speci\214es right mar)148609.6 Q(gin for te)-.18 E 2.5(xt \(def)-.15 F(ault 72\).)-.1 E F2(-m 0)5E F0(is equi)2.5 E -.25(va)-.25 G(lent to).25 E F2(-w)2.5 E(-s)108 626.4Q F3 -.15(ch)C(ar).15 E(set)-.1 E F0 2.636(Speci\214es source charset. \(one used in W)148 638.4 R 2.636(ord document\), if W)-.8 F 2.636(ord document doesn')-.8 F 5.135(tc)-.18 G(ontain)-5.135 E 2.5(UTF-16 te)148 650.4 R(xt.)-.15 E F2(-t)108667.2 Q F0 2.5(-s)33.34 G(hortcut for)-2.5 E F2(-f tex)2.5 E F0(con)150.5 679.2 Q -.15(ve)-.4 G .834(rts all printable chars, which ha).15 F1.134 -.15(ve s)-.2 H .834(pecial meaning for).15 F F2(LaT)3.334 E(eX)-.92 E F0 .835(\(1\) into appropriate control)B(sequences. Separates table columns by)148 691.2 Q F2(&.)2.5 E(-u)108708 Q F0 3.795(-d)31.11 G 1.295(eclares that W)-3.795 F 3.794(ord document contain UNICODE)-.8 F 1.294(\(UTF-16\) represntation of te)8.794 F 1.294(xt \(as some)-.15 F -.8(Wo)148 720 S .683(rd-97 documents\). If catdoc f).8 F .683(ails to correct)-.1 F -.8(Wo)5.683 G .683(rd document with).8 F(def)5.683 E .683(ault charset,)-.1 F 8.184(try this)8.183 F(MS-W)72 768 Q(ord reader)-.8 E -1.11(Ve)141.495 G(rsion 0.91)1.11 E(1)203.725 E EP%%Page: 2 2%%BeginPageSetupBP%%EndPageSetup/F0 10/Times-Roman@0 SF 389.98(catdoc\(1\) catdoc\(1\))72 48 R(option.)148 84 Q/F1 10/Times-Bold@0 SF(-8)108 100.8 Q F0 2.5(-d)31.67 G(eclares is W)-2.5 E(ord document is 8 bit. Just in case that catdoc)-.8E(recognizes \214le format incorrectly)150.5 112.8 Q(.)-.65 E F1(-w)108129.6 Q F0 1.408(disables w)29.45 F 1.408(ord wrapping. By def)-.1 F(ault)-.1 E F1(catdoc)3.908 E F0 1.407(output is splitted into lines not longer than 72 \(or)3.907 F(number)148 141.6 Q 3.705(,s)-.4 G 1.205(peci\214ed by -m)-3.705 F 6.205(option\) characters)6.205 F 1.206(and paragraphs are separated by blank line. W)3.705 F(ith)-.4 E(this option each paragraph is one long line.)148 153.6 Q F1(-x)108170.4 Q F0(causes catdoc to output unkno)31.67 E(wn UNICODE characher as \\xNNNN, instead of question marks.)-.25 E F1(-v)108 187.2 Q F0 .7(causes catdoc to print some useless information about w)31.67 F .7(ord document structure to stdout before)-.1 F(actual start of te)148199.2 Q(xt.)-.15 E/F2 9/Times-Bold@0 SF(CHARA)72 228 Q(CTER SETS)-.495 EF0(When processing MS-W)108 240 Q(ord \214le)-.8 E F1(catdoc)2.5 E F0(uses information about tw)2.5 E 2.5(oc)-.1 G(haracter sets, typically dif)-2.5 E(ferent)-.25 E 5.321(-i)110.5 252 S.321(nput and output. The)-5.321 F 2.821(ya)-.15 G .321(re stored in plain te)-2.821 F .321(xt \214les in)-.15 F F1(catdoc)2.821 E F0 .321(library directory)2.821 F 2.821(.C)-.65 G .322(haracter set \214les should)-2.821 F 1.4(contain tw)108 264 R 3.9(ow)-.1 G 1.4(hitespace-separated he)-3.9 F 1.399(xadecimal numbers - 8-bit code in character set and 16-bit unicode)-.15F 2.5(code. An)108 276 R(ything from hash mark to end of line is ignore\d, as well as blank lines.)-.15 E F1(catdoc)108 300 Q F0(distrib)4.586 E2.087(ution includes some of these character sets. Additional character\ set de\214nitions, directly)-.2 F .943(usable by)108 312 R F1(catdoc)3.443 E F0 .943(can be obtained from ftp.unicode.or)3.443 F .943(g. Charset \214les ha)-.18 F -.15(ve)-.2 G F1(.txt)3.592 E F0(suf)3.442E .942(\214x, which shouldn')-.25 F 3.442(tb)-.18 G(e)-3.442 E(speci\214ed in command-line or con\214guration \214les.)108 324 Q .393(Note that)108 340.8 R F1(catdoc)2.893 E F0 .393(is distrib)2.893 F .393(uted with Cyrillic charsets as def)-.2 F .394(ault. If you are not Russian, you probably don')-.1 F(t)-.18 E -.1(wa)108 352.8 S(nt it, an should recon\214gure catdoc at compile time or in\ runtime con\214guration \214le.).1 E 1.037(When dealing with documents with charsets other than def)108 369.6 R1.037(ault, remember that Microsoft ne)-.1 F -.15(ve)-.25 G 3.537(ru).15G 1.037(ses ISO)-3.537 F .979(charsets. While letters in, say cp1252 ar\e at the same position as in ISO-8859-1, some punctuation signs)108381.6 R -.1(wo)108 393.6 S .265(uld be lost, if you specify ISO-8859-1 \as input charset. If you use cp1252, catdoc w).1 F .264(ould deal with those)-.1 F(signs as described in CHARA)108 405.6 Q(CTER SUBSTITUTION belo)-.4 E -.65(w.)-.25 G F2(CHARA)72 434.4 Q(CTER SUBSTITUTION)-.495 E F1(catdoc)108 446.4 Q F0(con)2.5 E -.15(ve)-.4 G 2.5(rts MS-W).15 F(ord \214le into follo)-.8 E(wing internal unicode representation:)-.25 E(1. P)108 463.2 Q(aragraphs are separated by ASCII Line Feed symbol \(0x000A\))-.15 E(2. T)108 480 Q(able cells within ro)-.8 E 2.5(wa)-.25 G(re separated by ASCII Field Separator symbol)-2.5 E(\(0x001C\))128 492Q(3. T)108 508.8 Q(able ro)-.8 E(ws are separated by ASCII Record Separator \(0x001E\))-.25 E(4. All pr\intable characters, including whitespace are represented with their)108525.6 Q(respecti)128 537.6 Q .3 -.15(ve U)-.25 H(NICODE codes.).15 E.845(This UNICODE representation is subsequentely con)108 554.4 R -.15(ve)-.4 G .846(rted into 8-bit te).15 F .846(xt in tar)-.15 F .846(get character set using fol-)-.18 F(lo)108 566.4 Q(wing four)-.25 E(-step algorithm:)-.2 E(1. List of special characters is searched for gi)108 583.2 Q -.15(ve)-.25 G 2.5(nu).15 G(nicode character)-2.5 E(.)-.55 E(If found, then app\ropriate multi-character sequence is output instead of character)128595.2 Q(.)-.55 E(2. If there is an equi)108 612 Q -.25(va)-.25 G(lent in tar).25 E(get character set, it is output.)-.18 E(3. Otherwise\, replacement list is searched and, if there is multi-character)108628.8 Q(substitution for this UNICODE char)128 640.8 Q 2.5(,i)-.4 G 2.5(ti)-2.5 G 2.5(so)-2.5 G(utput.)-2.5 E(4. If all abo)108 657.6 Q .3 -.15(ve f)-.15 H(ails, "Unkno).05 E(wn char" symbol \(question mark\) is output.)-.25 E 1.999(Lists of spe\cial characters and list of substitution are character set-independent,\ becouse special chars)108 674.4 R .405(should be escaped re)108 686.4 R-.05(ga)-.15 G .405(rdless of their e).05 F .406(xistense in tar)-.15 F.406(get character set)-.18 F(\(usially)5.406 E 2.906(,t)-.65 G(he)-2.906 E 2.906(ya)-.15 G .406(re parts of US-ASCII,)-2.906 F .899(and therefore e)108 698.4 R .898(xist in an)-.15 F 3.398(yc)-.15 G .898(haracter set\) and replacement list is searched only for those charact\ers, which)-3.398 F(are not found in tar)108 710.4 Q(get character set.)-.18 E 1.808(These lists are stored in)108 727.2 R F1(catdoc)4.309 E F01.809(library directory in \214les with pre\214x of format name. These \\214les ha)4.309 F -.15(ve)-.2 G(MS-W)72 768 Q(ord reader)-.8 E -1.11(Ve)141.495 G(rsion 0.91)1.11 E(2)203.725 E EP%%Page: 3 3%%BeginPageSetupBP%%EndPageSetup/F0 10/Times-Roman@0 SF 389.98(catdoc\(1\) catdoc\(1\))72 48 R(follo)10884 Q(wing format:)-.25 E .315(Each line can be either comment \(startin\g with hash mark\) or contain he)108 100.8 R .314(xadecimal UNICODE v)-.15 F .314(alue, sepa-)-.25 F .273(rated by whitespace from string, which w)108 112.8 R .274(ould be substituted instead of it. If string contain no whitespace it)-.1 F .058(can be used as is, otherwise it should be enclosed in single\ or double quotes. Usial backslash sequences lik)108 124.8 R(e)-.1 E/F110/Times-Italic@0 SF('\\n')108 136.8 Q F0(,).07 E F1('\\t')-1.01 E F0(can be used in these string.)2.5 E/F2 9/Times-Bold@0 SF -.27(RU)72177.6 S(NTIME CONFIGURA).27 E(TION)-.855 E F0 1.272(Upon startup catdoc reads its system-wide con\214guration \214le \()108189.6 R/F3 10/Times-Bold@0 SF(catdocr)3.772 E 3.772(ci)-.18 G 3.772(nc)-3.772 G(atdoc)-3.772 E F0 1.273(library directory\) and)3.772 F(then user)108 201.6 Q(-speci\214c con\214guration \214le)-.2 E F3(${HOME}/.catdocr)2.5 E(c.)-.18 E F0(These \214les can contain follo)108218.4 Q(wing directi)-.25 E -.15(ve)-.25 G(s:).15 E F3(sour)108 235.2 Q(ce_charset =)-.18 E F1 -.15(ch)2.5 G(ar).15 E(set-name)-.1 E F0 .518(Sets def)148 247.2 R .517(ault source charset, which w)-.1 F .517(ould be used if no)-.1 F F3(-s)3.017 E F0 .517(option speci\214ed. Consult con\214guration)3.017 F(of nearby windo)148259.2 Q(ws w)-.25 E(orkstation to \214nd one you need.)-.1 E F3(tar)108276 Q(get_charset =)-.1 E F1 -.15(ch)2.5 G(ar).15 E(set-name)-.1 E F0(Sets def)150.5 288 Q(ault output charset. Y)-.1 E(ou probably kno)-1.1E 1.3 -.65(w, w)-.25 H(hich one you use.).65 E F3(charset_path =)108304.8 Q F1(dir)2.85 E(ectory-list)-.37 E F0 .305(colon-separated list o\f directories, which are searched for charset \214les.)148 316.8 R .305(This allo)5.305 F .305(ws you to install)-.25 F(additional charsets in your home directory)148 328.8 Q(.)-.65 E F3(map_path =)108 345.6 Q F1(dir)2.85 E(ectory-list)-.37 E F0 .433(colon-\separated list of directories, which are searched for special character\ map and replacement)148 357.6 R(map.)148 369.6 Q F3 -.25(fo)108 386.4 S(rmat =).25 E F1(format name)4.47 E F0 .737(Output format which w)148398.4 R .737(ould be used by def)-.1 F(ault.)-.1 E F3(catdoc)5.737 E F0.738(comes with tw)3.238 F 3.238(of)-.1 G .738(ormats -)-3.238 F F3(ascii)3.238 E F0(and)3.238 E F3(tex)3.238 E F0 -.2(bu)148 410.4 S 2.619(tn).2 G .119(othing pre)-2.619 F -.15(ve)-.25 G .119(nts you from writing your o).15 F .119(wn format \(set tw)-.25 F 2.619(om)-.1 G .118(ap \214les - special character map)-2.619 F(and replacement map\).)148 422.4 Q F3(unkno)108 439.2 Q(wn_char =)-.1 EF1 -.15(ch)2.5 G(ar).15 E(acter speci\214cation)-.15 E F0 .646(sets characher to output instead of unkno)148 451.2 R .646(wn unicode character \(def)-.25 F .646(ault '?'\))-.1 F .646(Character speci\214ca-)5.646 F(tion can ha)148 463.2 Q .3 -.15(ve o)-.2H(ne of tw).15 E 2.5(of)-.1 G(orm - character enclosed in single quotes or he)-2.5 E(xadecimal code.)-.15 E F2 -.09(BU)72 480 S(GS).09 E F0 .177(Can produce g)108 492 R .176(arbage, if \214le contain embedded illustrations. Doesn')-.05 F 2.676(th)-.18 G .176(andle f)-2.676 F(ast-sa)-.1 E -.15(ve)-.2 G 2.676(sp).15G(roperly)-2.676 E 2.676(.P)-.65 G .176(rints foot-)-2.676 F .244(notes\ as separate paragraphs at the end of \214le, instead of producing corr\ect late)108 504 R 2.744(xc)-.15 G .244(ommands. Cannot distin-)-2.744 F(guish between empty table cell and end of table ro)108 516 Q -.65(w.)-.25 G F2(SEE ALSO)72 568.8 Q F3(xls2csv)108 580.8 Q F0(\(1\),)A F3(cat)2.5 E F0(\(1\),)A F3(strings)2.5 E F0(\(1\),)A F3(utf)2.5 E F0(\(4\),)AF3(unicode)2.5 E F0(\(4\))A F2 -.45(AU)72 609.6 S(THOR).45 E F0 -1.29(V.)108 621.6 S(B.W)1.29 E(agner <vitus@ice.ru>)-.8 E(MS-W)72 768 Q(ord reader)-.8 E -1.11(Ve)141.495 G(rsion 0.91)1.11 E(3)203.725 E EP%%Trailerend%%EOF
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -