me_sgram.1
来自「speech signal process tools」· 1 代码 · 共 331 行
1
331 行
.\" Copyright (c) 1988-1990 Entropic Speech, Inc..\" Copyright (c) 1990-1997 Entropic Research Laboratory, Inc. All rights reserved..\" @(#)me_sgram.1 1.10 27 Sep 1997 ESI/ERL.ds ]W (c) 1997 Entropic Research Laboratory, Inc..TH ME_SGRAM 1\-ESPS 27 Sep 1997.SH "NAME"me_sgram - produces a maximum-entropy-based FEA_SPEC file suitable for displaying as a spectrogram.SH "SYNOPSIS".B me_sgram [.BI \-s " range] [.BI \-P " param_file] [.BI \-p " range] [.BI \-r " range"] [.BI \-m " method"] [.BI \-a " lpc_method"] [.BI \-w " window_len"] [.BI \-E " pre_emphasis"] [.BI \-o " analysis_order"] [.BI \-S " step_size"] [.BI \-d " data_window"] [.BI \-c " conv_test"] [ .BI \-i " max_iter"] [.B \-D].I " input.SD output.SPEC".SH "DESCRIPTION".PP.I Me_sgramreads an ESPS sampled data file, pre-emphasizes it, and produces anESPS FEA_SPEC(5\-ESPS) file, which qresults from maximum entropy spectralanalysis. This file is suitable for displaying as a speechspectrogram. (See \fIplotsgram\fP (1\-\s-1ESPS\s+1).) By default(specifying no options) or by specifying \fIwb\fR for the \fB\-m\fRoption, \fIme_sgram\fR produces a FEA_SPEC file that simulates theanalysis done to produce a wide-band speech spectrogram.Alternatively, by specifying \fInb\fR for the \fB\-m\fR option, aFEA_SPEC file that simulates the analysis for a narrow-bandspectrogram is computed. The \fB\-m\fP option simply definesdifferent defaults for the other options. By specifying any of theother options, in addition to \fB\-m\fP, the default values areoverridden by the specified option values. This allows the user toalter the basic display to highlight facets of the spectrogram thatare of particular interest..PP.I me_sgram is a shell script that uses \fIfilter\fP(1\-\s-1ESPS\s+1),\fIrefcof\fP(1\-\s-1ESPS\s+1), and \fIme_spec\fP(1\-\s-1ESPS\s+1);the parameter file is processed using \fIgetparam\fP(1\-\s-1ESPS\s+1)..PPIf "\-" is specified for the output file, standard output is used.Note, standard input cannot be read by this script..SH OPTIONS.PP.TP.BI \-s " start_time:end_time \fR[0.0:(end of file)]\fP".TP.BI \-s " start_time:+duration"Determines the range in seconds in the input file over whichthe spectrogram values are to be computed.In the first form, a pair ofreal numbers gives the starting and ending times of the range.The correspondence between samples and times is determined by two quantities:the starting time of the file and the interval between samples.The former is given by the generic header item.I start_timein the input file,or 0.0 if the header item is not present.The latter is the reciprocal of the sample frequency.I sfin the type-dependent part of the SD header.If.I start_timeis omitted, the starting time of the file is used. If .I end_time is omitted, the range extends through the end of the file.The second form of the option is equivalent to the first with .I "end = start + duration"..TP.BI \-p " starting_point:end_point \fR[1:(end of file)]\fP".TP.BI \-p " starting_point:+incr"Determines the range in points in the input file over whichthe spectrogram values are to be computed.In the firstform, a pair of unsigned integers gives the first andlast points of the range, counting from 1 for the firstpoint in the file. If first is omitted, 1 is used. Iflast is omitted, the range extends to the end of thefile. The second form is equivalent to the first withlast = first + incr . This option should not bespecified if -s is specified. .TP.BI \-r " range"\fB\-r\fP is a synonym for \fB\-p\fP..TP.BI \-m " method \fR[wb]\fP"The basic method is specified here. There are two possible values:\fIwb\fR for wide band and \fInb\fR for narrow band. By specifying\fIwb\fR, the following values are set: lpc_method = fburg,pre-emphasis = .94, analysis_order = 10, window_len = 8 msec.,step_size = 2 msec., and data_window = RECT. By specifying\fInb\fR, the following values are set: lpc_method = fburg,pre-emphasis = .94, analysis_order = 10, window_len = 40 msec.,step_size = 2 msec., and data_window = RECT..TP.BI \-a " lpc_method \fR[fburg]\fP"Specifies the spectrum analysis method. The default is the fastmodified Burg ("fburg") method. Also available are theautocorrelation method "autoc"), covariance method ("cov"), Burgmethod ("burg"), modified Burg method ("mburg"), stuctured covariance("strcov" and "strcov1"), and vector Burg ("vburg", fast approximationto structured covariance. Of the two structured covariance methods,the first ("strcov" is considerably faster and better behaved;"strcov1" is older but included as it may prove useful on occasion.**If "strcov" is used, the \fB-c\fP and \fB-i\fP options becomerelevant. The \fB-m\fP option overrides the value that may be in theparameter file. .TP.BI \-c " conv_test\fR[1e-5]\fP"Specifies, for the STRCOV method only (not including STRCOV1), a convergence test value. The lower the value, the smaller the change required on each iteration before the estimator terminates, and the more iterations that normally will result. .TP.BI \-i " max_iter\fR[20]\fP"Specifies, for the STRCOV method only (not including STRCOV1), the maximum number of iterations that the estimator will run through before terminating. A warning will indicate if the estimatorterminates because max_iter has been exceeded. .TP.BI \-w " window_len"The duration of data in milliseconds over which the the power spectrumis computed for each column of the spectrogram. The analysis bandwidthis inversely proportional to this value..TP.BI \-E " pre_emphasis"The coefficient \fIA\fR in the pre-emphasis filter: 1 \- \fIA\fR/z.This filter is applied to the sampled data before computingthe power spectrum. A value of \fIA\fR = 1 provides a 6 dB/octaveboost to the high frequencies; a value of 0 provides no boost..TP.BI \-o " analysis_order"The number of parameters in the autoregressivemodel that is used to compute the power spectrum. .TP.BI \-S " step_size"The time step in milliseconds between columns in the spectrogram.The time resolution and horizontal magnification areaffected by this parameter..TP.BI \-d " data_window"The data window applied to the data before the computationof the power spectrogram. Possible values include RECT (rectangular),HAMMING, and TRIANG (triangular). See \fIwindow\fP (3\-\s-1ESPS\s+1) fora complete list of supported data windows..TP.B \-DInclude in the output records a field.I dith_spec_valcontaining 1s and 0s intended for display as a simulated gray-scale spectrogramon a black-and-white monitor.The \fIrefcof\fP (1\-\s-1ESPS\s+1) output is piped through the.IR dither (1-ESPS)program..SH "ESPS PARAMETERS".PPThe parameter file is not required to be present, as there are default parameter values that apply. If the parameter file does exist, the following parameters are read:.TP.I "method \- string"The spectrogram method to use. This parameter is not read if the\fB-m\fP option is used. If the parameter is present and is read, itdetermines default values for all other parameters as discussed underthe \fB-m\fP option. The other parameters (or command line options),if present, override these defaults. The choice of "other" usesdefaults that are the same as "wb". .TP.I "lpc_method \- string"The analysis method used to compute reflection coefficients. Thisparameter is not read if the \fB-a\fP is used. .TP.I "data_window \- string"The data window to apply to the data. This parameter is not read ifthe \fB-d\fP option is specified or if \fImethod\fP is in theparameter file and does not have the value "other". Acceptable valuesinclude "RECT" for a rectangular window, "HAMMING", for Hamming,"HANNING" for Hanning, "COS4" for cosine 4th, and "TRIANG", fortriangular; see the.IR window (3-ESPSsp)manual page for the complete list..TP.I "start \- integer".IPThe first point in the input sampled data file that is processed. Avalue of 1 denotes the first sample in the file. This parameter isnot read if any of the range command line options are used. If it isnot in the parameter file, and neither option is specified, a defaultvalue of 1 is used..TP.I "nan \- integer".IPThe total number of data points to process.If.I nanis 0, processing continues through the end of the file.This parameter is not read if any of the range command line options are used..TP.I "window_len \- float".IPThe duration in milliseconds of each frame. This parameter not read if the \fB-w\fP option is specified of if\fImethod\fP is in the file and does not have the value "other"..TP.I "pre_emphasis \- float"The coefficient in the preemphasis filter (see.B \-Ein the Options section). This parameter not read if the \fB-E\fP option is specified..TP.I "step_size \- float"Initial points of consecutive frames differ by this number ofmilliseconds. This parameter not read if the \fB-S\fP option isspecified..TP.I "order - integer".IPThe number of reflection coefficients computed for each frame of input data.If no value is given in the file, a default value of 15 is used. This value is not read if the command line option \fB\-o\fP is used. .TP.I "strcov_test - float".IPIf STRCOV is used, this is the convergence test value (the lower the value, the smaller the change per iteration before termination). This is not read if \fB-c\fP is used. .TP.I "strcov_maxiter - int".IPIf STRCOV is used, this is the maximum number of iterations allowedbefore the estimator terminates. This is not read if \fB-i\fP isused..PPThe values of parameters obtained from the parameter file are printedif the environment variable ESPS_VERBOSE is 3 or greater. The defaultvalue is 3..SH ESPS COMMON.PPESPS Common is read provided that Common processing is enabled and that the .I filenameentry in Common matches .I input.SD,in which case parameters present in Common override values from theparameter file, which in turn may be overriden by command line options(see the discussion in ESPS PARAMETERS and under each option).Common is not read if .I file.sdis standard input. If .I file.rcis not standard output and .I file.sdis not standard input, the Common parameters \fIfilename\fP (= file.sd), \fIprog\fP (= refcof), .I start,and.I nanare written to ESPS Common..PPESPS Common processing may be disabled by setting the environmentvariable USE_ESPS_COMMON to "off". The default ESPS Common file is.espscom in the user's home directory. This may be overidden bysetting the environment variable ESPSCOM to the desired path. Userfeedback of Common processing is determined by the environmentvariable ESPS_VERBOSE, with 0 causing no feedback and increasinglevels causing increasingly detailed feedback. If ESPS_VERBOSE is notdefined, a default value of 3 is assumed..SH ESPS HEADERS.PPThe generic header item \fIstart_time\fP (type DOUBLE) is written inthe output file. The value written is computed by taking the\fIstart_time\fP value from the header of the input file (or zero, ifsuch a header item doesn't exist) and adding to it the offset time(from the beginning of the input file) of the first point processed.Also, the generic header item \fIrecord_freq\fP (type DOUBLE) iswritten in the output file. The value is the number of output recordsper second of input data. For details on the parameters used by all of the programs in the script, look at all of the intermediateheaders (e.g., via \fIpsps\fP \fB-a\fP). .SH "FUTURE CHANGES".PP\fIme_sgram\fP will be re-implemented as a C program. .SH "SEE ALSO".PP\fIsgram\fP (1\-\s-1ESPS\s+1),\fIplotsgram\fP (1\-\s-1ESPS\s+1),\fIfilter\fP (1\-\s-1ESPS\s+1), .br\fIrefcof\fP (1\-\s-1ESPS\s+1),\fIme_spec\fP (1\-\s-1ESPS\s+1), \fIcompute_rc\fP (3\-\s-1ESPS\s+1sp),.br\fIdither\fP (1\-\s-1ESPS\s+1), FEA_SPEC (5\-\s-1ESPS\s+1), FEA_SD (5\-\s-1ESPS\s+1).SH "BUGS".PPThis script cannot read standard input..SH "AUTHOR".PPManual page and program by David Burton. Revisions by John Shore.
⌨️ 快捷键说明
复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?