📄 distort.1t

.\" Copyright (c) 1987-1990 Entropic Speech, Inc.
.\" Copyright (c) 1990-1997 Entropic Research Laboratory, Inc. All rights reserved.
.\" @(#)distort.1t	3.11 20 Sep 1997 ESI/ERL
.ds ]W (c) 1997 Entropic Research Laboratory, Inc.
.TH DISTORT 1\-ESPS 20 Sep 1997
.SH "NAME"
distort \- compute distortion measures between two ESPS files
.SH "SYNOPSIS"
.B distort
[
.BI \-x " debug_level"
] [
.BI \-f " record_range"
] [
.BI \-e " element_range"
] [
.B \-E
] [
.B \-r
] [
.B \-s
] [
.B \-n
]
.I "file1 file2"
.SH "DESCRIPTION"
.PP
.I Distort
takes two input ESPS files,
.I file1
and
.IR file2 ,
and computes various distortion measures of
.I file1
relative to
.IR file2 .
Some distortion measures are computed for all ESPS file types, while
others are file\-type specific (see below).  The command line, followed by
summary statistics, is written to standard output.
.PP
If
.I file1
and
.I file2
are not the same ESPS file type,
contain complex data, or have incompatible record structures
(for example, a different number of elements per record or a different
sampling frequency),
.I distort
prints an error message and exits.
If the range of records specified on the command line (with or without
the \fB\-f\fP option) is not
included in both
.I file1
and
.IR file2 ,
.I distort
prints an error message and exits.
.PP
For all ESPS file types other than FEA_ANA and FEA_SPEC,
.I distort
treats files as "generic" ESPS files containing records with some fixed
number of numerical elements per record.  By default, all elements in
the record are considered, but the range of elements within the records
can be restricted by the
.B \-e
option (note: elements are numbered from 1; element 0 refers
to the tag).
For each element in the element range,
.I distort
computes the following measures between corresponding elements of
corresponding records in the two files:
.IP
difference;
.IP
difference magnitude;
.IP
square of the difference magnitude.
.LP
The averages of these quantities over all selected records, along with
their maximum values, are then output for each element position.  If the
.B \-E
option is invoked, a distortion value for each element in each record is
also printed.  If the
.B \-r
option is invoked, then the average of these values over the element range
in each record is also computed and printed, both for the individual
records and for the entire file.  (If
.B \-r
is specified without
.BR \-E ,
then only the record averages and file averages are printed.)
.PP
For example, for ESPS FEA files, the following
command produces output like this:
.sp
.ne 7
.rs
.sp
.ce
distort \-f 2:5 \-E \-r file1.fea file2.fea
.sp
.TS
center;
c c c c c
c c c c c
c c c c c
c c c c c.
			Diff	Diff
Record	Element	Diff	Mag	Squared
------	-------	----	----	-------
2	element1	\-27.00	27.00	729.00
	element2	\-20.00	20.00	400.00
3	element1	\-19.00	19.00	361.00
	element2	\-20.00	20.00	400.00
4	element1	\-21.00	21.00	441.00
	element2	\-22.00	22.00	484.00
5	element1	\-24.00	24.00	576.00
	element2	\-24.00	24.00	576.00
.TE
.ne 7
.rs
.sp
.ce
ELEMENT AVERAGE
.sp
.TS
center;
c c c c c c
c c c c c c
c c c c c c
c c c c c c.
	AVG	AVG	AVG	MAX	MAX
Element	Diff	Mag	Squared	Mag	Squared
-------	----	---	-------	---	-------
element1	\-22.75	22.75	526.8	27.00	729.0
element2	\-21.50	21.50	465.0	24.00	576.0
.TE
.ne 7
.rs
.sp
.ce
RECORD AVERAGE
.sp
.TS
center;
c c c c c
c c c c c
c c c c c.
Record	Element	AVG Diff	AVG Mag	AVG Squared
------	-------	--------	-------	-----------
2	*	\-23.500	23.500	564.500
3	*	\-19.500	19.500	380.500
4	*	\-21.500	21.500	462.500
5	*	\-24.000	24.000	576.000
.TE
.ne 7
.rs
.sp
.ce
TOTAL FILE AVERAGE
.sp
.TS
center;
c c c
c c c
c c c.
MEAN	MEAN	MEAN
Diff	Diff Mag	Diff Squared
--------	-------	-----------
\-22.125	22.125	495.875
.TE
.sp
.LP
The first table shows the difference, difference magnitude, and difference
magnitude squared for each element in the records specified.  The
ELEMENT AVERAGE
table shows the average of each individual element across all records,
whereas the
RECORD AVERAGE
table shows the average of all elements in each record (the asterisk
indicates that the average is taken across all elements; the elements
included in the average can be restricted by the
.B \-e
option).
Finally, the
TOTAL FILE AVERAGE
table shows the average over all selected elements in all the records
specified.
.PP
If
.I file1
and
.I file2
are both ESPS FEA_ANA files and the
.B \-e
option is used, then
.I distort
treats the files as "generic" ESPS files and processes them as above.
If, however, the \fB\-e\fP option is not used, then
.I distort
treats the pulse lengths, raw pulse powers, LPC error powers, and
reflection coefficients separately, as though each set of these
quantities were in a separate record.  The
.B \-E
and
.B \-r
options apply as before.
For example, if
.B \-r
is specified (without \fB\-e\fP), the user gets average statistics for
individual records and for the entire file for
pulse lengths, raw pulse powers, LPC powers, and reflection coefficients.
.PP
For the reflection coefficients and LPC powers,
.I distort
also computes the following distortion measures (see the
.B \-s
option):
.IP
Itakura\-Saito (IS);
.IP
gain\-optimized Itakura\-Saito (GOIS);
.IP
gain\-normalized Itakura\-Saito (GNIS);
.LP
The file averages in the case of FEA_ANA files are computed for
the whole file (restricted by
.B \-f
option), as described above, as well as separately for voiced and
unvoiced records.  (NOTE:
.I distort
prints an asterisk if the LPC power in a given record is zero).
.PP
For example, a typical output for an FEA_ANA file given the following
command would look like:
.ne 7
.rs
.sp
.ce
distort \-f 42:43 \-E \-r /u/wrl/tst3.fana /u/wrl/tst4.fana
.sp
.TS
center;
c c c c c
c c c c c
c c c c c
l c c c c.
			Diff	Diff
Record	Element	Diff	Mag	Squared
\-\-\-\-\-\-	\-\-\-\-\-\-\-	\-\-\-\-	\-\-\-\-	\-\-\-\-\-\-\-
42 (Unvoiced)	ref_coeff[0]	\-0.022741	0.022741	0.000517
	ref_coeff[1]	 0.045983	0.045983	0.002114
	ref_coeff[2]	\-0.010369	0.010369	0.000108
	.	.	.	.
	.	.	.	.
	.	.	.	.
	ref_coeff[9]	\-0.169554	0.169554	0.028748
	raw_power[0]	 24.354967	24.354967	593.164429
	lpc_power[0]	 7.304450	7.304450	53.354984
43 ( Voiced )	ref_coeff[0]	 0.006073	0.006073	0.000037
	ref_coeff[1]	\-0.030668	0.030668	0.000941
	ref_coeff[2]	\-0.019798	0.019798	0.000392
	.	.	.	.
	.	.	.	.
	.	.	.	.
	ref_coeff[9]	\-0.039427	0.039427	0.001554
	p_pulse_len[0]	0.000000	0.000000	0.000000
	p_pulse_len[1]	0.000000	0.000000	0.000000
	p_pulse_len[2]	0.000000	0.000000	0.000000
	p_pulse_len[3]	0.000000	0.000000	0.000000
	raw_power[0]	 112.112762	112.112762	12569.271484
	raw_power[1]	\-303.590942	303.590942	92167.460938
	raw_power[2]	\-554.038818	554.038818	306959.000000
	raw_power[3]	\-683.969238	683.969238	467813.906250
	lpc_power[0]	\-34.695694	34.695694	1203.791138
.TE
.ne 7
.rs
.sp
.ce
ELEMENT AVERAGE
.sp
.TS
center;
c c c c c c
c c c c c c
c c c c c c
c c c c c c.
	AVG	AVG	AVG	MAX	MAX
Element	Diff	Mag	Squared	Mag	Squared
-------	----	---	-------	---	-------
ref_coeff[0]	\-8.3e\-03	1.4e\-02	2.7e\-04	2.2e\-02	5.1e\-04
ref_coeff[1]	 7.6e\-03	3.8e\-02	1.5e\-03	4.5e\-02	2.1e\-03
ref_coeff[2]	\-1.5e\-02	1.5e\-02	2.5e\-04	1.9e\-02	3.9e\-04
\&.	.	.	.	.	.
\&.	.	.	.	.	.
\&.	.	.	.	.	.
ref_coeff[9]	\-1.0e\-01	1.0e\-01	1.5e\-02	1.7e\-01	2.9e\-02
p_pulse_len[0]	0.0e+00	0.0e+00	0.0e+00	0.0e+00	0.0e+00
p_pulse_len[1]	0.0e+00	0.0e+00	0.0e+00	0.0e+00	0.0e+00
p_pulse_len[2]	0.0e+00	0.0e+00	0.0e+00	0.0e+00	0.0e+00
p_pulse_len[3]	0.0e+00	0.0e+00	0.0e+00	0.0e+00	0.0e+00
raw_power[0]	 6.8e+01	6.8e+01	6.5e+03	1.1e+02	1.2e+04
raw_power[1]	\-3.0e+02	3.0e+02	9.2e+04	3.0e+02	9.2e+04
raw_power[2]	\-5.5e+02	5.5e+02	3.0e+05	5.5e+02	3.0e+05
raw_power[3]	\-6.8e+02	6.8e+02	4.6e+05	6.8e+02	4.6e+05
lpc_power[0]	\-1.3e+01	2.1e+01	6.3e+02	3.5e+01	1.2e+03
.TE
.ne 7
.rs
.sp
.ce
RECORD AVERAGE
.sp
.TS
center;
c c c c c
c c c c c
l c c c c.
Record	Element	Avg Diff	Avg Mag	Avg Squared
------	-------	--------	-------	-----------
42 (Unvoiced)	ref_coeff	\-0.084935	0.103815	0.022624
	raw_power	24.354967	24.354967	593.164429
	lpc_power	7.304450	7.304450	53.354984
43 ( Voiced )	ref_coeff	\-0.014936	0.018925	0.000470
	p_pulse_len	0.000000	0.000000	0.000000
	raw_power	\-357.371552	413.427948	219877.406250
	lpc_power	\-34.695694	34.695694	1203.791138
.TE
.ne 7
.rs
.sp
.ce
TOTAL FILE AVERAGE
.sp
.TS
center;
c c c
c c c
c c c.
MEAN	MEAN	MEAN
Diff	Diff Mag	Diff Squared
--------	-------	-----------
\-46.242622	 55.525623	 28430.974609
.TE
.ne 7
.rs
.sp
.ce
SPECTRAL DISTORTIONS
.sp
.TS
center;
c c c c
c c c c
l c c c.
Record	IS	GNIS	GOIS
------	--	----	----
42 (Unvoiced)	6.01	0.21	0.23
43 ( Voiced )	0.07	0.00	0.00
.TE
.ne 7
.rs
.sp
.ce
TOTAL
.sp
.TS
center;
c c c c
c c c c
c c c c.
Frame	IS	GNIS	GOIS
-----	--	----	----
Unvoiced	6.01	0.21	0.23
Voiced	0.07	0.00	0.00
ALL	3.04	0.10	0.12
.TE
.PP
If
.I file1
and
.I file2
are both FEA_SPEC files,
.I distort
computes statistics in the same way as for generic ESPS files
(described above), including the effects of the
.B \-e
option.  However, the statistics computed do not include discrete
approximations to the IS and GNIS spectral distortion measures that are
computed in the case of FEA_ANA files.
.sp
.SH "ESPS PARAMETERS"
.PP
The ESPS parameter file is not read by
.IR distort .
.SH "ESPS HEADERS"
.PP
.I Distort
reads the following values from all ESPS file headers:
.RS 5
.nf
\fIcommon.type
common.ndrec
common.tag\fP
.fi
.RE
For "generic" files, \fIdistort\fP also reads
.RS 5
.nf
\fIcommon.ndouble
common.nfloat
common.nlong
common.nshort
common.nchar\fP
.fi
.RE
For FEA_ANA files, \fIdistort\fP reads the following generic header items
.RS 5
.nf
\fIorder_vcd
order_unvcd
maxpulses
maxraw
maxlpc\fP
.fi
.RE
For SPEC files, \fIdistort\fP reads
.RS 5
.nf
\fIhd.spec\->voicing
hd.spec\->spec_type
hd.spec\->freq_format
hd.spec\->contin
hd.spec\->num_freqs\fP
.fi
.RE
.SH "OPTIONS"
.PP
The following options are supported:
.TP
.BI \-x " debug_level"
If
.I debug_level
is positive,
.I distort
prints debugging messages and other information on the standard error
output.  The messages proliferate as the
.I debug_level
increases.  If
.I debug_level
is 0, no messages are printed.
The default is 0.
.TP
.BI \-f " range"
Selects a subrange of records on which to compute distortions, specified
in frames of the first input file
.IR file1 .
The range is specified using the format
.I start\-end
or
.IR start:end .
Either
.I start
or
.I end
may be omitted, in which case the omitted parameter defaults respectively
to the start or end of
.IR file1 .
The first record in
.I file1
is considered to be frame 1, regardless of its position relative to any
original source file.  The default range is the entire input file
.IR file1 .
.TP
.BI \-e " range"
Selects a subrange of elements within each record on which to compute
distortions.  The range is specified using the format
.I start\-end
or
.IR start:end .
Either
.I start
or
.I end
may be omitted, in which case the omitted parameter defaults
respectively to the first or last element in each record.
Element 0 refers to the tag; however, distortion measures for
tags are not implemented.
.TP
.B \-E
Causes the distortion contribution of every element to be output.
.TP
.B \-r
Causes the distortion contribution of individual records to be output.
If the input files are both FEA_ANA or both SPEC, the record outputs
are marked as either voiced or unvoiced.
.TP
.B \-s
Causes symmetric versions of the various Itakura\-Saito related
distortions to be computed.  Thus, for example,
instead of IS(\fIfile1,file2\fP),
.I distort
computes [IS(\fIfile1,file2\fP) + IS(\fIfile2,file1\fP)]/2.
.TP
.B \-n
Suppresses output of the element averages for the ESPS SPEC file type.
.SH "FUTURE CHANGES"
.PP
Energy\-weighted versions of the three Itakura\-Saito distortions and
the frequency\-variant spectral distance will be implemented.
.SH "SEE ALSO"
.PP
.nf
is_dist_td(3\-ESPSsp), gois_dist_td(3\-ESPSsp), gnis_dist_td(3\-ESPSsp),
featospec(1\-ESPS), refl_to_auto(3\-ESPSsp), FEA_ANA(5\-ESPS),
FEA(5\-ESPS), SD(5\-ESPS), SPEC(5\-ESPS), ESPS(5\-ESPS), FEA_SPEC(5\-ESPS)
.fi
.SH "WARNINGS"
.PP
The Itakura\-Saito distortion measures have not been compared with a
known example and may be incorrect.
.I Distort
does not yet support files that contain complex data.
.SH "BUGS"
.PP
None known.
.SH "REFERENCES"
.PP
[1] R. M. Gray, A. Buzo, A. H. Gray, and Y. Matsuyama, "Distortion Measures
for Speech Processing," IEEE Transactions on Acoustics, Speech, and Signal
Processing, Vol. ASSP\-28, No. 4, August 1980, pp. 367\-376.
.SH "AUTHOR"
.PP
Coded by Ajaipal S. Virdy.  Man page by John Shore, Dave Burton, and
Ajaipal S. Virdy.
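
The generic-file statistics described under DESCRIPTION (difference, difference magnitude, and squared difference, averaged per element, per record, and over the whole selection) can be sketched in Python. This is an illustration only, not the distort source. The function name distortion_stats, the sample record values, and the sign convention (file1 minus file2) are assumptions. The sample values are chosen so that the differences match the FEA example in DESCRIPTION.

```python
def distortion_stats(file1_records, file2_records):
    """Sketch of the generic per-element statistics.

    Both arguments are lists of records; each record is a list of numbers,
    and both files must have the same shape. The sign convention
    (file1 - file2) is an assumption, not taken from the distort source.
    """
    diffs = [
        [x - y for x, y in zip(rec1, rec2)]
        for rec1, rec2 in zip(file1_records, file2_records)
    ]

    def summarize(values):
        # Average and maximum of difference, magnitude, and squared magnitude.
        mags = [abs(v) for v in values]
        squares = [v * v for v in values]
        n = len(values)
        return {
            "avg_diff": sum(values) / n,
            "avg_mag": sum(mags) / n,
            "avg_sq": sum(squares) / n,
            "max_mag": max(mags),
            "max_sq": max(squares),
        }

    n_elements = len(diffs[0])
    # ELEMENT AVERAGE: one summary per element position, across records.
    element_avg = [summarize([rec[j] for rec in diffs]) for j in range(n_elements)]
    # RECORD AVERAGE: one summary per record, across elements.
    record_avg = [summarize(rec) for rec in diffs]
    # TOTAL FILE AVERAGE: all selected elements in all selected records.
    file_avg = summarize([v for rec in diffs for v in rec])
    return element_avg, record_avg, file_avg


# Hypothetical data for records 2..5 with two elements each.
file1 = [[73.0, 80.0], [81.0, 80.0], [79.0, 78.0], [76.0, 76.0]]
file2 = [[100.0, 100.0]] * 4

element_avg, record_avg, file_avg = distortion_stats(file1, file2)
print(element_avg[0]["avg_diff"], element_avg[1]["avg_diff"])
print(record_avg[0]["avg_sq"])
print(file_avg["avg_diff"], file_avg["avg_sq"])
```

With these inputs the element averages of the difference are -22.75 and -21.5, the first record's average squared difference is 564.5, and the total file averages are -22.125 and 495.875, which agrees with the example tables in DESCRIPTION.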

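The range syntax accepted by the -f and -e options (start-end or start:end, with either side optional) can be sketched the same way. The helper name parse_range and its first/last arguments are hypothetical. The sketch handles only the two forms documented under OPTIONS and rejects anything else, which may be stricter than the real program.

```python
import re


def parse_range(spec, first, last):
    """Parse 'start-end' or 'start:end' into an inclusive (start, end) pair.

    An omitted start defaults to `first`, an omitted end to `last`.
    Raises ValueError for malformed input or a range outside [first, last],
    mirroring the documented "prints an error message and exits" behavior.
    """
    match = re.fullmatch(r"(\d*)[-:](\d*)", spec)
    if match is None:
        raise ValueError(f"malformed range: {spec!r}")
    start = int(match.group(1)) if match.group(1) else first
    end = int(match.group(2)) if match.group(2) else last
    if not (first <= start <= end <= last):
        raise ValueError(f"range {start}:{end} not within {first}:{last}")
    return start, end


# Records are numbered from 1; assume a hypothetical file with 10 records.
print(parse_range("2:5", 1, 10))
print(parse_range("2-5", 1, 10))
print(parse_range(":5", 1, 10))
print(parse_range("3-", 1, 10))
```

Passing the bounds in explicitly keeps the helper usable for both options: for -f they would be the first and last record of file1, and for -e the first and last element of a record.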