📄 readme.txt
字号:
A/B Tool: A program for comparing speech files.
Version 1.2 - September 1993
David P. Kemp <dpkemp@alpha.ncsc.mil>
Copyright 1993, U.S. Department of Defense
A/B Tool is a program for playing A/B comparisons between speech files.
It generates the necessary segment boundaries on the fly, eliminating
the need to create play scripts by hand. A/B Tool scans files for gaps
between segments of speech and compensates for delay between files.
It can automatically play corresponding segments from multiple files
in user-specified order, or it can play a single segment from one file
at a time to allow critical comparisons.
Usage:
The simplest usage of abtool is to start it with the desired
files listed on the command line:
abtool *.spd
or
abtool file_a file_b file_c ...
and then press the "Play A/B" button. This will scan the first file
for segment boundaries, align the remaining files to the first, and
play corresponding segments from each file in the order listed.
Controls:
File / "Load SPD" - Pops up the file navigator, allowing the user to
open additional speech files.
File / "Load Seg"
File / "Save Seg" - Allows the user to load and save custom segment
boundaries. **Not Yet Implemented**
"Audio..." - this starts up the Sun deskset application "audiocontrol",
for setting the volume, headphone/speaker port, etc. An alternate
command can be specified in the properties sheet.
Drop Box - currently inoperative, but future versions may allow
drag 'n drop for opening files.
"Quit" - terminates abtool and any spawned application.
"Play A/B" - Plays an A/B comparison, as specified by the AB: and
Seg: fields.
"Stop" - while playing, the "Play A/B" button changes to "Stop",
which has the expected functionality.
"AB:" - Specifies the order in which segments are played. There
can be up to 26 files open simultaneously, each identified by
a single letter. For example, if 3 files are open, specifying
"abc" here would play them in order; "aba" would play a segment
from the first file, followed by the second, followed by a
repeat of the first. "cc" would play each segment from the
third file twice. Letter identifiers are case-insensitive.
"Seg:" - Specifies the segment (or sentence) number to play.
If this field is 0, comparisons are played for all segments
in sequential order. If this field is a valid segment number,
a single comparison for that segment is played.
"Segs:" - Displays the number of segments currently defined. The
first file specified on the command line is automatically scanned
for segments, and segments can be recomputed with the "Segment"
command described below.
File List - The top part of the tool contains a list of open files,
identified by letter. The user can select a file from this list
and bring up a menu with the following options:
"Play" - play the entire file.
"Close" - close the file.
"Graph" - display the file's energy in the plot window.
"Segment" - scan the file to define a new segment list.
Plot window - The bottom part of the tool contains a graph of
energy tracks, segmentation parameters, and segment boundaries.
**Not Yet Implemented**
To play an individual segment from one file, enter the segment
number in the "Seg:" field (above), then place the keyboard
focus in the plot window by clicking the mouse. The specified
segment from particular files can be played by typing the file's
identifier letter. This allows the user to repeat the segment
as many times as desired, for critical comparisons.
At any time, playing can be interrupted by pressing the "Stop"
button. Typing a space (or other invalid character) in the plot
window will stop queueing any new data but allow the audio device
to finish playing whatever is already in the pipeline.
"Properties..." - Brings up a window containing additional controls
and information, as listed below:
"Graph" - selects RMS or Peak energy in the plot window.
"Time" - selects the time scale for energy plots.
"Auto Scale" - enables gain normalization during play.
"Gain" - allows modification of a file's play gain.
"Filenames" - hides filenames to allow blind comparisons.
"Ref:" - shows which file is the reference, i.e. the one which
was most recently segmented. To segment a file, select it
in the File List (above), and choose Segment from the menu.
"File:" - shows which file is selected in the File List.
"Delay" - time delay between the reference and selected files.
The Segment command segments the reference file and then aligns
all other open files to it. To show the alignment for a
particular file, select it from the File List.
"Max Delay" - maximum time delay (in both directions) over
which to search for the best alignment.
"Corr" - the best correlation found between the reference and
selected files. If this is less than .9, the files may not
contain the same speech material, making A/B comparisons less
useful.
"Thresh:", "Gap:", and "Pad:" - segmentation parameters.
Segments are defined as an interval of signal with energy
(in dB) above the "Threshold", surrounded by intervals below
"Threshold" with a minimum duration specified by "Gap". Segment
boundaries are placed "Pad"/2 ms. before and after the high-
energy interval. To segment a file, set the desired parameters
here, select a file from the file list, select "Segment" from
the file list menu, and observe the "Segs" display to determine
how many segments were found.
Audio Devices:
Abtool currently works with Sun audio devices: the 8 kHz 8 bit AMD
device used in the Sparcstation 1/2/IPX, and the 16 bit programmable
sample rate DBRI device used in the Sparcstation 10/LX. Abtool will
convert between encoding formats (linear and mu-law) on the fly, but
it will not do sample rate conversion. If a file does not have a valid
Sun audiofile header, it is assumed to be 16 bit linear, 8 kHz, mono.
Files of different encodings may be open simultaneously and will be
properly translated. Other devices may be specified in the "Audio I/O"
choice, but this is currently ignored.
The "Hold/Release" switch determines whether the audio device is
held open or closed when not in use. If the device is held open, other
audio applications (play, xbiff, etc) will not be able to access it.
This may be desirable for critical listening or generating a recording.
Abtool does not do device control; use gaintool, audiotool, or
audiocontrol to set the volume and select the desired output port(s).
The "Audio..." button will spawn a copy of "audiocontrol"; an alternate
command can be specified in the Properties sheet if desired.
Possible enhancements:
* Support for additional audio devices (MTU, DAT-Link).
* File output, for generating comparison tapes.
* Adding silence between segments.
* Additional audio file encodings (float, 24/32/48 kbps ADPCM)
Standard disclaimer:
Version 1.2 should still be considered beta software. As far as I
know it works correctly, but there is still a fair amount of work to
do - most notably in the plot window.
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -