⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 book2

📁 Best algorithm for LZW ..C language
💻
📖 第 1 页 / 共 5 页
字号:
It has a rich spectrum of harmonics,decaying at around 12\ dB/octave, and each harmonic is affectedby the vocal tract resonances..rh "Vocal tract resonances."A simple model of the vocal tract is an organ-pipe-like cylindrical tube(Figure 2.1),with a sound source at one end (the larynx) and open at the other (the lips)..FC "Figure 2.1"This has resonances at wavelengths $4L$, $4L/3$, $4L/5$, ..., where $L$is the length of the tube;and these correspond to frequencies $c/4L$, $3c/4L$, $5c/4L$, ...\ Hz, $c$being the speed ofsound in air.Calculating these frequencies, using a typical figure for thedistance between larynx and lips of 17\ cm,and $c = 340$\ m/s for the speed of sound, leads to resonances atapproximately 500\ Hz, 1500\ Hz, 2500\ Hz, ... ..ppWhen excited by the harmonic-rich waveform of the larynx,the vocal tract resonances producepeaks known as.ulformantsin the energy spectrum of the speech wave (Figure 2.2)..FC "Figure 2.2"The lowest formant, called formant one, varies from around 200\ Hzto 1000\ Hz during speech, the exact range depending on the sizeof the vocal tract.Formant two varies from around 500 to 2500\ Hz, and formant threefrom around 1500 to 3500\ Hz..ppYou can easily hear the lowest formant by whispering the vowels inthe words "heed", "hid", "head", "had", "hod", "hawed", and "who'd".They appear to have a steadily descending pitch, yet since you arewhispering there is no fundamental frequency.What you hear is the lowest resonance of the vocal tract \(em formant one.Some masochistic people can play simple tunes with this formant by puttingtheir mouth in successive vowel shapes and knocking the top of their headwith their knuckles \(em hard!.ppA difficulty occurs when trying to identify the lower formants for speakerswith high-pitched voices.When a formant frequency falls below the fundamental excitation frequencyof the voice, its effect is diminished \(em although it is still present.The vibrato used by opera singers provides a very low-frequency excitation(at the vibrato rate) which helps to illuminate the lower formants evenwhen the pitch of the voice is very high..ppOf course, speech is not a static phenomenon.The organ-pipe model describes the speech spectrum during a continuouslyheld vowel with the mouth in a neutral position such as for "aaah".But in real speech the tongue and lips are in continuous motion,altering the shape of the vocal tract and hence the positions of the resonances.It is as if the organ-pipe were being squeezed and expanded indifferent places all the time.Say.uleeas in "heed" and feel how close your tongue is to the roof of your mouth,causing a constriction near the front of the vocal cavity..ppLinguists and speech engineers use a special frequency analyser called a"sound spectrograph" to make a three-dimensional plot of the variationof the speech energy spectrum with time.Figure 2.3 shows a spectrogram of theutterance "go away"..FC "Figure 2.3"Frequency is given on the vertical axis,and bands are shown at the beginning to indicate the scale.Time is plotted horizontally,and energy is given by the darkness of any particular area.The lower few formants can be seen as dark bands extending horizontally,and they are in continuous motion.In the neutral first vowel of "away", the formant frequenciespass throughapproximately the 500\ Hz, 1500\ Hz, and 2500\ Hz that we calculated earlier.(In fact, formants two and three are somewhat lower than these values.).ppThefine vertical striations in the spectrogram correspond to single openings of the vocal cords.Pitch changes continuously throughout an utterance,and this can be seen on the spectrogram by the differences in spacingof the striations.Pitch change, or.ulintonation,is singularly important inlending naturalness to speech..ppOn a spectrogram, a continuously held vowel shows up as a static energy spectrum.But beware \(em what we call a vowel in everyday language is not the same thing as a"vowel" in phonetic terms.Say "I" and feel how the tongue moves continuously while you're speaking.Technically, this is a.uldiphthongor slide between two vowel positions,and not a single vowel.If you say.ularas in "hard",and change slowly to.uleeas in "heed", you will obtain a diphthong not unlike that in "I".And there are many more phonetically different vowel soundsthan the a, e, i, o, and u that we normally think of.The words "hood" and "mood" have different vowels, for example, as do "head" and "mead".The principal acoustic difference between the various vowel soundsis in the frequencies of the first two formants..ppA further complication is introduced by the nasal tract.  This isa large cavity which is coupled to the oral tract by a passage at theback of the mouth.The passage is guarded by a flap of skin called the "velum".You know about this because inadvertent opening of the velum whileswallowing causes food or drink to go up your nose.The nasal cavity is switched in and out of the vocal tractby the velum during speech.It is used for consonants.ulm,.uln,and the.ulngsound in the word"singing".Vowels are frequently nasalized too.A very effective demonstration of the amount of nasalization in ordinaryspeech can be obtained by cutting a nose-shaped hole in a largebaffle which divides a room, speaking normally with one's nose in the hole,and having someone listen on the other side.The frequency of occurrence ofnasal sounds, and the volume of sound that is emittedthrough the nose, are both surprisingly large.Interestingly enough, when we say in conversation that someone sounds"nasal", we usually mean "non-nasal".  When the nasal passages areblocked by a cold, nasal sounds are missing \(em.uln\c\&'s turn into.uld\c\&'s,and.ulm\c\&'s to.ulb\c\&'s..ppWhen the nasal cavity is switched in to the vocal tract, it introducesformant resonances, just as the oral cavity does.Although we cannotalter the shape of the nasal tract significantly, the nasal formantpattern is not fixed, because the oral tract does play a part in nasalresonances.If you say.ulm,.uln,and.ulngcontinuously, you can hear the difference and feel how it is produced byaltering the combined nasal/oral tract resonances with your tongue position.The nasal cavity operates in parallel withthe oral one:  this causes the two resonance patterns to be summedtogether, with resulting complications which will be discussed in Chapter 5..rh "Sound sources."Speech involves sounds other than those caused by regular vibration ofthe larynx.When you whisper, the folds of the larynx are held slightlyapart so that the air passing between them becomes turbulent, causing a noisy excitationof the resonant cavity.The formant peaks are still present, superimposed on the noise.  Such"aspirated" sounds occur in the.ulhof "hello", and for a very short timeafter the lips are opened at the beginning of "pit"..ppConstrictions made in the mouth produce hissy noises such as.ulss,.ulsh,and.ulf.For example, in.ulssthe tip of the tongue is high up,very close to the roof of the mouth.Turbulent air passing through this constriction causes arandom noise excitation, known as "frication".Actually, the roof of the mouth is quite a complicated object.You can feel with your tongue a bony hump or ridge just behind the frontteeth, and it is this that forms a constriction with the tongue for.uls.In.ulsh,the tongue is flattened close to the roof of the mouth slightly farther back,in a position rather similar to that for.uleebut with a narrowerconstriction,while.ulfis produced with the upper teeth and lower lip.Because they are made near the front of the mouth,the resonances of the vocal tract have little effect on these fricativesounds..ppTo distinguish them from aspiration and frication, the ordinary speechsounds (like "aaah") which have their source in larynx vibration areknown technically as "voiced".  Aspirated and fricative sounds are called"unvoiced".  Thus the three different sound types can be classified as.LB.NPvoiced.NPunvoiced (fricative).NPunvoiced (aspirated)..LECan any of these three types occur together?It would seem that voicing and aspiration can not, for the former requiresthe larynx to be vibrating regularly, but for the latter it must begenerating turbulent noise.However, there is a condition known technically as "breathy voice"which occurs when the vocal cords are slightly apart, still vibrating,but with a large volume of air passing between to create turbulence.Voicing can easily occur in conjunction with frication.Corresponding to.uls,.ulsh,and.ulfwe get the.ulvoicedfricatives.ulz,the sound in the middle of words like "vision" which I will call.ulzh,and.ulv.A simple illustration of voicing is to say "ffffvvvvffff\ ...".During the voiced part you can feel the larynx vibrations with a fingeron your Adam's apple, and it can be heard quite clearly if you stop upyour ears.Technically, there is nothing to prevent frication and aspirationfrom occurring together \(em they do, for example, when a voiced fricativeis whispered \(em but the combination is not an important one..ppThe complicated acoustic effects of noisy excitations in speech can beseen in the spectrogram in Figure 2.4 of"high altitude jets whizz past screaming"..FC "Figure 2.4".rh "The source-filter model of speech production."We have been talking in terms of a sound source (be it voiced or unvoiced)exciting the resonances of the oral (and possible the nasal) tract.This model, which is used extensively in speech analysis and synthesis,is known asthe source-filter model of speech production.  The reason for its successis that the effect of the resonances can be modelled as a frequency-selectivefilter, operating on an input which is the source excitation.Thus the frequency spectrum of the source is modified by multiplying itby the frequency characteristic of the filter (or adding it, if amplitudesare expressed logarithmically).This can be seen in Figure 2.5, which shows a sourcespectrum and filter characteristic which combine to give the overallspectrum of Figure 2.2..FC "Figure 2.5".ppAlthough, as mentioned above, the various fricatives are not subjectedto the resonances of the vocal tract to the same extentthat voiced and aspiratedsounds are, they can still be modelled as a noise source followed bya filter to give them their different sound qualities..ppThe source-filter model is an oversimplification of the actual speechproduction system.  There is inevitably some coupling between the vocaltract and the lungs, through the glottis, during the period whenit is open.  This effectively makes the filter characteristicschange during each individual cycle of the excitation.However, although the effect is of interest to speech re

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -