Jump to a key chapter
The Source-Filter Theory: Model of Speech
First formulated by the phoneticist Gunnar Fant in 1960,1 the source-filter model provides an acoustic description of speech. The idea is that a simple sound from the vocal folds has to travel through the throat and the mouth, passing all sorts of obstacles in order to produce a meaningful sound. According to this model, speech production involves two stages: the source stage and the filter stage. Here's a summary of the key components of the source-filter model.
- The sound source is a movement of air that produces a relatively simple pressure wave. In vowels, the sound source is the vibration of the vocal folds.
- The various structures in the vocal tract form a filter that morphs the pressure wave created by the source.
- Posturing this filter in different ways causes changes in the wave. Changing the position of the tongue, for example, can change the wave of a vowel from [i] (as in cheese) to [a] (as in bot).
- After the wave from the source has passed through the filter, it exits the mouth as a complex pressure wave.
- The complex wave that results from the source and the filter is a distinct, recognizable speech sound.
Running a recording of human speech (aka a speech signal) through a phonetic-analysis program presents you with a great deal of information and graphs. The source-filter model is important because it helps you interpret and analyze this information. It allows you to make predictions about the nature of a speech signal just by looking at it in a computer program.
Definition of Source-Filter Theory
The definition of source-filter theory is relatively simple:
The Source-filter theory is a theory of phonetics that describes speech as the sound produced by a source (usually the vocal folds) and modified by a filter (the vocal tract).
In other words, source-filter theory divides speech into two parts. One part creates a raw sound, and the other part acts as a tube to shape the sound as it passes through.
Playing the trumpet provides an example of source-filter theory in action.
In order to get a sound out of a trumpet, you have to purse your lips and blow air into the trumpet's mouthpiece. This is sometimes called "buzzing" your lips against the mouthpiece. This action creates the sound source.
The body of the trumpet acts as a filter for the sound source. It turns the simple buzz of the mouthpiece into an amplified, clear, recognizable trumpet sound.
Remember that cruel and unusual method of recording the voice right at the level of the vocal folds? The resulting recording sounds like a simple buzzing sound—not unlike the buzz of a trumpet mouthpiece. If you're curious, you can look up the sound and hear it for yourself.
The Source-Filter Theory: Model of the Vocal Tract
Now to translate the source-filter model from trumpets to the vocal tract. In most speech sounds, the sound source is the vibration of the vocal folds, and the filter is the remainder of the vocal tract.
The Sound Source (The Glottis)
The source of voiced speech sounds is the vibration of the vocal folds. The vocal folds are contained in the part of the larynx called the glottis.
The larynx, also known as the voice box, is an organ made of bone and tissue located at the center of the throat.
The glottis is the part of the larynx that contains the vocal folds.
The vocal folds make up the tissue membrane responsible for voicing in speech.
When you force air from your lungs through your closed glottis, you cause your vocal folds to vibrate. This vibration, called the glottal source wave, is the sound source for most speech sounds.
The glottal source wave is the sound source for most, but not all, speech sounds. The source for voiceless speech sounds that originate higher in the vocal tract is the constriction within the vocal tract. For example, when you produce the voiceless labiodental fricative [f], the sound source is air passing through the constriction between the lower lip and the upper teeth. The filter for this sound is very small because there isn't much in front of those structures to morph the sound.
The Filter (The Vocal Tract)
On its way out into the world, the glottal source wave must pass through the filter: the vocal tract.
The vocal tract consists of all the speech organs from the larynx to the lips.
The primary organs in the vocal tract that are relevant to speech production are the epiglottis, pharynx, velum, tongue (tip, blade, body, and root), velum, alveolar ridge, hard palate, teeth, lips, and nasal cavity. Each of these organs can filter the sound from the glottal source wave.
Source-Filter Theory in Vowels
Source-filter theory is also useful because it can help explain the key characteristics of vowels, specifically fundamental frequency (pitch) and formants.
The fundamental frequency (or pitch) of a voiced sound is the sound's primary audible frequency.
Formants are amplified frequency ranges that distinguish one vowel from another.
Fundamental Frequency
When you sing an A at 440 Hz, the fundamental frequency of your voice is 440 Hz. The fundamental frequency of a speech sound depends on the speed of vocal fold vibration and on the length of the filter.
Frequency is usually measured in hertz (Hz), which means cycles per second. If the wave from your voice has a fundamental frequency of 440 Hz, then the wave repeats its pattern 440 times every second. You can raise the fundamental frequency of your voice either by raising your larynx to shorten the vocal tract or by pushing air through your vocal folds with more pressure, forcing them to vibrate faster.
Formants
Formants are bands of loud frequencies that characterize unique vowels. They are categorized into three bands:
F1, the lowest formant, correlates to vowel height. The lower the F1 frequency, the higher the vowel. For example, the vowels with the lowest F1 values are the high vowels, like [i] as in beet and [u] as in boot.
F2, the next lowest formant after F1, correlates to vowel backness. The lower the F2 frequency, the further back the vowel. For example, the vowels with the lowest F2 values are the back vowels, like [o] as in boat and [ɑ] as in bot.
F3, the highest formant relevant to speech sounds, correlates to certain vowel constrictions, especially r-colored vowels like the [ɹ] in the General American pronunciation of bird. This sound has a relatively low F3 value.
What does this have to do with source-filter theory? According to the source-filter model, formants are determined by the shape of the filter.
The low back vowel [ɑ] includes a constriction, or area of tension, at the pharynx. This constriction separates the vocal tract filter into a short tube from the larynx to the pharynx and a long tube from the pharynx to the lips. This filters the glottal source wave into a sound with a high F1 and a low F2, which you perceive as the [ɑ] vowel.
Examples of Source-Filter Theory
You can see examples of the acoustic effects of the source and filter by looking at a sound spectrum.
A sound spectrum is a plot of the simple wave components of a complex wave.
The sound spectrum is like a snapshot of a vowel at a single point in time. It shows all of the frequencies present in the wave (on the x-axis) and the amplitude, or loudness, of each frequency (on the y-axis).
Remember that the sound source helps to determine the fundamental frequency of the wave. The frequencies on the x-axis are evidence of the glottal source wave. These frequencies look like a row of "spikes" on the spectrum.
The first big spike on the sound spectrum is the wave's fundamental frequency.
The filter determines a vowel's formants. Formants are characterized by the loudest frequencies in the wave. On the sound spectrum, the curvy pattern in amplitude at the top of the spikes provides evidence of the filter.
The most important points of this example of the source-filter theory are these:
A change in the number of frequency spikes on the sound spectrum represents a change in the sound source.
A change in the curvy amplitude shape on top of the frequency spikes represents a change in the filter.
Source Filter Theory - Key takeaways
- Source-filter theory is an acoustic model that describes speech as the sound produced by a source (usually the vocal folds) and modified by a filter (the vocal tract).
- The source of voiced speech sounds is the vibration of the vocal folds.
- The filter of speech sounds is the vocal tract.
- The fundamental frequency of a speech sound depends on the speed of vocal fold vibration and on the length of the filter.
- Formants are determined by the shape of the filter.
References
- Gunnar Fant (1960). Acoustic Theory of Speech Production.
Learn with 13 Source Filter Theory flashcards in the free StudySmarter app
Already have an account? Log in
Frequently Asked Questions about Source Filter Theory
What is the source of the source-filter model of vowel production?
The source of voiced speech sounds is the vibration of the vocal folds. The vocal folds are contained in the part of the larynx called the glottis.
Where is the filter located for a vowel according to source-filter theory?
On its way out into the world, the glottal source wave must pass through the filter: the vocal tract
What is source-filter theory in phonetics?
Source-filter theory is a theory of phonetics that describes speech as the sound produced by a source (usually the vocal folds) and modified by a filter (the vocal tract).
Who developed the source-filter theory of speech production?
First thought up by the phoneticist Gunnar Fant in 1960,1 the source-filter model provides an acoustic description of speech. The idea is that a simple sound from the vocal folds has to travel through the throat and the mouth, passing all sorts of obstacles, in order to produce a meaningful sound.
Why is the source-filter theory important?
Running a recording of human speech through a phonetic analysis program presents you with a great deal of information and graphs. The source-filter model is important because it helps you interpret and analyze this information. It allows you to make predictions about the nature of a speech signal just by looking at it in a computer program.
What is the source-filter model?
The source-filter model is a theoretical framework used to describe the production and perception of speech sounds. According to this model, speech production involves two stages: the source stage and the filter stage. In the source stage, the larynx generates a sound wave that is characterized by its fundamental frequency and harmonics. In the filter stage, the vocal tract shapes the sound wave into distinct speech sounds by modifying its spectral characteristics. The resulting sound is then perceived by the listener, who uses their knowledge of the sound system of their language to interpret the intended message.
About StudySmarter
StudySmarter is a globally recognized educational technology company, offering a holistic learning platform designed for students of all ages and educational levels. Our platform provides learning support for a wide range of subjects, including STEM, Social Sciences, and Languages and also helps students to successfully master various tests and exams worldwide, such as GCSE, A Level, SAT, ACT, Abitur, and more. We offer an extensive library of learning materials, including interactive flashcards, comprehensive textbook solutions, and detailed explanations. The cutting-edge technology and tools we provide help students create their own learning materials. StudySmarter’s content is not only expert-verified but also regularly updated to ensure accuracy and relevance.
Learn more