Acoustic Phonetics

Acoustic phonetics is the study of the physical properties of speech sounds made by the human vocal tract. Computer software programs are used in this field of study. They typically transform the speech waveform into a so-called spectrum, a sequence of frequency-based analyses performed at regular intervals. This computation can then be used to generate an image known as a spectrogram, which is used to examine the waveform. The various analyses allow investigators to examine such things as the duration of the sound, its frequency and its intensity (amplitude). Figure 1 shows a spectrogram of me singing the opening line of ‘Happy Birthday!’ Three so-called formants are shown. These are the horizontal bands across the frequency spectrum which represent regions of relatively great intensity.

Spectrogram showing vowel formants

Figure 1. Spectrogram showing formants

Auditory Phonetics

Auditory phonetics is the study of how people perceive speech sounds. It investigates how people recognize speech sounds – as distinct from other sounds in the environment – and how they interpret them.

Articulatory Phonetics

Articulatory phonetics is the study of how the vocal tract is used to produce (articulate) speech sounds. Further, it studies how speech sounds are combined – in words and in connected speech – and how they vary: their place of articulation in the vocal tract, their manner of articulation, whether the vocal cords are activated during production of the sound, and so on. It examines the two main categories of human speech sounds: (1) vowels and (2) consonants.


Vowels are open sounds because they involve no obstruction to the flow of air from the lungs as it passes up through the windpipe (trachea), through the voice box (larynx) and out of the mouth. Other than a speaker positioning the tongue, jaws and lips in a specific configuration, there is nothing to obstruct the airflow.

describing vowels

A full description of the production of vowel sounds is complex and beyond the scope of this article. However, in brief, a distinction is made between vowels dependent upon five main parameters that influence the shape of the oral cavity:

  1. tongue elevation
  2. position of tongue elevation
  3. shape of the lips
  4. position of the jaws
  5. length of vocalization

In British English there are approximately 20 vowels.


In contrast to the ‘open’ sounds of vowels, consonants are closed sounds. This means that there is some type of obstruction to the airflow from the lungs by parts of the mouth coming into contact with each other, or very nearly contacting, thus closing off the free flow of air.

For example, the lips could come together for the sound ‘b’ as in ball, or the tongue tip could almost contact the gum ridge just behind the upper incisors for the sound ‘s’ as in sun. These contacts, and near contacts, impede the free flow of air through the vocal apparatus. It is this kind of closure that characterizes consonant sounds.

In English there are approximately 24 consonants and these are arranged into five main groups:

  1. plosives: sounds that cannot be sustained and which have a ‘popping’ quality, e.g. ‘p’ as in pea and ‘b’ as in boy
  2. nasals: sounds in which the escaping air passes through the nasal cavity, e.g. ‘m’ as in map and ‘n’ as in nap
  3. fricatives: as air exits through the mouth it forces its way through a narrowed gap (for example, by the tongue tip very nearly touching the gum ridge just behind the upper incisors) – this creates turbulence or friction, e.g. ‘s’ as in so and ‘f’ as in fit
  4. affricates: these are ‘combination’ sounds that begin with a complete obstruction formed by the tongue tip contacting the gum ridge, just behind the upper incisors, before the air is released slowly with friction, e.g. ‘ch’ as in chop and ‘j’ as in jam
  5. approximants: a group of four sustainable sounds – ‘w’ as in we, ‘r’ as in red, ‘l’ as in let and ‘y’ as in you

Again, a full description of consonants is beyond the scope of this article. However, as well as the above five main divisions, consonants can be further described in terms of their voicing, place of articulation and manner of articulation:


Voicing refers to whether or not the vocal folds are vibrating during the production of the consonant. If they are not vibrating the sound is voiceless and if they are vibrating then the sound is voiced.

place of articulation

This refers to the place in the vocal apparatus where the two so-called articulators come together. There are eight places:

  1. bilabial: two lips come together
  2. labio-dental: lip and teeth come together
  3. dental: tongue contacts the teeth
  4. alveolar ridge: tongue tip moves towards the ‘gum ridge’ just behind the upper incisors
  5. post alveolar: tongue tip is close to the position just behind the alveolar ridge, towards the back of the mouth
  6. palatal: tongue moves towards the roof of the mouth (palate)
  7. velar: the back of the tongue moves towards the soft palate (velum)
  8. glottal: the only glottal consonant in English is ‘h’ as in how – strictly speaking, this does not involve two articulators coming together, as the sound is merely the friction caused by air being expelled through the gap between the vocal cords (glottis)


This indicates the type of contact that is made between the two articulators and is defined simply by the five main groups we encountered earlier: plosive, nasal, fricative, affricate or approximant.

Table 1 summarizes the 24 main consonants of English in terms of their voicing, place and manner.


place of articulation

bilabial labio-dental dental alveolar post-alveolar palatal velar glottal
plosive p b t d k g
nasal m n ng
fricative f v th th s z sh zh h
affricate ch j
approximant w r/l y


Conventionally, where symbols appear in pairs, the voiceless consonant is listed before its voiced counterpart.

Table 1. English Consonants

In summary, then, articulatory phonetics is the study of how the vocal tract is used to produce (articulate) speech sounds, the description and categorization of these sounds, how they combine (in words and connected speech) and how they vary (from one speaker to another and from one context to another). Speech therapists are especially concerned with this area of study as any disruption in the ability to articulate speech sounds may give rise to an articulation disorder (speech disorder).