Summary of speech acquisition

Speech acquisition data include the age typically developing children acquire consonants, consonant clusters, vowels, and tones as well as many other areas of speech.

Summary of 250 cross-linguistic studies of speech acquisition

Summary data are included below.

·A summary of English studies of speech acquisition

Cross-linguistic trends in children's speech acquisition are summarized below.

The summary is based on McLeod (2010; 2007; 2020; 2025). Additional information is available in Goldstein and McLeod (2012) and Zhu Hua and Dodd (2006).

Intelligibility

Definition: The estimated amount of speech that is intelligible to a particular listener

Main findings:

2-year-olds are intelligible at least 50% (more often with their parents)
4- and 5-year-olds' speech is intelligible most of the time, even to strangers

Main languages studying this aspect: English, Finnish, and Portuguese

Age of acquisition of consonants

Definition: Age when most children (90% or 75%) can pronounce a consonant like an adult

Main finding: Wide diversity of reported ages (>2;6 years) even for languages sharing similar consonants

Main languages: Almost every of the 24 languages

Age of acquisition of vowels

Definition: Paradigmatic acquisition = Discreet vowels (e.g., in monosyllabic words) vs. Syntagmatic acquisition = Vowels in context (e.g., stressed and unstressed vowels in polysyllables)

Main findings: Paradigmatic acquisition = Approx 3-years-old; Syntagmatic acquisition = Approx 7- to 9-years-old (in English)

Main languages: Very few languages (mostly English)

Percent consonants correct

Definition: Sometimes reported as percent consonants in error

Main findings:

2-year-olds produce consonants correctly at least 70% of the time
5-year-olds produce consonants correctly at least 90% of the time

Main languages: Most (including English, Finnish, German, Hungarian, Putonghua, and Welsh)

Common mismatches

Definition: Sounds children typically produce before they achieve the adult target

Main findings: Although there are some similarities, common mismatches do differ between languages. For example, common mismatches for /s/

/s/ - plosive consonant e.g., [t] in many languages (e.g., English, Dutch, Finnish, Hungarian, and Portuguese)
/s/ - lateralized fricative e.g., [ɮ] in Greek
/s/ - palatal consonant e.g., palatalization in Japanese, and [ʃ] in Israeli Hebrew

Main languages: A few languages including English, Greek, Japanese, Hungarian, Dutch

Phonological processes (patterns)

Definition: Patterns that occur in children's speech

Main findings:Systemic simplifications

Backing (e.g., Lebanese Arabic, Greek, Japanese, Norwegian, Putonghua, Thai, Vietnamese)
Fronting (e.g., Jordanian Arabic, Lebanese Arabic, Cantonese, English, German, Greek, Israeli Hebrew, Japanese, Korean, Maltese, Norwegian, Portuguese, Putonghua, Thai, Turkish, Welsh)
Gliding/Liquid deviation (e.g., Lebanese Arabic, Dutch, English, French, Korean, Maltese, Portuguese, Putonghua, Turkish, Welsh)
Stopping (e.g., Lebanese Arabic, Cantonese, Dutch, English, German, Greek, Israeli Hebrew, Japanese, Korean, Maltese, Norwegian, Portuguese, Putonghua, Thai, Turkish, Welsh)
Devoicing (e.g., Jordanian Arabic, Lebanese Arabic, Dutch, German, Hungarian, Israeli Hebrew, Maltese, Norwegian)
Voicing (e.g., English, German, Norwegian, Turkish, Welsh)

Structural simplifications

Assimilation/Consonant harmony (e.g., Cantonese, Dutch, English, French, Greek, Maltese, Norwegian, Portuguese, Putonghua, Turkish, Welsh)
Cluster reduction (e.g., Dutch, English, French, Greek, Israeli Hebrew, Maltese, Spanish, Thai, Turkish, Welsh)
Initial consonant deletion (e.g., Finnish, Spanish, Maltese, Thai)
Final consonant deletion (e.g., Jordanian Arabic, Cantonese, Dutch, English, German, Greek, Israeli Hebrew, Korean, Maltese, Portuguese, Putonghua, Spanish, Thai, Turkish, Welsh)
Reduplication (e.g., Dutch, English, Greek, Korean, Turkish, Welsh)
(Weak) syllable deletion (e.g., Jordanian Arabic, Dutch, English, Finnish, French, German, Israeli Hebrew, Japanese, Maltese, Norwegian, Portuguese, Spanish, Turkish, Welsh)

Phonetic inventories

Definition: Sounds produced regardless of the adult target

Main findings: Vowels, nasals, and plosives appear to be the earliest sounds to be produced by children. Children produce more sounds and greater articulatory variation as they grow older.

For example, phonetic inventories of American English 1-year-olds = nasals, voiced plosives, and a glide. Phonetic inventories of Jordanian Arabic 1- to 2-year-olds = plosives, fricatives, nasals, a lateral, and approximants. Maltese 2-year-olds = nasals, plosives, a fricative, and approximants

Main languages: Many languages (including Arabic, Cantonese, English, Finnish, Maltese).

Syllable structure

Definition: Syllable shapes produced regardless of target

Main findings: CV is a universal syllable shape (Locke, 1983) and is the earliest syllable structure to emerge. Next syllable shapes to emerge are: CVC (e.g., English, Israeli Hebrew, Maltese, Spanish), V (e.g., Korean), VC (e.g. Israeli Hebrew, Spanish)

Main languages: only a few studies

Prosody: stress

Definition: Strong and weak emphasis on different syllables

Main findings: Acquisition of stress is language-dependent. Very early acquisition (e.g., Israeli Hebrew). Later acquisition (e.g., Dutch and English)

Main languages: Few studies

Prosody: intonation

Definition: Melody of speech

Main findings: Language-specific intonation patterns begins between 1 and 2 years of age (e.g., English and Hungarian). Not fully acquired until 5;0 (English). Perception continues to develop until 10 and 11 years (Wells, Peppé, & Goulandris 2004)

Main languages: Few studies

Prosody: tones

Definition: Some languages use tones to differentiate lexical meaning

Main findings: Tone acquisition was achieved by 2-year-olds (Cantonese and Putonghua)

Main languages: Cantonese and Putonghua

References

McLeod, S. (2025). The Oxford handbook of speech development in languages of the world. Oxford University Press.
McLeod, S. (Ed). (2007). The international guide to speech acquisition. Clifton Park, NY: Thomson Delmar Learning.
McLeod, S. (2010). Laying the foundations for multilingual acquisition: An international overview of speech acquisition. In M. Cruz-Ferreira (Ed.), Multilingual norms (pp. 53-71). Frankfurt: Peter Lang Publishing.
McLeod, S. (2020). Intelligibility in Context Scale: Cross-linguistic use, validity, and reliability. Speech, Language and Hearing, 23(1), 9–16. https://doi.org/10.1080/2050571X.2020.1718837
McLeod, S. & Crowe, K. (2018).Children’s consonant acquisition in 27 languages: A cross-linguistic review. American Journal of Speech-Language Pathology, 27, 1546-1571. https://doi.org/10.1044/2018_AJSLP-17-0100
Crowe, K., & McLeod, S. (2020).Children's English consonant acquisition in the United States: A review. American Journal of Speech-Language Pathology, 29(4), 2155–2169. https://doi.org/10.1044/2020_AJSLP-19-00168
McLeod, S. & Goldstein, B. A. (Eds.). (2012). Multilingual aspects of speech sound disorders in children. Multilingual Matters.
Zhu Hua & Dodd, B. (2006). Phonological development and disorders in children: A multilingual perspective. Cleavdon, UK: Multilingual Matters.
Goldstein, B. A., & McLeod, S. (2012). Typical and atypical multilingual speech acquisition. In S. McLeod & B. A. Goldstein (Eds.), Multilingual aspects of speech sound disorders in children (pp. 84-100). Bristol, UK: Multilingual Matters.

Suggested citation

McLeod, S. (2024). Summary of 250 cross-linguistic studies of speech acquisition. Charles Sturt University. https://www.csu.edu.au/research/multilingual-speech/speech-acquisition/summary-of-speech-acquisition