Vocabulary Explains 43% of Reading Comprehension

·4 min read
Vocabulary Explains 43% of Reading Comprehension

If a student struggles with reading comprehension, the instinct is often to practice more reading. Read more books. Do more comprehension exercises. Practice more passages.

But research suggests the real bottleneck might not be reading skill at all. It might be vocabulary.

A 2019 study by Ahmed Masrai found that vocabulary knowledge explains up to 43% of the variance in reading comprehension scores. In other words, nearly half of the difference between strong and weak readers can be traced back to how many words they know.

The Study

Published in SAGE Open, Masrai's study examined 256 university-level students learning English as a second language. He measured their vocabulary knowledge across three frequency levels — high-frequency words (the most common), mid-frequency words, and low-frequency words — and then tested their reading comprehension using the IELTS reading exam.

The key finding: when you combine knowledge of high-frequency and mid-frequency vocabulary, it explained about 43% of the variance in reading comprehension performance among proficient learners.

That's a massive number. It means that vocabulary knowledge alone accounts for nearly half of what separates strong readers from weak ones.

Breaking Down the Numbers

Masrai's study revealed an important nuance about which words matter most:

High-frequency words (the most common 3,000 word families) contributed about 18% of the variance by themselves. These are the everyday words that form the backbone of any text.

Mid-frequency words (3,000–8,000 word families) added another 25% when combined with high-frequency knowledge. These are the words that appear regularly in academic and professional texts but aren't part of casual conversation — words like "ambiguous," "constraint," "paradigm," or "deteriorate."

Low-frequency words added a smaller additional contribution.

The takeaway: it's not just the basic words that matter. The mid-frequency range — the academic and semi-formal vocabulary — is where the biggest gains in comprehension come from.

Why This Matters for Indian Students

While Masrai's study focused on second-language learners of English, the findings are directly relevant to Indian students who study in English-medium schools.

For millions of Indian students, English is effectively a second language used for academic purposes. They face exactly the challenge Masrai studied: building enough English vocabulary to comprehend academic texts across subjects.

When a Class 8 student struggles with a science textbook, the problem often isn't that they can't understand science. It's that they don't know enough English words to decode the explanation. Words like "osmosis," "equilibrium," or "photosynthesis" aren't just science terms — they require understanding the English roots and context around them.

Similarly, when a student underperforms on board exams or competitive tests (CAT, GRE, UPSC), vocabulary gaps are often the hidden cause. The questions aren't too hard — the words are.

The Vocabulary Gap Is a Comprehension Gap

Masrai's research makes a compelling case: if you want to improve reading comprehension, invest in vocabulary.

This doesn't mean memorizing dictionary definitions. The study measured whether students truly knew the words — whether they could recognize them, understand them in context, and process them fluently. That's a deeper kind of knowledge than what rote memorization provides.

Building this kind of word knowledge requires:

Meaningful learning. Students need to engage with words through associations, context, and visual cues — not just definitions. Keyword mnemonics, where each word is linked to a vivid mental image, create the kind of deep encoding that leads to recognition and recall.

Systematic exposure to mid-frequency vocabulary. Masrai's study shows that the biggest comprehension gains come from mid-frequency words. Schools need to deliberately teach these words, not just hope students pick them up through reading.

Spaced repetition. Knowing a word once isn't enough. Students need to review words at scientifically optimized intervals to move them into long-term memory.

VocabularyFast is designed around all three of these principles. Every word comes with a keyword mnemonic and AI-generated image for deep encoding, and the spaced repetition system ensures words move from short-term memory into lasting knowledge.


References:

  • Masrai, A. (2019). "Vocabulary and Reading Comprehension Revisited: Evidence for High-, Mid-, and Low-Frequency Vocabulary Knowledge." SAGE Open, 9(2). DOI: 10.1177/2158244019845182

Ready to boost your vocabulary?

Start Learning