A post went viral on r/languagelearning this week. Someone spent two years grinding Anki for Spanish. 4,000+ cards. Over 90% retention. Could read menus, subtitles, news articles — no problem.
Then they had a 20-minute conversation with a native speaker and completely fell apart.
The comments are a graveyard of shared experience. Hundreds of learners nodding along: "This is exactly what happened to me." Two years of daily reviews. Thousands of mature cards. And when it mattered, their mouth couldn't produce what their brain supposedly knew.
This isn't a bug. It's a fundamental misunderstanding of how language acquisition works — and it comes down to tracking the wrong thing.
Recognition ≠ Production
Cognitive science draws a sharp line between two types of memory. Recognition memory is seeing a Spanish word on a flashcard and retrieving its meaning. Production memory is starting with a concept in your head and generating the word — under time pressure, while also managing grammar, listening, and social context simultaneously.
These are genuinely different cognitive processes. The neural pathway that nails a flashcard review is not the same system that fires during real-time speech.
A 2019 study in Language Learning found that learners who relied heavily on flashcard-only review performed significantly worse on spontaneous production tasks than learners who spent equivalent time in conversational practice — even when the flashcard learners had higher recognition test scores.
Read that again. Higher test scores. Worse at speaking. Because the test measured the wrong thing.
The Four Things Flashcards Can't Train
🧠 What Anki misses
- Processing speed — Fluent speech needs word retrieval in under 600ms. Flashcards have no time pressure. Your brain never learns urgency.
- Collocations — Words don't live in isolation. "Do homework" not "make homework." Flashcards teach words; conversation teaches how words live together.
- Phonological automaticity — Knowing what a word means and producing it with correct pronunciation under pressure are completely different skills.
- Contextual meaning — "Cringe" means something different at a dinner party vs. on Twitter. Flashcards give dictionary entries. Conversation gives cultural layers.
This is why someone can have 4,000 mature Anki cards and still freeze in conversation. They trained recognition in a zero-pressure environment. Speaking requires production under maximum pressure. It's like training for a marathon by doing seated leg extensions.
The Real Problem: What Gets Measured Gets Done
Here's the insidious part. Anki gives you beautiful metrics. Cards reviewed. Retention rate. Streak days. Mature vs. young cards. It's a dashboard that makes you feel like you're making progress.
And you are — at one very specific skill. But because the metrics are so satisfying, you keep doing more of the same thing. You do another 50 cards instead of having a 15-minute conversation with a tutor. You optimize your intervals instead of shadowing a podcast.
"You can't improve what you don't measure." True. But the corollary is just as important: you won't improve what you can't measure. If your tracker only counts flashcards, you'll only do flashcards.
This is why what you track matters more than how much you study.
The Output Practice Gap
Look at how most language learners spend their time:
📊 Typical learner's weekly breakdown
- Flashcards / vocab: 60-70% of study time
- Passive listening: 15-20%
- Reading: 10-15%
- Speaking practice: 0-5%
- Writing output: 0-2%
The activity that matters most for conversational fluency — actually producing the language — gets the least time. Not because people don't know it matters, but because it's harder, scarier, and there's no satisfying metric attached to it.
Nobody's going to screenshot their "spoke for 22 minutes today" stat and post it to Reddit. But they will screenshot a 365-day Anki streak.
This is a tracking problem, not a motivation problem.
What to Track Instead
The learners who actually reach conversational fluency — B2 and beyond — track fundamentally different metrics:
1. Hours by Activity Type
Not just "I studied for an hour" but what kind of hour. 60 minutes of Anki reviews is categorically different from 60 minutes of conversation practice. When you can see your weekly breakdown — speaking vs. reading vs. listening vs. vocab — you immediately notice the imbalance. And once you notice it, you fix it.
2. Output-to-Input Ratio
A healthy ratio for intermediate learners is roughly 30-40% output (speaking + writing) to 60-70% input (reading + listening + vocab). Most people are at 5/95. Just tracking this ratio and aiming for 30/70 can transform progress within weeks.
3. Speaking Minutes Per Week
This is the single most predictive metric for conversational improvement. Not cards reviewed. Not hours logged. Minutes spent producing the language out loud. Whether that's a tutor session, a language exchange, shadowing, or talking to yourself in the shower — it all counts.
4. Consistency Across Activities
A streak that only measures "did I open Anki" doesn't capture whether you're building a well-rounded practice. The useful question is: did I do at least two different activity types today? A session that includes 20 minutes of listening and 10 minutes of speaking is worth more than 60 minutes of flashcard grinding.
The Shadowing Fix
If you're coming from a flashcard-heavy routine and want to rebalance, here's the highest-leverage change: add 15 minutes of daily shadowing.
Shadowing means playing a native speaker clip, pausing, and repeating with identical rhythm and intonation. It bridges the gap between recognition and production by forcing your mouth to execute what your brain knows. It trains processing speed, pronunciation, and the phonological loop simultaneously.
Track it as "speaking practice." Because that's exactly what it is.
The Reddit poster with 4,000 Anki cards didn't need more vocabulary. They needed 15 minutes a day of making sounds with their mouth. And a tracker that showed them the gap before it became a two-year problem.
Flashcards Aren't Bad. They're Incomplete.
Let's be clear: Anki is an incredible tool. Spaced repetition is one of the most well-researched learning techniques in cognitive science. Flashcards are excellent for building a core vocabulary of 1,500–2,000 words fast, which gives you raw material for conversation.
The mistake is thinking the vocabulary is the language.
Use flashcards to build your word bank. Use conversation, shadowing, and output practice to turn that word bank into a language you can actually speak. And use your tracker to make sure you're doing both — not just the comfortable one.
The best language learners aren't the ones with the most Anki cards. They're the ones who can see exactly where their practice is unbalanced — and fix it before it costs them two years.
What This Means for You
If you recognized yourself in that Reddit post — high vocab retention, low speaking ability — you don't need to throw out your flashcards. You need to:
- Track activity types, not just time. Know how many minutes you spend on speaking vs. everything else.
- Set an output target. Even 15 minutes of speaking practice per day changes the trajectory.
- Review your balance weekly. If speaking is under 20% of your total study time, something needs to shift.
- Make output practice visible. Log it. Track it. Watch it grow. The same psychology that makes Anki streaks addictive works for speaking practice too — if you have the right tracker.
4,000 flashcards is impressive. But the real question isn't how many words you can recognize. It's how many minutes this week did you spend producing language out loud?
Track that number. Everything else follows.