THE MACHINERY OF DEEP LEARNING
How Skill Compounds When You Stop Trying to Learn
The Field Manual for Acquisition That Actually Works
There is a learning problem most people never solve.
You study for hours. You highlight. You reread. You feel productive. You take the test. Half of it is gone. You study again. It comes back, barely, and leaves faster than before. The hours accumulate. The capability does not.
Then you watch someone learn the same material in a third of the time. They seem relaxed about it. They space it out. They do not highlight anything. They close the book and go for a walk. Two months later they still have it and you do not.
The difference is not talent. It is not discipline. It is not intelligence.
The difference is that one person is fighting the architecture of their brain and the other is working with it.
Surface learning feels hard because it violates how the brain acquires capability. Deep learning feels easy because it does not. The feeling of effort is not a signal that learning is happening. It is often a signal that learning is being prevented.
This writing is the machinery. The mechanism underneath. And the forms that work with it instead of against it.
THE MECHANISM
What the Brain Is Actually Doing
Learning is not storage. It is construction.
The brain does not record information like a hard drive. It builds associative structures. Each new piece of information is encoded not by its content but by how deeply it connects to what is already there.
Craik and Lockhart mapped this in 1972. Three processing levels on a continuum:
Shallow processing. Noticing the surface. What does this word look like. Is it capitalized. Is it in bold. Almost no memory trace forms.
Intermediate processing. Noticing the sound. Does this rhyme with something. How is it pronounced. A slightly better trace. Still fragile.
Deep processing. Processing for meaning. How does this connect to what I already know. What does this imply. What would change if this were false. A durable, retrievable trace with multiple access paths.
The difference in retention between shallow and deep processing is not incremental. It is categorical. Deep processing produces memory traces that are qualitatively different from shallow ones. More connections. More retrieval routes. More resistance to decay.
THE ENCODING DEPTH SPECTRUM
SHALLOW DEEP
◄──────────────────────────────────────────────►
"is it in "does it "how does this
uppercase?" rhyme?" relate to X?"
Visual form Sound form Meaning
processed processed processed
░░ ████ ████████████
trace strength trace strength trace strength
Decays in Decays in Persists for
minutes hours months to years
One retrieval Few retrieval Many retrieval
path paths paths
The implication is immediate. Every minute spent highlighting is shallow processing. Every minute spent rereading is shallow processing. Every minute spent summarizing in someone else’s words is shallow processing. The activity feels like learning because the material becomes familiar. Familiarity is not learning. Familiarity is the brain recognizing the surface features of something it has not deeply encoded.
The Bjork Paradox
Robert and Elizabeth Bjork discovered the mechanism that explains why the most effective learning feels the least productive.
Every memory has two independent strengths:
Storage strength. How deeply the memory is woven into your existing knowledge network. How many connections it has. How structurally embedded it is. Storage strength only goes up. Once something is deeply encoded, the encoding persists.
Retrieval strength. How easily you can access the memory right now. How quickly it comes to mind. How available it feels. Retrieval strength rises and falls with use, context, and time.
These two strengths are often inversely correlated during the act of learning.
When retrieval strength is high, the material feels easy. It comes to mind quickly. It feels learned. But the actual learning gain from re-engaging with it is low. The brain does not need to do any work. No new connections form. No reconstruction happens.
When retrieval strength is low, the material feels forgotten. It takes effort to reconstruct. It feels like failure. But the effort of reconstruction triggers a massive gain in storage strength. The brain is forced to rebuild access pathways. Each rebuild strengthens the structure.
THE BJORK PARADOX
┌─────────────────────────────────────────────┐
│ │
│ RETRIEVAL STRENGTH HIGH │
│ (feels easy, feels learned) │
│ │
│ Learning gain: LOW │
│ │
│ The brain has nothing to do. │
│ No reconstruction needed. │
│ No new connections form. │
│ This is rereading. This is cramming. │
│ This is the illusion of competence. │
│ │
└─────────────────────────────────────────────┘
┌─────────────────────────────────────────────┐
│ │
│ RETRIEVAL STRENGTH LOW │
│ (feels hard, feels forgotten) │
│ │
│ Learning gain: HIGH │
│ │
│ The brain must reconstruct. │
│ Reconstruction builds new pathways. │
│ Each pathway strengthens storage. │
│ This is spaced retrieval. │
│ This is actual learning. │
│ │
└─────────────────────────────────────────────┘
The feeling of productivity and the
fact of learning point in opposite
directions. This is the trap.
This is why students who reread their notes feel confident going into the exam and perform poorly. The rereading maintained high retrieval strength (the material felt familiar) without building storage strength (the material was not reconstructed). Dunlosky’s meta-analysis confirmed it: rereading and highlighting are among the least effective study techniques despite being the most popular.
The techniques that work feel wrong. They feel like forgetting. They feel like struggle. They feel like failure. They are not failure. They are the brain doing the construction work that produces permanent capability.
The Compounding Engine
This is where deep learning separates permanently from surface learning. This is the mechanism that makes it exponential.
Chase and Simon studied chess in 1973. They found that expert players do not have better memories. They have better patterns. A grandmaster stores roughly 50,000 to 100,000 meaningful configurations in long-term memory. When they see a board position, they do not see 32 individual pieces. They see 5 or 6 familiar patterns.
A novice holds 7 plus or minus 2 individual pieces in working memory. A grandmaster holds 7 plus or minus 2 chunks, each containing dozens of elements.
The grandmaster is not smarter. They have more infrastructure.
This is schema construction. A schema is a generalized framework that preserves structural relationships while discarding surface details. It is a template with slots. Once constructed, a schema does three things:
First, it reduces cognitive load. What once required conscious processing of multiple elements now occupies a single slot in working memory. The brain freed up capacity.
Second, it enables transfer. A schema built from one domain can be applied to structurally similar problems in other domains. The schema IS the transfer mechanism.
Third, it creates infrastructure for the next schema. Complex schemas are built from simpler schemas. Each deeply learned concept becomes a building block for the next one. A musician who has chunked scales, chord progressions, and rhythmic patterns can learn a new piece by recognizing it as a configuration of existing chunks. A beginner must process each note individually.
THE COMPOUNDING ENGINE
NOVICE EXPERT
Working memory: Working memory:
7 individual elements 7 chunks
┌──┐┌──┐┌──┐┌──┐┌──┐┌──┐┌──┐ ┌──────┐┌──────┐┌──────┐
│a ││b ││c ││d ││e ││f ││g │ │ abcd ││ efgh ││ ijkl │
└──┘└──┘└──┘└──┘└──┘└──┘└──┘ └──────┘└──────┘└──────┘
┌──────┐┌──────┐┌──────┐
│ mnop ││ qrst ││ uvwx │
└──────┘└──────┘└──────┘
┌──────┐
│ yz.. │
└──────┘
Total capacity: Total capacity:
7 elements 28+ elements
Each hour of deep Each schema makes the
learning builds next hour of learning
new schemas. more productive.
This is the compounding.
Linear input. Exponential output.
Every hour of deep learning builds schemas. Every schema reduces the cognitive load of future learning. Every reduction in cognitive load frees capacity for more schema construction. The cycle accelerates. The expert finds advanced material “easy” not because they are gifted but because their infrastructure handles the load.
Every hour of surface learning builds isolated traces. Isolated traces do not reduce future cognitive load. They do not free capacity. They do not compound. The next hour of surface learning is exactly as hard as the first.
The Consolidation Engine
Half of deep learning happens while you are not learning.
Synaptic consolidation begins within minutes of encoding. Long-term potentiation physically strengthens the connections between neurons that fired together during learning. The synapse changes shape. It becomes faster. More efficient.
But the larger mechanism is systems consolidation. Over hours to days, memory is transferred from the hippocampus (temporary, context-bound storage) to the neocortex (permanent, generalized storage). This transfer is not a copy. It is a transformation. Specific episodes become general schemas. Context-bound memories become transferable knowledge.
Sleep is the primary consolidation engine.
During slow-wave sleep, the hippocampus replays the day’s neural firing patterns. This is not random review. It is selective. Memories tagged as important or emotionally significant get replayed preferentially. The replay coordinates with thalamic spindles and cortical slow oscillations to redistribute traces from hippocampus to neocortex.
The synaptic homeostasis hypothesis adds a critical detail. During waking hours, learning potentiates synapses broadly. Many connections get strengthened. During sleep, selective weakening occurs. Weak, incidental traces get pruned. Strong, repeatedly activated traces survive. The signal-to-noise ratio improves.
Sleep does not just save memories. Sleep separates what matters from what does not.
The incubation effect extends this into waking rest. Setting a problem aside activates the Default Mode Network, which is not idleness. The DMN reorganizes the groundwork laid during active engagement, free from the limitations of working memory. Solutions appear during walks, showers, transitions. The unconscious processing was doing the structural work that conscious effort could not.
THE TWO-PHASE LEARNING CYCLE
PHASE 1: ENGAGEMENT PHASE 2: CONSOLIDATION
(you are aware of this) (you are not)
┌──────────────────────┐ ┌──────────────────────┐
│ │ │ │
│ Deep encoding │ │ Hippocampal replay │
│ Schema extension │ ────► │ Synaptic pruning │
│ Retrieval practice │ │ Neocortical transfer │
│ Pattern extraction │ │ Schema generalization│
│ │ │ │
│ Time: 60-120 min │ │ Time: hours to days │
│ State: focused │ │ State: sleep / rest │
│ Cost: metabolic │ │ Cost: zero effort │
│ │ │ │
└──────────────────────┘ └──────────────────────┘
Most people optimize Phase 1 and
sabotage Phase 2. They study until
midnight, sleep four hours, and wonder
why nothing stuck. They eliminated the
engine that does the actual construction.
THE TEMPLATE
The Deep Learning Cycle
Every domain. Every skill. Every body of knowledge. The form that produces compounding acquisition has four components.
THE DEEP LEARNING TEMPLATE
┌─────────────────────────────────────────────┐
│ │
│ COMPONENT 1: ENGAGE FOR MEANING │
│ │
│ Process for connection, not familiarity. │
│ Ask: "how does this relate to what I │
│ already know?" not "do I recognize this?" │
│ │
│ Generate, do not consume. │
│ Explain it. Draw it. Teach it. │
│ Use your own words, not the source's. │
│ │
├─────────────────────────────────────────────┤
│ │
│ COMPONENT 2: SPACE AND RETRIEVE │
│ │
│ Wait until retrieval strength drops. │
│ Then retrieve without looking. │
│ The struggle IS the learning. │
│ │
│ Interval: ~10-20% of desired retention. │
│ Retain for 1 month → review after 3 days. │
│ Retain for 1 year → review after 1 month. │
│ │
├─────────────────────────────────────────────┤
│ │
│ COMPONENT 3: INTERLEAVE │
│ │
│ Mix related-but-different material. │
│ Do not repeat 20 of the same type. │
│ The switching forces schema extraction. │
│ "What makes type A different from type B?" │
│ is the question that builds the template. │
│ │
├─────────────────────────────────────────────┤
│ │
│ COMPONENT 4: CONSOLIDATE │
│ │
│ Sleep. Walk. Do nothing related. │
│ The brain finishes the construction │
│ you started. Do not interrupt it. │
│ │
│ 90-120 minutes of focused engagement. │
│ Then stop. Not because you are tired. │
│ Because the neurochemistry has a window │
│ and pushing past it degrades encoding. │
│ │
└─────────────────────────────────────────────┘
These four components are not a method. They are the mechanism. Every effective learning system in history encodes some version of them. Spaced repetition systems formalize component 2. The Socratic method formalizes component 1. Interleaved curricula formalize component 3. Apprenticeship models build in component 4 through the natural rhythm of work and rest.
The systems that fail ignore one or more components. Lectures ignore 1 (passive consumption). Cramming ignores 2 and 4 (no spacing, no consolidation). Blocked practice ignores 3 (no interleaving). All-nighters ignore 4 (no sleep).
THE PHRASEBOOK
Surface vs. Deep
The same study hour can be surface or deep. The difference is not what material you engage with. It is how you engage with it.
SURFACE DEEP
(common, feels productive, (uncommon, feels uncertain,
builds almost nothing) builds everything)
Reread the chapter Close the book, write
what you remember
Highlight the key terms Explain each term to
yourself without looking
Copy the solved example Cover the solution, try
to solve it, then compare
Watch the lecture again Pause the lecture, predict
what comes next, then check
Read your notes before Put the notes away, write
the exam a practice exam, take it
Study one topic for Alternate between three
two hours straight related topics every
30 minutes
Study until midnight Study for 90 minutes,
sleep 8 hours, study
again tomorrow
Group similar problems Mix problem types randomly
Review immediately Wait two days, then retrieve
after learning without looking
Feel confident and Feel uncertain and
prepared effortful
The left column is what most people do. It is what most educational systems teach. It is what feels right.
The right column is what the brain needs. It is what almost nobody does. It is what feels wrong.
DOMAIN APPLICATIONS
Learning a Language
SURFACE DEEP
Review flashcards with See the word in your
the translation visible target language, try to
recall meaning before
flipping
Study vocabulary lists Encounter words in
grouped by theme sentences, mixed with
(animals, then colors, other themes, then try
then numbers) to produce sentences
using them
Listen to a podcast Listen to 3 minutes,
for 45 minutes pause, try to summarize
straight what was said, then
continue
Grammar drills: Grammar drills: mix
20 past tense, past tense, present,
then 20 present conditional randomly
The language learner who spaces vocabulary review across days and mixes grammar forms in a single session will outperform the one who crams 200 words and does 50 identical conjugations. The first feels chaotic. The second feels organized. The first is working with the architecture.
Learning an Instrument
SURFACE DEEP
Play the whole piece Isolate the hard passage,
start to finish, ten play it slowly, then
times interleave with other
passages
Practice scales for Practice scales in one
30 minutes, then key for 5 minutes, switch
arpeggios for 30 to arpeggios in another
minutes key, switch to a third
scale, rotate
Play along with the Play from memory, check
sheet music every time the sheet only when stuck,
mark the stuck points
Practice the same Practice on Monday, skip
piece every day Tuesday, retrieve on
for a week Wednesday, skip Thursday
and Friday, play Saturday
The musician who interleaves and spaces will progress faster despite practicing fewer total hours. Each session forces reconstruction. Each reconstruction deepens the encoding. The compounding begins within weeks.
Learning a Technical Skill
SURFACE DEEP
Read the documentation Read one section, close
cover to cover the docs, try to use the
feature from memory
Follow the tutorial Follow step 1, then try
step by step to predict step 2 before
reading it
Copy code from Study the example, close
examples it, write your own version,
compare
Study one framework Study framework A for
for a week, then 30 minutes, switch to
the next framework B, compare their
approaches to the same
problem
Read Stack Overflow Read the question, try to
answers answer it yourself before
reading the answers
The developer who predicts before reading, retrieves before checking, and compares across frameworks will build transferable understanding. The one who follows tutorials will build fragile procedural memory that breaks when the context changes.
Physical Skill Acquisition
SURFACE DEEP
Drill one technique Interleave three related
50 times in a row techniques, 15 reps each,
randomly ordered
Practice in the same Vary the conditions: speed,
conditions every time angle, distance, resistance
Train for 3 hours Train for 75 minutes with
straight full focus, rest, return
the next day
Repeat until it Repeat until you can
feels automatic explain what you are
doing and why
Blocked repetition creates an illusion of mastery. The movement feels smooth by rep 40. But the smoothness is contextual. Change the context and the smoothness disappears. Interleaving forces the nervous system to solve the selection problem (which technique applies here?) in addition to the execution problem. Both get encoded.
ERROR MODES
Error 1: The Fluency Trap
You reread something and it feels familiar. You mistake the familiarity for understanding. You move on. The understanding was never constructed.
The fluency trap is the most common and most destructive learning error. It is the reason students feel prepared and perform poorly. The brain’s fluency signal (this processes smoothly, therefore I know it) is not a learning signal. It is a recognition signal. Recognition and recall are different systems.
The fix: never assess learning by how familiar something feels. Assess it by whether you can reconstruct it without the source in front of you. If you cannot reconstruct it, you have not learned it. Regardless of how familiar it feels.
Error 2: The Effort Inversion
You study hard for six hours. You feel exhausted. You assume the exhaustion means learning happened. The exhaustion means metabolic resources were consumed. It says nothing about whether schemas were constructed.
Six hours of rereading is more exhausting than two hours of spaced retrieval practice. The rereading consumed energy maintaining attention on familiar material (extraneous cognitive load). The retrieval practice consumed energy reconstructing from memory (germane cognitive load). Same exhaustion. Completely different outcome.
The fix: distinguish between the type of effort. Effort spent fighting the material into memory through repetition is extraneous. Effort spent reconstructing the material from partial traces is germane. Only germane effort produces learning.
Error 3: The Completion Illusion
You finished the book. You watched all the lectures. You completed the course. You have a certificate. You feel done.
Consuming material to the end is not learning. It is exposure. Exposure is the raw material of learning, not the product. The product is constructed during retrieval, during spacing, during interleaving, during sleep. If those processes did not happen, the exposure produced nothing.
The fix: completion is not the metric. Retrieval is the metric. Can you reproduce the core ideas without the source? Can you apply them to a novel problem? Can you explain them in your own words? If not, the course is not finished regardless of what the progress bar says.
THE DEEPER GEOMETRY
Why the Brain Works This Way
The architecture makes sense once you see the optimization target.
The brain is not optimizing for storage. It is optimizing for retrieval in novel contexts. Information that can only be retrieved in the context where it was encoded is useless for survival. An animal that learned the predator’s pattern but could only recall it in the same meadow would die in the next meadow.
The brain builds for transfer. Every mechanism described in this document serves that goal:
Deep encoding creates multiple retrieval paths. More connections mean more contexts can trigger recall.
Spacing creates contextual variation. Each spaced session occurs in a different internal and external state. The memory gets encoded with multiple contextual cues. It becomes retrievable from diverse future situations.
Interleaving builds schemas that abstract away surface features. The schema captures structure, not context. Structure transfers. Context does not.
Consolidation transforms episodic traces into generalized knowledge. Hippocampal replay strips the contextual wrapper and transfers the structural core to the neocortex.
Sleep prunes the noise. What remains after pruning is the signal that generalizes.
The entire architecture is an optimization engine for producing knowledge that works in situations the learner has never encountered. Surface learning fails because it produces context-bound traces that do not generalize. Deep learning succeeds because every mechanism in the pipeline is aimed at producing transfer-ready schemas.
TWO LEARNING ARCHITECTURES
SURFACE LEARNING DEEP LEARNING
(fights the brain) (works with the brain)
┌──────────────┐ ┌──────────────┐
│ input │ │ input │
└──────┬───────┘ └──────┬───────┘
│ │
▼ ▼
┌──────────────┐ ┌──────────────┐
│ shallow │ │ deep │
│ encoding │ │ encoding │
│ (surface) │ │ (meaning) │
└──────┬───────┘ └──────┬───────┘
│ │
▼ ▼
┌──────────────┐ ┌──────────────┐
│ isolated │ │ connected │
│ trace │ │ schema │
└──────┬───────┘ └──────┬───────┘
│ │
▼ ▼
┌──────────────┐ ┌──────────────┐
│ no spacing │ │ spaced │
│ no retrieval│ │ retrieval │
└──────┬───────┘ └──────┬───────┘
│ │
▼ ▼
┌──────────────┐ ┌──────────────┐
│ trace │ │ schema │
│ decays │ │ strengthens │
│ alone │ │ + compounds │
└──────────────┘ └──────────────┘
Linear input Linear input
Linear output Exponential output
No compounding Full compounding
DRILLS
Drill 1: The Reconstruction (daily)
After every engagement session (reading, lecture, practice), close the source. Set a timer for 5 minutes. Write everything you can remember. Do not organize it. Do not make it neat. Just dump.
Compare what you wrote against the source. Mark what you missed.
The gaps are your curriculum. Focus next session’s engagement on the gaps, not on what you already recalled.
Do this every day for two weeks. You will notice two things: your reconstructions get longer, and the gaps get smaller. Not because you are memorizing more. Because your schemas are handling more of the content automatically.
Drill 2: The Spacing Calendar (weekly)
Pick one skill or body of knowledge you are actively learning. Create three touch points per week, spaced at least 48 hours apart.
Monday: engage with new material for 60-90 minutes. At the end, reconstruct without looking.
Wednesday: without reviewing, try to recall Monday’s material. Write it down. Check. Re-engage only with what was lost.
Saturday: without reviewing, try to recall everything from the week. Write it down. Check. The gaps from Saturday become Monday’s new material.
The weekly rhythm maps naturally to the forgetting curve. Monday’s encoding partially decays by Wednesday. The Wednesday retrieval rebuilds it stronger. Saturday’s retrieval catches what survived and flags what did not. The cycle repeats. Storage strength climbs with every iteration.
Drill 3: The Interleave (per session)
Take three related-but-different skills or concept types you are learning. In a single practice session, rotate between them every 10-15 minutes.
Learning three math concepts: 10 minutes on concept A, 10 on concept B, 10 on concept C, then back to A. Do not finish one before starting the next.
Learning a physical skill with three techniques: 5 reps of technique A, 5 of technique B, 5 of technique C, rotate. Do not do 50 of one.
This will feel worse than blocked practice. Performance during the session will be lower. Errors will be higher. The feeling of competence will be lower.
Retention two weeks later will be dramatically higher. The interleaving forced your brain to extract the structural differences between A, B, and C. That extraction IS schema construction. Blocked practice never triggers it.
Drill 4: The 90-Minute Window (per session)
Set a hard stop at 90 minutes of focused engagement. Not approximate. Hard.
The neurochemical cocktail that supports focused encoding (dopamine, norepinephrine, acetylcholine) has a finite supply per session. Pushing past 90-120 minutes degrades encoding quality. You are still consuming metabolic resources but producing weaker traces.
After the 90-minute window: walk, nap, do unrelated tasks. Do not engage with related material. The consolidation engine activates during the gap. Let it work.
If you have more to learn, wait at least 90 minutes before starting a second focused session. Two 90-minute sessions separated by rest produce more learning than one continuous 3-hour session.
Drill 5: The Prediction Game (per engagement)
Before reading the next section, paragraph, or chapter, pause. Predict what it will say. Write the prediction down.
Then read. Compare your prediction against what actually appeared.
When you predicted correctly: your existing schema already covers this territory. Move quickly.
When you predicted incorrectly: this is where new encoding needs to happen. Slow down. Process deeply. This mismatch between prediction and reality is the exact signal the brain uses to trigger deep encoding.
The prediction game converts passive consumption into active engagement automatically. It forces deep processing because prediction requires consulting your existing schemas. Every mismatch between prediction and reality is a learning opportunity that passive reading would have missed entirely.
STACKING
Five complementary moves that amplify the deep learning cycle:
MOVE 1: TEACH IT
Explain what you learned to someone else.
The generation effect means producing
information activates broader neural circuits
than consuming it. Teaching is the highest-
bandwidth form of generation.
MOVE 2: VARY THE CONTEXT
Study in different locations. At different
times. In different states. Each variation
adds a contextual cue to the memory trace.
More cues means more retrieval paths.
MOVE 3: TEST BEFORE YOU FEEL READY
Take the practice exam before finishing
the material. Attempt the problem before
learning the technique. The errors are not
failures. They are encoding signals that
tell your brain what to prioritize.
MOVE 4: SLEEP ON IT
The single highest-leverage intervention
for learning is adequate sleep after
engagement. 7-9 hours. Non-negotiable.
Every hour of lost sleep degrades
consolidation measurably.
MOVE 5: NAME THE SCHEMA
After learning something, ask: "what is the
general principle here?" Give it a name.
Draw it. The act of abstracting from specific
to general IS schema construction. Making it
explicit accelerates what the brain would
otherwise do implicitly over weeks.
WHAT IT IS NOT
This is not a case for laziness.
Deep learning requires engagement. Real engagement. Processing for meaning. Retrieval under difficulty. Interleaving that feels chaotic. Prediction that requires concentration. The sessions are not long but they are intense.
This is not a case against effort.
Effort is essential. But the type of effort matters. Germane effort (reconstruction, schema-building, retrieval) produces learning. Extraneous effort (fighting familiarity through repetition, maintaining attention on passive review) does not. The mechanism distinguishes between these. Your feelings do not.
This is not a study hack.
Study hacks optimize the wrong variable. They try to make surface learning more efficient. Surface learning cannot be made efficient. It is structurally incapable of compounding. No amount of optimization makes a non-compounding process compound. The shift is not from inefficient surface learning to efficient surface learning. It is from surface to deep. Different architecture entirely.
This is not about intelligence.
Deep learning works the same way in every brain. The mechanisms (encoding depth, spacing effect, schema construction, consolidation) are universal. They are not modulated by IQ. A person of average intelligence who learns deeply will outperform a person of high intelligence who learns on the surface. The infrastructure compounds. The raw processing power does not.
THE SINGLE SENTENCE
┌─────────────────────────────────────────────┐
│ │
│ PROCESS FOR MEANING. │
│ SPACE AND RETRIEVE. │
│ MIX WHAT IS RELATED. │
│ SLEEP WHILE IT BUILDS. │
│ │
│ Everything that feels like learning │
│ probably is not. │
│ Everything that feels like forgetting │
│ probably is. │
│ │
└─────────────────────────────────────────────┘
Citations
Levels of Processing
Craik, F. I. M., & Lockhart, R. S. (1972). “Levels of processing: A framework for memory research.” Journal of Verbal Learning and Verbal Behavior, 11(6), 671-684.
Desirable Difficulties
Bjork, R. A. (1994). “Memory and metamemory considerations in the training of human beings.” In J. Metcalfe & A. Shimamura (Eds.), Metacognition: Knowing about Knowing. MIT Press.
Bjork, E. L., & Bjork, R. A. (2011). “Making things hard on yourself, but in a good way: Creating desirable difficulties to enhance learning.” In M. A. Gernsbacher et al. (Eds.), Psychology and the Real World. Worth Publishers.
Storage and Retrieval Strength
Bjork, R. A., & Bjork, E. L. (1992). “A new theory of disuse and an old theory of stimulus fluctuation.” In A. Healy et al. (Eds.), From Learning Processes to Cognitive Processes: Essays in Honor of William K. Estes. Erlbaum.
Study Technique Effectiveness
Dunlosky, J., et al. (2013). “Improving Students’ Learning With Effective Learning Techniques.” Psychological Science in the Public Interest, 14(1), 4-58.
Testing Effect
Roediger, H. L., & Karpicke, J. D. (2006). “Test-Enhanced Learning: Taking Memory Tests Improves Long-Term Retention.” Psychological Science, 17(3), 249-255.
Schema Theory and Chunking
Chase, W. G., & Simon, H. A. (1973). “Perception in chess.” Cognitive Psychology, 4(1), 55-81.
Sweller, J. (1988). “Cognitive load during problem solving: Effects on learning.” Cognitive Science, 12(2), 257-285.
Memory Consolidation
Diekelmann, S., & Born, J. (2010). “The memory function of sleep.” Nature Reviews Neuroscience, 11(2), 114-126.
Tononi, G., & Cirelli, C. (2014). “Sleep and the Price of Plasticity: From Synaptic and Cellular Homeostasis to Memory Consolidation and Integration.” Neuron, 81(1), 12-34.
Spacing Effect
Cepeda, N. J., et al. (2006). “Distributed practice in verbal recall tasks: A review and quantitative synthesis.” Psychological Bulletin, 132(3), 354-380.
Interleaving
Rohrer, D., & Taylor, K. (2007). “The shuffling of mathematics problems improves learning.” Instructional Science, 35(6), 481-498.
Deliberate Practice
Ericsson, K. A., Krampe, R. T., & Tesch-Romer, C. (1993). “The role of deliberate practice in the acquisition of expert performance.” Psychological Review, 100(3), 363-406.
Flow State
Csikszentmihalyi, M. (1990). Flow: The Psychology of Optimal Experience. Harper & Row.
Generation Effect
Slamecka, N. J., & Graf, P. (1978). “The generation effect: Delineation of a phenomenon.” Journal of Experimental Psychology: Human Learning and Memory, 4(6), 592-604.