Week 11 Part III · Architectures & Representation Learning
Instructor lesson plan: lecture (3 h) and practice (2 h).
| 0:00–0:10 | 10 min | Recap & retrievalOpen with two quick questions on last week's material (retrieval practice), then state this week's objectives. |
| 0:10–0:25 | 15 min | MotivationGates: a learned mechanism to keep or forget information across long sequences. |
| 0:25–1:10 | 45 min | LSTM and GRU
|
| 1:10–1:20 | 10 min | Break |
| 1:20–2:05 | 45 min | Sequence tasks
|
| 2:05–2:35 | 30 min | Live demo (predict, then run)Ask the class to predict whether the LSTM or the plain RNN holds the long-range signal better before comparing them. Swap an RNN for an LSTM, inspect the gates and cell state, and compare long- versus short-sequence gradients. |
| 2:35–2:50 | 15 min | Wrap-up & practice previewRevisit the misconception and concept checks below, recap the takeaways, and preview the practice lesson. |
| 2:50–3:00 | 10 min | Buffer & questions |
Students often think: LSTMs beat plain RNNs because they are bigger and have more parameters.
Set it straight: It is the cell state’s near-linear, gated path, not the parameter count, that preserves gradients across long sequences; the gates learn what to keep and forget.
In the practice lesson the instructor demonstrates implementations, runs code, and works through examples, using the practice notebook linked below. The weekly lab is then set as homework, where students apply this themselves.
| 0:00–0:10 | 10 min | Setup & recapRecap the lecture's key ideas and open the working notebook. |
| 0:10–1:00 | 50 min | Instructor demonstrations
|
| 1:00–1:05 | 5 min | Break |
| 1:05–1:45 | 40 min | Instructor demonstrations (continued)
|
| 1:45–2:00 | 15 min | Wrap-up & lab briefSummarize the patterns shown and brief the weekly lab (homework), which students complete on their own. |
Open the practice notebook in Colab Curated references Lab (homework)