Gradient descent: rolling downhill
The idea: Follow the slope of the loss downhill.
What you'll be able to do: You can explain how training works: roll downhill, and why step size matters.
The problem it solves: Millions of knobs: how to tune them all?
Builds on: Loss as a scoreboard
← Loss as a scoreboard · Next: Where the map comes from →
All lessons