GPT-2 → GPT-3 → frontier
The idea: The same architecture scaled by orders of magnitude.
What you'll be able to do: You can explain that frontier models are the same architecture, just scaled enormously.
The problem it solves: How big is 'big', really?
Builds on: Scaling laws
← Quantization · Next: The model thinks before it answers →
All lessons