Question 1

What is "Scaling laws" about?

Accepted Answer

Loss falls as a power law in params, data, compute (Chinchilla).

Question 2

What problem does it solve?

Accepted Answer

Does throwing more at it predictably help?

Question 3

What will I be able to do after this lesson?

Accepted Answer

You can explain scaling laws: loss falls predictably with compute, but predicts loss not skills.

Question 4

What comes next?

Accepted Answer

How to add parameters without paying for them every token?