Question 1

What is "Below the vector: tokens" about?

Accepted Answer

Before meaning, text is split into tokens — common chunks (often sub-words) drawn from a fixed vocabulary an algorithm like BPE learned. The model only ever sees token ids, not characters.

Question 2

What problem does it solve?

Accepted Answer

Why does an AI fumble 'how many r's in strawberry?' — it isn't reading letters.

Question 3

What will I be able to do after this lesson?

Accepted Answer

You can explain tokenization (BPE): text → tokens → ids, and why it explains spelling quirks and token costs.

Question 4

What comes next?

Accepted Answer

Tokens become vectors — but what about images and sound?