The self-correction loop: test, fix, repeat
The idea: Give the agent a way to check itself — run the tests, read the failure, fix, re-run — and the test suite becomes its ground truth. Verification, not generation, is the real bottleneck.
What you'll be able to do: You can explain the agentic test→fix loop and why giving an agent a way to verify itself is what makes it reliable.
The problem it solves: An agent's first edit is often wrong. So why do agents still get reliable?
Builds on: Inside one turn: reason, then act
← Inside one turn: reason, then act · Next: Plan first, or feel your way? →
All lessons