What is retrieval-augmented generation (RAG)?

Question

Accepted Answer

RAG is how an AI answers from your documents instead of only its training. When you ask a question, the system searches your content for the most relevant passages, pastes them into the model's context, and asks the model to answer using them. The model never memorized your data. It reads the retrieved text at answer time. That is why RAG can cite sources and stay current, and why most RAG failures are really retrieval failures: if the right passage was not fetched, the model cannot use it.

RAG, explained interactively

What people get wrong

Where you see it in real products

Related explainers