Definition

Retrieval-Augmented Generation (RAG)

A technique that grounds an LLM's output in a specific document corpus by retrieving relevant context before generation.

The Full Definition

Retrieval-Augmented Generation is an architecture that injects relevant context into a language model's prompt at inference time. Rather than relying solely on what the model learned during training, a RAG system maintains a searchable index of your organization's documents and retrieves the most relevant chunks for each query — then asks the LLM to answer using only that retrieved context. The result is an AI that can cite specific sources, stay current with your latest data, and avoid the hallucinations that come from open-ended generation.

Why It Matters

For most business AI use cases — internal Q&A, customer support, policy lookup, document search — RAG is the right architecture. It lets you ship AI grounded in your organization's knowledge without the cost and complexity of fine-tuning, and it lets that knowledge update in real time as your documents change.

How This Shows Up in Practice

A law firm builds a RAG system over its precedent library. When an attorney asks "Have we ever litigated a non-compete in California for a sales VP?", the system retrieves the three closest historical matters and generates an answer that cites them directly. The firm's actual case history grounds every answer — and updating the corpus is as easy as dropping new documents into a folder.

Common Questions

When should I use RAG vs fine-tuning?

Use RAG when the goal is to ground an AI in specific, changing knowledge. Use fine-tuning when the goal is to teach the model a specific style, format, or task. Most production systems combine both — fine-tuning for behavior, RAG for facts.

Does RAG eliminate hallucinations?

It dramatically reduces them but doesn't eliminate them. Well-designed RAG systems include source citation and answer confidence so that any hallucination is easy to spot.

Related Terms

Want to put this to work?

A free process audit maps where retrieval-augmented generation (rag) — and the rest of the modern AI stack — actually move the needle in your business.

Survey My Business