Skip to content
AIAn Alian Software company

AI glossary

RAG (Retrieval-Augmented Generation)

Fetching relevant documents from a database first, then asking the model to answer using only those documents. The default architecture for grounded chatbots and knowledge assistants.

The longer version

Production RAG isn't just 'fetch chunks and stuff them in the prompt.' It's hybrid search (BM25 + vector) → reranking → citation-required prompting → refusal patterns when retrieval confidence is low. Hallucination drops 80%+ with the full pattern vs naive RAG. See /playbooks/rag for the full pattern.

Want to talk about how this applies to your stack?