A RAG model (Retrieval-Augmented Generation) combines text generation from a language model with an external knowledge source, such as a database, document archive, or semantic search engine.
Unlike traditional AI systems that rely solely on their training data, a RAG model can retrieve relevant information during the request and include it in the response. This leads to more accurate, up-to-date, and verifiable answers.
In AI search systems like ChatGPT with Bing or enterprise bots connected to internal documents, RAG models ensure that answers are grounded in real sources rather than assumptions.