What is Retrieval-Augmented Generation (RAG)?

Mar 21, 2024

As we get more exposure to Large Language Models (LLMs), we’re also starting to see gaps in the accuracy and reliability of their responses. LLMs can hallucinate, providing responses that seem accurate at first glance but aren’t. One reason an LLM can hallucinate is that it’s unable to verify information against external sources. The LLM relies solely on its training data, without access to real-time, up-to-date content or databases.

Retrieval-Augmented Generation addresses the problem of unreliable or inaccurate responses by letting generative AI models draw on external data sources. After a user has prompted the LLM, a retrieval step looks up relevant content in a reputable data source, which can be open (e.g. Internet data) or closed (e.g. an internal knowledge database). The retrieved information is linked to the user’s query and passed to the LLM as context, so the response can be grounded in that source and can point to the source that supports it.
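To make the retrieve, augment, generate flow concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the `DOCUMENTS` list stands in for a real knowledge base, the word-overlap score stands in for a proper search index or embedding similarity, and `llm` is a hypothetical callable representing whichever model you would actually call.

```python
import re
from typing import Callable

# Hypothetical in-memory knowledge base; a real system would query a
# search index, vector store, or internal database instead.
DOCUMENTS = [
    "The refund policy allows returns within 30 days of purchase.",
    "Support is available by email from Monday to Friday.",
    "Premium plans include priority support and extra storage.",
]


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def retrieve(query: str, documents: list[str], top_k: int = 1) -> list[str]:
    """Retrieval step: rank documents by shared words with the query.

    Word overlap is a toy relevance score chosen for this sketch only.
    """
    query_tokens = set(tokenize(query))
    scored = [(len(query_tokens & set(tokenize(doc))), doc) for doc in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    # Drop documents that share no words with the query at all.
    return [doc for score, doc in scored[:top_k] if score > 0]


def build_prompt(query: str, context_docs: list[str]) -> str:
    """Augmentation step: place the retrieved text next to the question."""
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return (
        "Answer the question using only the context below, and cite it.\n"
        f"Context:\n{context}\n"
        f"Question: {query}"
    )


def answer(query: str, llm: Callable[[str], str]) -> str:
    """Generation step: hand the augmented prompt to the model."""
    context_docs = retrieve(query, DOCUMENTS)
    return llm(build_prompt(query, context_docs))
```

Calling `answer("What is the refund policy?", llm=my_model)` would send the model a prompt that already contains the refund-policy sentence, where `my_model` is a placeholder for any function that takes a prompt string and returns text. The point of the design is that the model no longer has to rely on its training data alone.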

**Image Credit:** Leonie Monigatti on Medium

Main learning point: When I hear about the art of creating and fine-tuning LLMs, I now also think about RAG. Retrieving information from predefined data sources is another aspect to consider when improving LLM reliability and accuracy.


Written by MAA1