Using Retrieval Augmented Generation (RAG) with Sitecore Search to ground GPT queries

A presentation at SUGCON India 2024 in in Bengaluru, Karnataka, India by Rob Earlam

In this session Rob will show how you can use Retrieval Augmented Generation (RAG) to pull data from Sitecore Search and ground queries against Large Language Models (LLMs) like GPT-4. This ensures the LLM only uses contextually relevant information when generating responses, giving greater control over what is returned to the end user.

Rob will show how you can use data indexed in Sitecore Search to achieve this quickly and easily in a ChatBot implementation – it will leverage queries against Sitecore Search to ground requests made into an LLM. This will be implemented on Sitecore’s Developer Portal, using the data indexed from all Sitecore’s different web properties to provide a contextually aware ChatBot experience. We will show how you can control what data is being used to generate responses, and as such provide a much more tailored experience to the users.

The Developer Portal is an open-source project, and all of the source code shown in the demo will be publicly available to the attendees.