Back to Basics for RAG

RAG

llm-conf-2024

Published

July 2, 2024

Abstract

Adding context-sensitive information to LLM prompts through retrieval is a popular technique to boost accuracy. This talk will cover the fundamentals of information retrieval (IR) and the failure modes of vector embeddings for retrieval and provide practical solutions to avoid them. Jo demonstrates how to set up simple but effective IR evaluations for your data, allowing faster exploration and systematic approaches to improving retrieval accuracy.

Subscribe For More Educational Content

If you enjoyed this content, subscribe to receive updates on new educational content for LLMs.

Chapters

00:00 Introduction and Background

01:19 RAG and Labeling with Retrieval

03:31 Evaluating Information Retrieval Systems

05:54 Evaluating Document Relevance

08:22 Metrics for Retrieval System Performance

10:11 Reciprocal Rank and Industry Metrics

12:41 Using Large Language Models for Judging Relevance

14:32 Microsoft’s Research on LLMs for Evaluation

17:04 Representational Approaches for Efficient Retrieval

19:14 Sparse and Dense Representations

Back to Basics for RAG

Chapters

Slides

Resources

Full Transcript