Evals

Measuring the performance of LLM products

Inspect, an OSS framework for LLM evals

This talk will cover using and extending Inspect, a new OSS Python…

LLM Eval For Text2SQL

Ankur from Braintrust discusses the systematic evaluation and enhancement…

A Deep Dive on LLM Evaluation

Doing LLM evaluation right is crucial, but very challenging! We’ll cover…