RAG Evaluation - A Step-by-Step Guide with DeepEval
In my previous Evaluating RAG pipelines post, I introduced two approaches to evaluating RAG pipelines. In this post, I will show you how to implement these two approaches in detail. The implementation will naturally depend on the framework you use. In my case, I’ll be using DeepEval, an open-source evaluation framework.
Approach 1: Evaluating Retrieval and Generator separately As a recap, in this approach, you evaluate the retriever and generator of the RAG pipeline separately with their own separate metrics.
Read More →