Gen AI Evaluation Service - Model-Based Metrics
In the Gen AI Evaluation Service - An Overview post, I introduced Vertex AI’s Gen AI evaluation service and talked about the various classes of metrics it supports. In the Gen AI Evaluation Service - Computation-Based Metrics post, we delved into computation-based metrics, what they provide, and discussed their limitations. In today’s third post of the series, we’ll dive into model-based metrics.
The idea of model-based metrics is to use a judge model to evaluate the output of a candidate model.
Read More →