Atamel.Dev

Gen AI Evaluation Service - An Overview

Posted on 30 June 2025

Generating content with Large Language Models (LLMs) is easy. Determining whether the generated content is good is hard. That’s why evaluating LLM outputs with metrics is crucial. Previously, I talked about DeepEval and Promptfoo as some of the tools you can use for LLM evaluation. I also talked about RAG triad metrics specifically for Retrieval Augmented Generation (RAG) evaluation for LLMs. In the next few posts, I want to talk about a Google Cloud-specific evaluation service: the Gen AI evaluation service in Vertex AI. Read More →

GenAI VertexAI Gemini Google Cloud Platform

Evaluating RAG pipelines with the RAG triad

Posted on 14 May 2025

Retrieval-Augmented Generation (RAG) has emerged as a dominant framework for feeding Large Language Models (LLMs) context beyond the scope of their training data, enabling them to respond with more grounded answers and fewer hallucinations based on that context. However, designing an effective RAG pipeline can be challenging. You need to answer questions such as: How should you parse and chunk text documents for vector embedding? What chunk size and overlap size should you use? Read More →
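
To make the chunk size and overlap questions concrete, here is a minimal sketch of fixed-size chunking with overlap in plain Python. The `chunk_text` helper and its defaults are hypothetical and only for illustration; splitting on characters is an assumption, and real pipelines often split on tokens or sentences instead.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character chunks that share `overlap` characters.

    Hypothetical helper for illustration only; not taken from any RAG framework.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be non-negative and smaller than chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window already reaches the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


if __name__ == "__main__":
    for chunk in chunk_text("abcdefghij", chunk_size=4, overlap=1):
        print(chunk)
```

A larger overlap repeats more context across neighbouring chunks, which can help retrieval at chunk boundaries at the cost of storing and embedding more text.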

GenAI VertexAI Gemini Google Cloud Platform

DeepEval adds native support for Gemini as an LLM Judge

Posted on 29 April 2025

In my previous post on DeepEval and Vertex AI, I introduced DeepEval, an open-source evaluation framework for LLMs. I also demonstrated how to use Gemini (on Vertex AI) as an LLM Judge in DeepEval, replacing the default OpenAI judge to evaluate outputs from other LLMs. At that time, the Gemini integration with DeepEval wasn’t ideal and I had to implement my own integration. Thanks to the excellent work by Roy Arsan in PR #1493, DeepEval now includes native Gemini integration. Read More →

GenAI VertexAI Gemini Google Cloud Platform

Much simplified function calling in Gemini 2.X models

Posted on 8 April 2025

Last year, in my Deep dive into function calling in Gemini post, I talked about how to do function calling in Gemini. More specifically, I showed how to call two functions (location_to_lat_long and lat_long_to_weather) to get the weather information for a location from Gemini. It wasn’t difficult, but it involved a lot of steps for two simple function calls. I’m pleased to see that the latest Gemini 2.X models and the unified Google Gen AI SDK (which I talked about in my Gemini on Vertex AI and Google AI now unified with the new Google Gen AI SDK post) have made function calling much simpler. Read More →

GenAI VertexAI Gemini Google Cloud Platform

RAG with a PDF using LlamaIndex and SimpleVectorStore on Vertex AI

Posted on 24 March 2025

Previously, I showed how to do RAG with a PDF using LangChain and Annoy Vector Store and RAG with a PDF using LangChain and Firestore Vector Store. Both used a PDF as the RAG backend and used LangChain as the LLM framework to orchestrate RAG ingestion and retrieval. LlamaIndex is another popular LLM framework. I wondered how to set up the same PDF-based RAG pipeline with LlamaIndex and Vertex AI, but I didn’t find a good sample. Read More →