GenAI on Atamel.Dev

GenAI on Atamel.Dev https://atamel.dev/tags/genai/ Recent content in GenAI on Atamel.Dev Hugo -- gohugo.io atamel@gmail.com (Mete Atamel) atamel@gmail.com (Mete Atamel) Fri, 19 Jul 2024 00:00:00 +0000 Control LLM costs with context caching https://atamel.dev/posts/2024/07-19_control_llm_costs_context_caching/ Fri, 19 Jul 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/07-19_control_llm_costs_context_caching/ Introduction Some large language models (LLMs), such as Gemini 1.5 Flash or Gemini 1.5 Pro, have a very large context window. This is very useful if you want to analyze a big chunk of data, such as a whole book or a long video. On the other hand, it can get quite expensive if you keep sending the same large data in your prompts. Context caching can help. Context caching is useful in reducing costs when a substantial context is referenced repeatedly by shorter requests such as: Control LLM output with response type and schema https://atamel.dev/posts/2024/07-15_control_llm_output/ Mon, 15 Jul 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/07-15_control_llm_output/ Introduction Large language models (LLMs) are great at generating content but the output format you get back can be a hit or miss sometimes. For example, you ask for a JSON output in certain format and you might get free-form text or a JSON wrapped in markdown string or a proper JSON but with some required fields missing. If your application requires a strict format, this can be a real problem. RAG API powered by LlamaIndex on Vertex AI https://atamel.dev/posts/2024/07-08_ragapi_llamaindex_vertexai/ Mon, 08 Jul 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/07-08_ragapi_llamaindex_vertexai/ Introduction Recently, I talked about why grounding LLMs is important and how to ground LLMs with public data using Google Search (Vertex AI’s Grounding with Google Search: how to use it and why) and with private data using Vertex AI Search (Grounding LLMs with your own data using Vertex AI Search). In today’s post, I want to talk about another more flexible and customizable way of grounding your LLMs with private data: the RAG API powered by LlamaIndex on Vertex AI. Grounding LLMs with your own data using Vertex AI Search https://atamel.dev/posts/2024/07-01_grounding_with_own_data_vertexai_search/ Mon, 01 Jul 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/07-01_grounding_with_own_data_vertexai_search/ Introduction In my previous Vertex AI’s Grounding with Google Search: how to use it and why post, I explained why you need grounding with large language models (LLMs) and how Vertex AI’s grounding with Google Search can help to ground LLMs with public up-to-date data. That’s great but you sometimes need to ground LLMs with your own private data. How can you do that? There are many ways but Vertex AI Search is the easiest way and that’s what I want to talk about today with a simple use case. Give your LLM a quick lie detector test https://atamel.dev/posts/2024/06-06_llm_lie_detector_test/ Thu, 06 Jun 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/06-06_llm_lie_detector_test/ Introduction It’s no secret that LLMs sometimes lie and they do so in a very confident kind of way. This might be OK for some applications but it can be a real problem if your application requires high levels of accuracy. I remember when the first LLMs emerged back in early 2023. I tried some of the early models and it felt like they were hallucinating half of the time. More recently, it started feeling like LLMs are getting better at giving more factual answers. Vertex AI's Grounding with Google Search - how to use it and why https://atamel.dev/posts/2024/05-29_using-vertex-ai-grounding-with-google-search/ Wed, 29 May 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/05-29_using-vertex-ai-grounding-with-google-search/ Introduction Once in a while, you come across a feature that is so easy to use and so useful that you don’t know how you lived without it before. For me, Vertex AI’s Grounding with Google Search is one of those features. In this blog post, I explain why you need grounding with large language models (LLMs) and how Vertex AI’s Grounding with Google Search can help with minimal effort on your part. A tour of Gemini 1.5 Pro samples https://atamel.dev/posts/2024/05-07_gemini_15_pro_samples/ Tue, 07 May 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/05-07_gemini_15_pro_samples/ Introduction Back in February, Google announced Gemini 1.5 Pro with its impressive 1 million token context window. Larger context size means that Gemini 1.5 Pro can process vast amounts of information in one go — 1 hour of video, 11 hours of audio, 30,000 lines of code or over 700,000 words and the good news is that there’s good language support. In this blog post, I will point out some samples utilizing Gemini 1. C# and Vertex AI Gemini streaming API bug and workaround https://atamel.dev/posts/2024/05-01_csharp_vertex_gemini_streaming_bug/ Wed, 01 May 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/05-01_csharp_vertex_gemini_streaming_bug/ A user recently reported an intermittent error with C# and Gemini 1.5 model on Vertex AI’s streaming API. In this blog post, I want to outline what the error is, what causes it, and how to avoid it with the hopes of saving some frustration for someone out there. Error The user reported using Google.Cloud.AIPlatform.V1 library with version 2.27.0 to use Gemini 1.5 via Vertex AI’s streaming API and running into an intermittent System. A Tour of Gemini Code Assist - Slides and Demos https://atamel.dev/posts/2024/04_24_tour_of_gemini_code_assist/ Wed, 24 Apr 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/04_24_tour_of_gemini_code_assist/ This week, I’m speaking at 3 meetups on Gemini Code Assist. My talk has a little introduction to GenAI and Gemini, followed by a series of hands-on demos that showcase different features of Gemini Code Assist. In the demos, I setup Gemini Code Assist in Cloud Code IDE plugin in Visual Studio Code. Then, I show how to design and create an application, explain, run, generate, test, transform code, and finish with understanding logs with the help of Gemini. Vertex AI Gemini generateContent (non-streaming) API https://atamel.dev/posts/2024/02-26_vertexai_gemini_generate_content_api/ Mon, 26 Feb 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/02-26_vertexai_gemini_generate_content_api/ Introduction In my recent blog post, I’ve been exploring Vertex AI’s Gemini REST API and mainly talked about the streamGenerateContent method which is a streaming API. Recently, a new method appeared in Vertex AI docs: generateContent which is the non-streaming (unary) version of the API. In this short blog post, I take a closer look at the new non-streaming generateContent API and explain why it makes sense to use as a simpler API when the latency is not super critical. Using Vertex AI Gemini from GAPIC libraries (C#) https://atamel.dev/posts/2024/02-14_vertexai_gemini_gapic_libraries/ Wed, 14 Feb 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/02-14_vertexai_gemini_gapic_libraries/ Introduction In my previous Using Vertex AI Gemini REST API post, I showed how to use the Gemini REST API from languages without SDK support yet such as C# and Rust. There’s actually another way to use Gemini from languages without SDK support: GAPIC libraries. In this post, I show you how to use Vertex AI Gemini from GAPIC libraries, using C# as an example. What is GAPIC? At this point, you might be wondering: What’s GAPIC? Using Vertex AI Gemini REST API (C# and Rust) https://atamel.dev/posts/2024/02-05_vertexai_gemini_restapi_csharp_rust/ Mon, 05 Feb 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/02-05_vertexai_gemini_restapi_csharp_rust/ Introduction Back in December, Google announced Gemini, its most capable and general model so far available from Google AI Studio andGoogle Cloud Vertex AI. The Try the Vertex AI Gemini API documentation page shows instructions on how to use the Gemini API from Python, Node.js, Java, and Go. That’s great but what about other languages? Even though there are no official SDKs/libraries for other languages yet, you can use the Gemini REST API to access the same functionality with a little bit more work on your part. Test and change an existing web app with Duet AI https://atamel.dev/posts/2024/01-29_duetai_test_change_existing_webapp/ Mon, 29 Jan 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/01-29_duetai_test_change_existing_webapp/ In the Create and deploy a new web app to Cloud Run with Duet AI post, I created a simple web application and deployed to Cloud Run using Duet AI’s help. Duet AI has been great to get a new and simple app up and running. But does it help for existing apps? Let’s figure it out. In this blog post, I take an existing web app, explore it, test it, add a unit test, add new functionality, and add more unit tests all with the help of Duet AI. Create and deploy a new web app to Cloud Run with Duet AI https://atamel.dev/posts/2024/01-23_duetai_create_deploy_webapp_clourun/ Tue, 23 Jan 2024 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2024/01-23_duetai_create_deploy_webapp_clourun/ I’ve been playing with Duet AI, Google’s AI-powered collaborator, recently to see how useful it can be for my development workflow. I’m pleasantly surprised how helpful Duet AI can be when provided with specific questions with the right context. In this blog post, I document my journey of creating and deploying a new web application to Cloud Run with Duet AI’s help. I also capture some lessons learned along the way to get the most out of Duet AI. C# library and samples for GenAI in Vertex AI https://atamel.dev/posts/2023/12-11_csharp_library_and_samples_genai_vertexai/ Mon, 11 Dec 2023 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2023/12-11_csharp_library_and_samples_genai_vertexai/ In my previous post, I talked about multi-language libraries and samples for GenAI. In this post, I want to zoom into some C# specific information for GenAI in Vertex AI. C# GenAI samples for Vertex AI If you want to skip this blog post and just jump into code, there’s a collection of C# GenAI samples for Vertex AI. These samples show how to invoke GenAI from C# for different use cases such as text classification, extraction, summarization, sentiment analysis and more using the C# client library. Multi-language libraries and samples for GenAI in Vertex AI https://atamel.dev/posts/2023/11-28_multilanguage_libs_samples_genai_vertexai/ Tue, 28 Nov 2023 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2023/11-28_multilanguage_libs_samples_genai_vertexai/ You might think that you need to know Python to be able to use GenAI with VertexAI. While Python is the dominant language in GenAI (and Vertex AI is no exception in that regard), you can actually use GenAI in Vertex AI from other languages such as Java, C#, Node.js, Go, and more. Let’s take a look at the details. Vertex AI SDK for Python The official SDK for Vertex AI is Vertex AI SDK for Python and as expected, it’s in Python. Generative AI Short Courses by DeepLearning.AI https://atamel.dev/posts/2023/07-04_genai_short_courses_by_deeplearning/ Tue, 04 Jul 2023 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2023/07-04_genai_short_courses_by_deeplearning/ Introduction In my previous couple posts (post1, post2), I shared my detailed notes on Generative AI Learning Path in Google Cloud’s Skills Boost. It’s a great collection of courses to get started in GenAI, especially on the theory underpinning GenAI. Since then, I discovered another great resource to learn more about GenAI: Learn Generative AI Short Courses by DeepLearning.AI from Andrew Ng. In this post, I summarize what each course teaches you to help you decide which course to take. Generative AI Learning Path Notes – Part 2 https://atamel.dev/posts/2023/06-15_genai_learningpath_notes_part2/ Thu, 15 Jun 2023 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2023/06-15_genai_learningpath_notes_part2/ If you’re looking to upskill in Generative AI, there’s a Generative AI Learning Path in Google Cloud Skills Boost. It currently consists of 10 courses and provides a good foundation on the theory behind Generative AI. As I went through these courses myself, I took notes, as I learn best when I write things down. In part 1 of the blog series, I shared my notes for courses 1 to 6. Generative AI Learning Path Notes – Part 1 https://atamel.dev/posts/2023/06-06_genai_learningpath_notes_part1/ Tue, 06 Jun 2023 00:00:00 +0000 atamel@gmail.com (Mete Atamel) https://atamel.dev/posts/2023/06-06_genai_learningpath_notes_part1/ If you’re looking to upskill in Generative AI (GenAI), there’s a Generative AI Learning Path in Google Cloud Skills Boost. It currently consists of 10 courses and provides a good foundation on the theory behind Generative AI and what tools and services Google provides in GenAI. The best part is that it’s completely free! As I went through these courses myself, I took notes, as I learn best when I write things down.