Vertex AI Gemini generateContent (non-streaming) API
Introduction In my recent blog post, I’ve been exploring Vertex AI’s Gemini REST API and mainly talked about the streamGenerateContent method which is a streaming API.
Recently, a new method appeared in Vertex AI docs: generateContent which is the non-streaming (unary) version of the API.
In this short blog post, I take a closer look at the new non-streaming generateContent API and explain why it makes sense to use as a simpler API when the latency is not super critical.
Read More →