Talk: Building the Next Generation of Voice-First Agentic AI using Gemini 3 Pro
By Gerard Sans
Generative AI has shifted gears. In this talk, we explore how Gemini 3 Pro supports multimodal generation (text, image, audio, video) and real-time agents. You’ll see how Google AI Studio connects the latest model families into a unified workflow built on “vibe coding” rather than hand-written boilerplate. We’ll also dive into the voice-first capabilities of the Gemini Live API: real-time, bidirectional voice and video conversations, tone-aware responses, tool integration, and session memory. Together, we’ll look at prompt-to-code flows, media generation, voice-first use cases, and the next generation of MCP-driven agentic AI.