Google AI: Release Notes

Google AI: Release Notes
Podcast Description
Ever wondered what it's really like to build the future of AI? Join host Logan Kilpatrick for a deep dive into the world of Google AI, straight from the minds of the builders. We're pulling back the curtain on the latest breakthroughs, sharing the unfiltered stories behind the tech, and answering the questions you've been dying to ask.
Whether you're a seasoned developer or an AI enthusiast, this podcast is your backstage pass to the cutting-edge of AI technology. Tune in for:
- Exclusive interviews with AI pioneers and industry leaders.
- In-depth discussions on the latest AI trends and developments.
- Behind-the-scenes stories and anecdotes from the world of AI.
- Unfiltered insights and opinions from the people shaping the future.
So, if you're ready to go beyond the headlines and get the real scoop on AI, join Logan Kilpatrick on Google AI: Release Notes.
Podcast Insights
Content Themes
The podcast centers around the latest advancements in artificial intelligence, with themes including the development of Google's Gemini models and the challenges of deploying large language models. Episodes delve into topics such as Gemini 2.0's multimodal capabilities and the innovative Flash 8B release, providing listeners with a comprehensive understanding of the AI landscape.

Ever wondered what it’s really like to build the future of AI? Join host Logan Kilpatrick for a deep dive into the world of Google AI, straight from the minds of the builders. We’re pulling back the curtain on the latest breakthroughs, sharing the unfiltered stories behind the tech, and answering the questions you’ve been dying to ask.
Whether you’re a seasoned developer or an AI enthusiast, this podcast is your backstage pass to the cutting-edge of AI technology. Tune in for:
– Exclusive interviews with AI pioneers and industry leaders.
– In-depth discussions on the latest AI trends and developments.
– Behind-the-scenes stories and anecdotes from the world of AI.
– Unfiltered insights and opinions from the people shaping the future.
So, if you’re ready to go beyond the headlines and get the real scoop on AI, join Logan Kilpatrick on Google AI: Release Notes.
Ani Baddepudi, Gemini Model Behavior Product Lead, joins host Logan Kilpatrick for a deep dive into Gemini’s multimodal capabilities. Their conversation explores why Gemini was built as a natively multimodal model from day one, the future of proactive AI assistants, and how we are moving towards a world where “everything is vision.” Learn about the differences between video and image understanding and token representations, higher FPS video sampling, and more.
Chapters:
0:00 – Intro
1:12 – Why Gemini is natively multimodal
2:23 – The technology behind multimodal models
5:15 – Video understanding with Gemini 2.5
9:25 – Deciding what to build next
13:23 – Building new product experiences with multimodal AI
17:15 – The vision for proactive assistants
24:13 – Improving video usability with variable FPS and frame tokenization
27:35 – What’s next for Gemini’s multimodal development
31:47 – Deep dive on Gemini’s document understanding capabilities
37:56 – The teamwork and collaboration behind Gemini
40:56 – What’s next with model behavior
Watch on YouTube: https://www.youtube.com/watch?v=K4vXvaRV0dw

Disclaimer
This podcast’s information is provided for general reference and was obtained from publicly accessible sources. The Podcast Collaborative neither produces nor verifies the content, accuracy, or suitability of this podcast. Views and opinions belong solely to the podcast creators and guests.
For a complete disclaimer, please see our Full Disclaimer on the archive page. The Podcast Collaborative bears no responsibility for the podcast’s themes, language, or overall content. Listener discretion is advised. Read our Terms of Use and Privacy Policy for more details.