Google AI: Release Notes Podcast By Google AI cover art

Google AI: Release Notes

Google AI: Release Notes

By: Google AI
Listen for free

About this listen

Ever wondered what it's really like to build the future of AI? Join host Logan Kilpatrick for a deep dive into the world of Google AI, straight from the minds of the builders. We're pulling back the curtain on the latest breakthroughs, sharing the unfiltered stories behind the tech, and answering the questions you've been dying to ask. Whether you're a seasoned developer or an AI enthusiast, this podcast is your backstage pass to the cutting-edge of AI technology. Tune in for: - Exclusive interviews with AI pioneers and industry leaders. - In-depth discussions on the latest AI trends and developments. - Behind-the-scenes stories and anecdotes from the world of AI. - Unfiltered insights and opinions from the people shaping the future. So, if you're ready to go beyond the headlines and get the real scoop on AI, join Logan Kilpatrick on Google AI: Release Notes.2024 Google Science
Episodes
  • Gemini's Multimodality
    Jul 2 2025

    Ani Baddepudi, Gemini Model Behavior Product Lead, joins host Logan Kilpatrick for a deep dive into Gemini's multimodal capabilities. Their conversation explores why Gemini was built as a natively multimodal model from day one, the future of proactive AI assistants, and how we are moving towards a world where "everything is vision." Learn about the differences between video and image understanding and token representations, higher FPS video sampling, and more.

    Chapters:

    0:00 - Intro
    1:12 - Why Gemini is natively multimodal
    2:23 - The technology behind multimodal models
    5:15 - Video understanding with Gemini 2.5
    9:25 - Deciding what to build next
    13:23 - Building new product experiences with multimodal AI
    17:15 - The vision for proactive assistants
    24:13 - Improving video usability with variable FPS and frame tokenization
    27:35 - What’s next for Gemini’s multimodal development
    31:47 - Deep dive on Gemini’s document understanding capabilities
    37:56 - The teamwork and collaboration behind Gemini
    40:56 - What’s next with model behavior


    Watch on YouTube: https://www.youtube.com/watch?v=K4vXvaRV0dw

    Show more Show less
    44 mins
  • Building Gemini's Coding Capabilities
    Jun 16 2025

    Connie Fan, Product Lead for Gemini's coding capabilities, and Danny Tarlow, Research Lead for Gemini's coding capabilities, join host Logan Kilpatrick for an in-depth discussion on how the team built one of the world's leading AI coding models. Learn more about the early goals that shaped Gemini's approach to code, the rise of 'vibe coding' and its impact on development, strategies for tackling large codebases with long context and agents, and the future of programming languages in the age of AI.

    Watch on YouTube: ⁠https://www.youtube.com/watch?v=jwbG_m-X-gE⁠

    Chapters:

    0:00 - Intro
    1:10 - Defining Early Coding Goals
    6:23 - Ingredients of a Great Coding Model
    9:28 - Adapting to Developer Workflows
    11:40 - The Rise of Vibe Coding
    14:43 - Code as a Reasoning Tool
    17:20 - Code as a Universal Solver
    20:47 - Evaluating Coding Models
    24:30 - Leveraging Internal Googler Feedback
    26:52 - Winning Over AI Skeptics
    28:04 - Performance Across Programming Languages
    33:05 - The Future of Programming Languages
    36:16 - Strategies for Large Codebases
    41:06 - Hill Climbing New Benchmarks
    42:46 - Short-Term Improvements
    44:42 - Model Style and Taste
    47:43 - 2.5 Pro’s Breakthrough
    51:06 - Early AI Coding Experiences
    56:19 - Specialist vs. Generalist Models

    Show more Show less
    1 hr
  • Sergey Brin on the Future of AI & Gemini
    Jun 16 2025

    A conversation with Sergey Brin, co-founder of Google and computer scientist working on Gemini, in reaction to a year of progress with Gemini.

    Watch on YouTube: https://www.youtube.com/watch?v=o7U4DV9Fkc0

    Chapters

    0:20 - Initial reactions to I/O
    2:00 - Focus on Gemini’s core text model
    4:29 - Native audio in Gemini and Veo 3
    8:34 - Insights from model training runs
    10:07 - Surprises in current AI developments vs. past expectations
    14:20 - Evolution of model training
    16:40 - The future of reasoning and Deep Think
    20:19 - Google’s startup culture and accelerating AI innovation
    24:51 - Closing

    Show more Show less
    27 mins
No reviews yet