<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>Any-to-Any: Building Native Multimodal Agents - Patrick Löber, Google DeepMind</title>
        <link>https://video.ut0pia.org/videos/watch/1f0bd1da-5eea-4aa1-9cf7-e55faa96cdd9</link>
        <description>Draw arrows on a map and ask Gemini to generate a picture of what you see. It produces the Golden Gate Bridge, not because it matched pixels, but because the image generation model is built on top of Gemini's world understanding and knows what those arrows are pointing at. Patrick Löber walks through the full any-to-any stack: multimodal understanding, where Gemini ingests PDFs, video, and audio up to nine-plus hours at once; native image and speech generation, called as tools from an agentic loop; and a live audio model where audio goes in and audio comes out through a single architecture with no cascaded pipeline. The session ends with the building blocks for a NotebookLM clone in which a reasoning agent, rather than a hardcoded workflow, decides what to generate. Speaker info: https://x.com/patloeber, https://linkedin.com/in/patrick-l%C3%B6ber-403022137, https://github.com/patrickloeber</description>
        <lastBuildDate>Wed, 20 May 2026 23:00:16 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>PeerTube - https://video.ut0pia.org</generator>
        <image>
            <title>Any-to-Any: Building Native Multimodal Agents - Patrick Löber, Google DeepMind</title>
            <url>https://video.ut0pia.org/lazy-static/avatars/0287a09a-aae7-4840-9843-b416426e7046.webp</url>
            <link>https://video.ut0pia.org/videos/watch/1f0bd1da-5eea-4aa1-9cf7-e55faa96cdd9</link>
        </image>
        <copyright>All rights reserved, unless otherwise specified in the terms at https://video.ut0pia.org/about and potential licenses granted by each content's rightholder.</copyright>
        <atom:link href="https://video.ut0pia.org/feeds/video-comments.xml?videoId=1f0bd1da-5eea-4aa1-9cf7-e55faa96cdd9" rel="self" type="application/rss+xml"/>
    </channel>
</rss>