<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>From Transcription to Live Music: Gemini's Audio Stack — Thor Schaeff, Google DeepMind</title>
        <link>https://video.ut0pia.org/videos/watch/2f9ed14f-c41b-44dd-8715-a82ed7b18cc5</link>
        <description>One API call to Gemini 3 Flash Preview: speaker labels by name, timestamps, emotion tags, language detection with English translation, and a full summary. That is the audio understanding layer that underlies everything else Thor Schaeff demos here, including speech generation directed by a "director's note" rather than picked from a catalogue, and Gemini 3.1 Flash Live, a sound to sound real time multimodal model with thinking baked in rather than cascaded through a separate LLM. The talk ends with Lyria 3, Google DeepMind's music generation model that can now produce full songs with lyrics. The live demo has the Gemini Live model calling Lyria via tool use on request to generate a German techno schlager about the UK startup scene, live on stage. Speaker info: https://x.com/thorwebdev, https://www.linkedin.com/in/thorwebdev</description>
        <lastBuildDate>Wed, 10 Jun 2026 22:27:20 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>PeerTube - https://video.ut0pia.org</generator>
        <image>
            <title>From Transcription to Live Music: Gemini's Audio Stack — Thor Schaeff, Google DeepMind</title>
            <url>https://video.ut0pia.org/lazy-static/avatars/0287a09a-aae7-4840-9843-b416426e7046.webp</url>
            <link>https://video.ut0pia.org/videos/watch/2f9ed14f-c41b-44dd-8715-a82ed7b18cc5</link>
        </image>
        <copyright>All rights reserved, unless otherwise specified in the terms specified at https://video.ut0pia.org/about and potential licenses granted by each content's rightholder.</copyright>
        <atom:link href="https://video.ut0pia.org/feeds/video-comments.xml?videoId=2f9ed14f-c41b-44dd-8715-a82ed7b18cc5" rel="self" type="application/rss+xml"/>
    </channel>
</rss>