<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>GPU Cloud Deployment Without Leaving Your IDE — Audry Hsu, RunPod</title>
        <link>https://video.ut0pia.org/videos/watch/40f05e7e-90f3-4784-9791-ad4351dc37a9</link>
        <description>The iteration cycle before Flash: commit, push, build a Docker image, pull it from the registry, load it onto a server, allocate a GPU, then find out if it works. Audrey Hsu demos what replacing that with a single decorator looks like — add @flash.endpoint to an async Python function and it deploys to GPU cloud from your IDE, with hot reload so a model swap is one line of code rather than a container rebuild. The second demo chains three models: Qwen 3 generates image prompts, DreamShaper renders them, Nano Banana 2 composes the results into a single photo. H100 pricing is $0.00116 per second, charged only while a worker is handling a request. RunPod's recommendation: start with pods while experimenting, switch to serverless when you need hundreds of workers autoscaling across data centers. Speaker info: https://www.linkedin.com/in/audry-hsu/</description>
        <lastBuildDate>Wed, 10 Jun 2026 21:06:43 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>PeerTube - https://video.ut0pia.org</generator>
        <image>
            <title>GPU Cloud Deployment Without Leaving Your IDE — Audry Hsu, RunPod</title>
            <url>https://video.ut0pia.org/lazy-static/avatars/0287a09a-aae7-4840-9843-b416426e7046.webp</url>
            <link>https://video.ut0pia.org/videos/watch/40f05e7e-90f3-4784-9791-ad4351dc37a9</link>
        </image>
        <copyright>All rights reserved, unless otherwise specified in the terms specified at https://video.ut0pia.org/about and potential licenses granted by each content's rightholder.</copyright>
        <atom:link href="https://video.ut0pia.org/feeds/video-comments.xml?videoId=40f05e7e-90f3-4784-9791-ad4351dc37a9" rel="self" type="application/rss+xml"/>
    </channel>
</rss>