<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>Why building eval platforms is hard — Phil Hetzel, Braintrust</title>
        <link>https://video.ut0pia.org/videos/watch/4c891a6b-688d-42ac-ab8f-b45b16601701</link>
        <description>An eval platform is not just a test runner. You are building shared definitions of "good," reliable data pipelines, labelling workflows, versioning, and trust in results across many teams and model changes. This session breaks down the hidden complexity, the common failure modes, and the design principles that make evals credible and usable in day-to-day engineering. Speaker info: https://www.linkedin.com/in/philliphetzel/</description>
        <lastBuildDate>Wed, 29 Apr 2026 11:27:35 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>PeerTube - https://video.ut0pia.org</generator>
        <image>
            <title>Why building eval platforms is hard — Phil Hetzel, Braintrust</title>
            <url>https://video.ut0pia.org/lazy-static/avatars/0287a09a-aae7-4840-9843-b416426e7046.webp</url>
            <link>https://video.ut0pia.org/videos/watch/4c891a6b-688d-42ac-ab8f-b45b16601701</link>
        </image>
        <copyright>All rights reserved, unless otherwise specified in the terms at https://video.ut0pia.org/about or in licenses granted by each content's rightholder.</copyright>
        <atom:link href="https://video.ut0pia.org/feeds/video-comments.xml?videoId=4c891a6b-688d-42ac-ab8f-b45b16601701" rel="self" type="application/rss+xml"/>
    </channel>
</rss>