<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>Dark Factory: OpenClaw Ships Faster Than You Can Read the Diff — Vincent Koc, Comet ML</title>
        <link>https://video.ut0pia.org/videos/watch/df1ce1f7-fde1-4ace-9d4d-a82328ab8482</link>
        <description>Static benchmarks made sense for static software. Agents that adapt to users, rewrite their own harnesses, and shift behavior over time break that assumption. This talk is about what evaluation looks like when the system you're measuring keeps changing underneath you. Vincent Koc traces the arc from prompt engineering to context engineering to intent engineering, where agents self-optimize toward what users actually want. The eval problem compounds at each step: production traces reveal behavioral drift, test suites go stale, and the 20% of edge cases that break your product rarely show up in handcrafted datasets. The alternative he proposes: define the end state, let agents curate their own suites from traces, and treat evals as a living system rather than a point-in-time snapshot. Speaker info: https://x.com/vincent_koc</description>
        <lastBuildDate>Wed, 13 May 2026 17:54:41 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>PeerTube - https://video.ut0pia.org</generator>
        <image>
            <title>Dark Factory: OpenClaw Ships Faster Than You Can Read the Diff — Vincent Koc, Comet ML</title>
            <url>https://video.ut0pia.org/lazy-static/avatars/0287a09a-aae7-4840-9843-b416426e7046.webp</url>
            <link>https://video.ut0pia.org/videos/watch/df1ce1f7-fde1-4ace-9d4d-a82328ab8482</link>
        </image>
        <copyright>All rights reserved, unless otherwise specified in the terms at https://video.ut0pia.org/about or in licenses granted by each content's rightholder.</copyright>
        <atom:link href="https://video.ut0pia.org/feeds/video-comments.xml?videoId=df1ce1f7-fde1-4ace-9d4d-a82328ab8482" rel="self" type="application/rss+xml"/>
    </channel>
</rss>