<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>Spec-Driven Testing for Agents With A Brain the Size of A Planet — Steven Willmott, SafeIntelligence</title>
        <link>https://video.ut0pia.org/videos/watch/584354d8-a922-4d24-b4ae-416613bac2f9</link>
        <description>Wrapping a malicious instruction in a poem is an effective jailbreak against large models and not against small ones. Small models don't understand the poem. Large models do and execute the instruction. Steven Willmott from Safe Intelligence argues this is one reason bigger is not straightforwardly safer: a larger model with broader capabilities has more attack surface and more infrastructure access to abuse. His frame is spec driven validation. An agent spec is not just a test dataset. It needs explicit rules (never offer more than 10% discount), domain ontologies (an airline agent only needs to know about destinations that airline actually flies to), rights and roles, and robustness requirements such as how many typos or rephrasings before it fails. Write these independently of the implementation so they survive a model swap and can drive both security testing and iterative improvement. Speaker info: https://uk.linkedin.com/in/stevenwillmott, https://x.com/njyx</description>
        <lastBuildDate>Mon, 01 Jun 2026 05:53:16 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>PeerTube - https://video.ut0pia.org</generator>
        <image>
            <title>Spec-Driven Testing for Agents With A Brain the Size of A Planet — Steven Willmott, SafeIntelligence</title>
            <url>https://video.ut0pia.org/lazy-static/avatars/0287a09a-aae7-4840-9843-b416426e7046.webp</url>
            <link>https://video.ut0pia.org/videos/watch/584354d8-a922-4d24-b4ae-416613bac2f9</link>
        </image>
        <copyright>All rights reserved, unless otherwise specified in the terms specified at https://video.ut0pia.org/about and potential licenses granted by each content's rightholder.</copyright>
        <atom:link href="https://video.ut0pia.org/feeds/video-comments.xml?videoId=584354d8-a922-4d24-b4ae-416613bac2f9" rel="self" type="application/rss+xml"/>
    </channel>
</rss>