<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>Let LLMs Wander: Engineering RL Environments — Stefano Fiorucci</title>
        <link>https://video.ut0pia.org/videos/watch/dd03032e-0142-4029-bcf3-d095149f65da</link>
        <description>Reasoning models like DeepSeek R1 have demonstrated that learning from interaction is just as critical as learning from examples. To build these capabilities ourselves, we need to move beyond static datasets and start building Reinforcement Learning Environments: little worlds where models can act, get rewards, and learn. In this talk, I will walk you through my journey exploring this space from a practical software engineering perspective. We will cover: How classic Reinforcement Learning concepts translate to Language Models, Verifiers, an open-source library to build Environments as software artifacts, Concrete examples of environments, from single-turn tasks to multi-turn games and tool-using agents, How to use these environments for both evaluating and training Small Language Models., Join me to learn how to move from prompting models to building the gyms where they learn. Stefano Fiorucci - AI/SW Engineer/Explorer, deepset Stefano is an AI/Software Engineer and explorer. He currently works on AI Orchestration at Deepset, where he contributes to and maintains Haystack, a widely used open-source framework for building LLM applications. He loves experimenting with Small Language Models, Post-Training and Reinforcement Learning, and shares his learning through code, writing, and talks. Socials: https://twitter.com/theanakin87 https://www.linkedin.com/in/stefano-fiorucci/ https://github.com/anakin87 https://huggingface.co/anakin87 Slides: https://drive.google.com/file/d/116PKThwtyTxeH1GmZQ7bL3HPYM6KCgHa/view?usp=drive_link</description>
        <lastBuildDate>Thu, 09 Apr 2026 15:13:18 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>PeerTube - https://video.ut0pia.org</generator>
        <image>
            <title>Let LLMs Wander: Engineering RL Environments — Stefano Fiorucci</title>
            <url>https://video.ut0pia.org/lazy-static/avatars/0287a09a-aae7-4840-9843-b416426e7046.webp</url>
            <link>https://video.ut0pia.org/videos/watch/dd03032e-0142-4029-bcf3-d095149f65da</link>
        </image>
        <copyright>All rights reserved, unless otherwise specified in the terms specified at https://video.ut0pia.org/about and potential licenses granted by each content's rightholder.</copyright>
        <atom:link href="https://video.ut0pia.org/feeds/video-comments.xml?videoId=dd03032e-0142-4029-bcf3-d095149f65da" rel="self" type="application/rss+xml"/>
    </channel>
</rss>