<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>VoiceOps-fying Low-Latency Intelligence Extraction from Messy Audio Streams — Dippu Kumar Singh</title>
        <link>https://video.ut0pia.org/videos/watch/68c4c10a-8f5b-4267-9005-ec072a0b62be</link>
        <description>"Processing real-time voice data is an engineering minefield of latency, accents, and interruptions. This session explores the architecture of a Real-Time Voice Intelligence Pipeline deployed in a high-volume contact center. We will move beyond simple transcription to discuss Structured Intent Extraction. I will show you how to design: Voice Capture Pipeline: The entry point for clean, multi-channel data acquisition., Speech-To-Text(STT) Engine: Converting speech to accurate text., Generative AI Core Structure: Using rigorous system prompts to force the LLM to separate ""Customer Intent"" from ""Operator Chit-Chat"" and output valid JSON, even from garbled transcripts., Customer Data Sync: Translating AI insights into enterprise system actions., We reduced post-call work by 50% by shifting compute from ""batch"" to ""stream."" Speaker: Dippu Kumar Singh - Leader Of Emerging Technologies (Apps), Fujitsu North America Inc. Dippu Kumar Singh has over 16 years of experience at the intersection of industry innovation and advanced research. He is a recognized authority in building scalable, trustworthy, and commercially viable AI systems. Being a Leader for Emerging Data &amp; Analytics at Fujitsu North America, Dippu specializes in bridging the gap between theoretical AI concepts and enterprise-grade implementation. His strategic leadership has spearheaded multi-million in sales pipelines and delivered remarkable savings through AI-driven optimizations in transportation, manufacturing, utilities, and supply chain logistics. Socials: https://www.linkedin.com/in/dippukumarsingh/ Slides: https://docs.google.com/presentation/d/1f2y1s64irhdDNTRgK6bWrBtOgMWlhQYM/edit?usp=sharing&amp;ouid=107532212133041789455&amp;rtpof=true&amp;sd=true"</description>
        <lastBuildDate>Thu, 09 Apr 2026 15:03:19 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>PeerTube - https://video.ut0pia.org</generator>
        <image>
            <title>VoiceOps-fying Low-Latency Intelligence Extraction from Messy Audio Streams — Dippu Kumar Singh</title>
            <url>https://video.ut0pia.org/lazy-static/avatars/0287a09a-aae7-4840-9843-b416426e7046.webp</url>
            <link>https://video.ut0pia.org/videos/watch/68c4c10a-8f5b-4267-9005-ec072a0b62be</link>
        </image>
        <copyright>All rights reserved, unless otherwise specified in the terms specified at https://video.ut0pia.org/about and potential licenses granted by each content's rightholder.</copyright>
        <atom:link href="https://video.ut0pia.org/feeds/video-comments.xml?videoId=68c4c10a-8f5b-4267-9005-ec072a0b62be" rel="self" type="application/rss+xml"/>
    </channel>
</rss>