VoiceOps-fying Low-Latency Intelligence Extraction from Messy Audio Streams

VoiceOps-fying Low-Latency Intelligence Extraction from Messy Audio Streams — Dippu Kumar Singh https://video.ut0pia.org/videos/watch/68c4c10a-8f5b-4267-9005-ec072a0b62be "Processing real-time voice data is an engineering minefield of latency, accents, and interruptions. This session explores the architecture of a Real-Time Voice Intelligence Pipeline deployed in a high-volume contact center. We will move beyond simple transcription to discuss Structured Intent Extraction. I will show you how to design: Voice Capture Pipeline: The entry point for clean, multi-channel data acquisition., Speech-To-Text(STT) Engine: Converting speech to accurate text., Generative AI Core Structure: Using rigorous system prompts to force the LLM to separate ""Customer Intent"" from ""Operator Chit-Chat"" and output valid JSON, even from garbled transcripts., Customer Data Sync: Translating AI insights into enterprise system actions., We reduced post-call work by 50% by shifting compute from ""batch"" to ""stream."" Speaker: Dippu Kumar Singh - Leader Of Emerging Technologies (Apps), Fujitsu North America Inc. Dippu Kumar Singh has over 16 years of experience at the intersection of industry innovation and advanced research. He is a recognized authority in building scalable, trustworthy, and commercially viable AI systems. Being a Leader for Emerging Data & Analytics at Fujitsu North America, Dippu specializes in bridging the gap between theoretical AI concepts and enterprise-grade implementation. His strategic leadership has spearheaded multi-million in sales pipelines and delivered remarkable savings through AI-driven optimizations in transportation, manufacturing, utilities, and supply chain logistics. Socials: https://www.linkedin.com/in/dippukumarsingh/ Slides: https://docs.google.com/presentation/d/1f2y1s64irhdDNTRgK6bWrBtOgMWlhQYM/edit?usp=sharing&ouid=107532212133041789455&rtpof=true&sd=true" Thu, 09 Apr 2026 15:03:19 GMT https://validator.w3.org/feed/docs/rss2.html PeerTube - https://video.ut0pia.org VoiceOps-fying Low-Latency Intelligence Extraction from Messy Audio Streams — Dippu Kumar Singh https://video.ut0pia.org/lazy-static/avatars/0287a09a-aae7-4840-9843-b416426e7046.webp https://video.ut0pia.org/videos/watch/68c4c10a-8f5b-4267-9005-ec072a0b62be All rights reserved, unless otherwise specified in the terms specified at https://video.ut0pia.org/about and potential licenses granted by each content's rightholder.