Voice agents don’t just transcribe anymore — they think, talk, and call tools in real time.
This Build Hour demos speech-to-speech agents built with the Realtime API and Agents SDK that can handle conversations natively in audio, reason about context, and call tools while streaming speech back to the user.
Brian Fioca and Prashant Mital (Applied AI) cover:
– Why voice agents now: APIs to the real world, expressive + accessible interactions
– Architectures: chained speech-to-text vs. end-to-end speech-to-speech models
– Live demo: building a voice-powered workspace manager + designer agent with handoffs
– Best practices: evals, guardrails, and delegation
– Live Q&A
👉 Follow along with the code repo:
👉 Check out the voice agents guide:
👉 Sign up for upcoming live Build Hours:
source