Getting Started
Introduction
About Daily Bots
With Daily Bots, developers can quickly ship voice and video agents integrated into their own applications. Built with Open Source SDKs and deployed on Daily’s real-time global infrastructure, developers can:
- Create AI agents that talk naturally
- Design voice-to-voice AI flexibly, with leading commercial and open models. We’ve partnered with Anthropic, Cartesia, Deepgram, and Together AI. You can also use any LLM that supports OpenAI-compatible APIs.
- Build ultra low latency experiences for desktop, mobile, and telephone.
- Use the leading Open Source tooling for voice-to-voice and multimodal video AI. Daily Bots implements the RTVI standard for real-time inference, and is built on the Pipecat server-side framework.
- Launch quickly and scale on Daily’s global WebRTC infrastructure
Features
Daily Bots implements best practices for all of the hard, low-level challenges that voice AI product teams face. With a few lines of code, developers can leverage:
- A modular architecture that enables easy switching between different LLMs and voice models. Use state of the art LLMs with large parameter counts where needed. Or use models optimized for conversational response times.
- Multi-turn context management, with tool calling and vision input.
- Voice-to-voice response times as low as 500ms.
- Interruption handling with word-level context accuracy.
- Phrase endpointing that combines voice activity detection, semantic cues, and noise-level averaging.
- Echo cancellation and background noise reduction.
- Metrics and observability down to the level of individual media streams from every session.