Getting Started
Introduction
About Daily Bots
With Daily Bots, developers can quickly ship voice and video agents integrated into their own applications. Built with Open Source SDKs and deployed on Daily’s real-time global infrastructure, developers can:
- Create AI agents that talk naturally.
- Design voice-to-voice AI flexibly, with leading commercial and open models. We’ve partnered with Anthropic, Cartesia, Deepgram, ElevenLabs, and Together AI. We also provide the flexibility to provide your own API key for any service.
- Build ultra low latency experiences for desktop, mobile, and telephone.
- Use the leading Open Source tooling for voice-to-voice and multimodal video AI. Daily Bots is built on the Pipecat server framework and works with Pipecat client SDKs. Like Pipecat, Daily Bots implements the RTVI (Real-time Voice Inference) standard for real-time inference.
- Launch quickly and scale on Daily’s global WebRTC infrastructure.
Features
Daily Bots implements best practices for all of the hard, low-level challenges that voice AI product teams face. With a few lines of code, developers can leverage:
- A modular architecture that enables easy switching between different LLMs and voice models. Use state of the art LLMs with large parameter counts where needed. Or use models optimized for conversational response times.
- Multi-turn context management, with tool calling and vision input.
- Voice-to-voice response times as low as 500ms.
- Interruption handling with word-level context accuracy.
- Phrase endpointing that combines voice activity detection, semantic cues, and noise-level averaging.
- Echo cancellation and background noise reduction.
- Metrics and observability down to the level of individual media streams from every session.