Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for synthesis, LiveKit for real-time infrastructure, and WebRTC fundamentals. Knows how to build low-latency, production-ready voice experiences. Use when: voice ai, voice agent, speech to text, text to speech, realtime voice.
7.5
Rating
0
Installs
AI & LLM
Category
Strong voice AI skill with practical code examples covering major platforms (OpenAI Realtime, Vapi, Deepgram, ElevenLabs). The description accurately reflects capabilities and provides clear invoke triggers. Task knowledge is solid with working code patterns for real-time streaming, WebSocket handling, and multi-provider integration. Structure is good with clear sections, though some code examples are truncated (assumed complete per instructions). Anti-patterns section is particularly valuable. Novelty is moderate - while voice AI is specialized, the patterns shown are relatively standard implementations that a capable CLI agent could construct with sufficient context, though this skill does reduce token cost by consolidating provider-specific patterns. Minor weaknesses: could benefit from more architecture guidance on latency budgeting, error handling, and testing strategies for production voice apps.
Loading SKILL.md…

Skill Author