Sign in

Your complete platform for real-time communication

Start new thread

AI Voice Agent SDK - The open-source framework for real-time AI voice

by

Real-time Voice AI Agents We are open-sourcing the easiest way for developers to build real-time Voice Agents and Virtual Avatars into any app—telephony, web, mobile, robotics, wearables, and beyond.

Replies

Best

Video SDK

Maker

📌

👋 Hey Product Hunt, I’m Arjun, co-founder of VideoSDK.

I'm beyond excited to launch our Open-Source AI Voice Agent SDK.

Today, voice is becoming the new UI. We expect agents to understand us, respond instantly, and work seamlessly across web, mobile, and even telephony. But, to achieve this, developers have to stitch together: STT, LLM, TTS, glued with HTTP endpoints and, a prayer.

This most often results in agents that sound robotic, hallucinations and fail in product environments without observability.

So we built something to solve that: End-to-End infrastructure to build, deploy, and monitor your AI Voice Agents

Here’s what it offers:

Global WebRTC infra with <80ms latency
Native turn detection, VAD, and noise suppression
Modular pipelines for STT, LLM, TTS, avatars, and real-time model switching
Built-in RAG + memory for grounding and hallucination resistance
SDKs for web, mobile, Unity, IoT, and telephony — no glue code needed
Agent Cloud to scale infinitely with one-click deployments — or self-host with full control

Think of it like moving from a walkie-talkie to a modern cell towers that handles thousands of calls.

VideoSDK gives you the infrastructure to build voice agents that actually work in the real world, at scale.

I'd love your thoughts and questions! Happy to dive deep into architecture, use cases, or crazy edge cases you've been struggling with.

1mo ago

MD Amirul Islam

Earth.fm

@arjun_kava1 Design is so sleek and user-friendly!

1mo ago

Video SDK

Maker

@1mirul Thanks a lot for your kind words.

1mo ago

Sellible

@arjun_kava1 very cool! Looking forward to trying out the sdk soon.

1mo ago

Video SDK

Maker

@preetraj Thanks a lot, Preet for your kind words.

1mo ago

Video SDK

🔥 This is a game-changer for anyone building with voice! Love how you're simplifying the entire stack for real-time Voice AI—especially the flexibility across telephony, web, mobile, and even robotics. Open-sourcing it makes it even more powerful for indie hackers and startups. Huge kudos to the team 👏

1mo ago

Video SDK

Maker

@yash_chudasama Thanks a ton, Yash! 🙌 Glad you loved it — we're excited to see what builders like you create with the Voice Agent SDK! 🚀

1mo ago

this looks promising, makes your main product a full stack video & audio framework for building agents.

Congrats on the launch team!

1mo ago

Video SDK

Maker

@theanimeshs Thanks so much! Really appreciate the support! 🙌

1mo ago

Neel Patel 🦕

Another amazing launch! Let's go team @arjun_kava1 @sagar_kava

1mo ago

Video SDK

Maker

@sagar_kava @neelptl2602 Thanks a lot for your support.

1mo ago

Kombai

Congrats on the launch. @arjun_kava1 .

1mo ago

Video SDK

Maker

@sourabh_upreti Thanks so much! Appreciate the support 🙌

1mo ago

Tidyread

Launching soon!

Congrats on the launch! 🎉 @Video SDK This looks like a game-changer for voice AI development. The <80ms latency with global WebRTC infrastructure sounds impressive. Quick question - how does your native turn detection handle overlapping speech or interruptions? That's always been a challenge with voice agents. Also curious about the pricing model for the Agent Cloud vs self-hosting options!

1mo ago

Video SDK

Maker

@nicoleastor Thanks! 🙌 Super glad you found it interesting. Turn detection handles overlaps smartly in real-time — curious, are you exploring voice agents for a specific use case?

1mo ago

Video SDK

Congrats on the launch, team!!! 🥳

For Introducing Voice Agent SDK — an open-source framework to build real-time voice agents that actually work in production.

Built on VideoSDK, it empowers agents to join meetings, listen, speak, and think — all with under 80ms latency.
The cascading pipeline supports STT, LLM, TTS, VAD, and Turn Detection — fully provider-agnostic.
With A2A and MCP, you get multi-agent collaboration and seamless integration with external tools and services.

We can’t wait to see what the community builds with Voice Agent SDK — go create something amazing!

1mo ago

Video SDK

Maker

@deep_bhupatkar Thanks so much! Really appreciate the support! 🙌

1mo ago

Video SDK's AI Voice Agent SDK with its low - latency infrastructure and modular pipelines is a great help for developers building real - time voice applications! For developers who want to integrate custom AI models into the SDK, does Video SDK's AI Voice Agent SDK support easy integration of custom models?

1mo ago

Maulik Dhameliya

Congratulations on the launch team. Feel free to import it on LaunchIgniter for maximum visibility

1mo ago

Video SDK

Maker

@mjdhameliya Thanks so much appreciate it!

1mo ago

Hi, Arjun and team. Congrats on the launch!

1mo ago

Video SDK

Maker

@maksym_skrypka Thanks so much! We really appreciate your support.

1mo ago

Video SDK

Maker

@maksym_skrypka Thanks for your support.

1mo ago

Nader Ikladious

Linkinize

This is a fantastic step toward making real-time voice AI more accessible to developers across platforms. Love that it's open-source — excited to see what the community builds with it! Congrats on the launch 🚀

1mo ago

Video SDK

Maker

@naderikladious Thank you! We’re thrilled to see the excitement can’t wait to see what you and the community create with it! 🚀

1mo ago

DhiWise

Congratulations on the launch team Video SDK and @arjun_kava1 .

1mo ago

Video SDK

Maker

@rahul_shingala Thanks a lot, Chief.

1mo ago

Congratulations on the launch!

1mo ago

@kbao5 Thanks Karina! Really Appreciate your support

1mo ago

Video SDK

Maker

@kbao5 Thank you so much.

1mo ago

I think I saw an SDK that looks almost 1-to-1 like yours and is called LiveKit.

You have a cool product; can you tell me how you differ and how long you've been working on it? Huge scale of work!

1mo ago

@hormold Hey Nikita! VideoSDK provides SDKs for all the platforms for creating virtual meetings & Connecting realtime audio/video to Unity Games, with AI AgentSDK you can connect an AI participant to those sessions, You can checkout our docs to know see all the features that are available

1mo ago

Kandid

“VideoSDK looks stellar love how it streamlines building real-time video and audio experiences with easy-to-use APIs. what’s next for it? Are you planning upgrades like low-latency streaming, built-in recording, or advanced moderation tools?”

1mo ago

@pulkitgarg Hey Pulkit! Thanks for your support! We already have ILS Steaming with near low latency, Also you can record entire session or individual participant stream easily and store it in any cloud

1mo ago

They are great!!

1mo ago

@tahel_romero Thanks Tahel !, Really appreciate your support

1mo ago

Refrens

Getting that 80ms latency consistently is the key. Kudos for achieving this.

1mo ago

thanks for making it open source!

1mo ago

All the best for today, I’ve been a VoiceDSK customer for years, and the evolution is promising.

1mo ago

Super excited to check it out! Great product @arjun_kava1 @sagar_kava

1mo ago