Gemini Live — Part 1: Building a low-latency, telephone Voice Agent with FreeSWITCH and ADK agents powered by Gemini Live

Németh_Attila · May 11, 2026, 3:22pm

Hi! Thank you for this insightful article. I’m currently experimenting with ADK and the Gemini Live API, and your telephone-voice agent architecture is very inspiring. I have two specific questions regarding your implementation:

Infrastructure: Could you clarify if you are hosting the FreeSWITCH instance yourself (e.g., on a VPS/GCP) to interface with Halonet.pl, or are you using a managed service for the media layer?
Interruption Handling: I’m testing the Live API in a local ADK environment, but I’m struggling with interruptions. When I speak over the AI, the agent continues its response instead of cutting off. In your setup, how is the “barge-in” logic handled? Is the Voice Activity Detection (VAD) managed at the FreeSWITCH level (e.g., via mod_audio_fork), or are you relying solely on the ADK’s internal VAD and the interrupted signals from the Live API?

I would greatly appreciate any guidance or snippets on how you achieved such low-latency, interruptible conversations.

Thanks!

Topic		Replies	Views
Beyond the Chatbot: WebRTC, Gemini, and Your First Real-Time Voice Agent Agents gemini , googler-article	3	1073	November 15, 2025
Deploy bidirectional streaming agents with Vertex AI Agent Engine and Live API Agents agent-engine , googler-article , adk , agent-builder , live-stream-api	1	1469	November 10, 2025
Serving real-time public safety AI with Gemini Live API and ADK Community Articles googler-article , adk , gemini-enterprise , how-we-built-this	0	548	June 3, 2026

Gemini Live — Part 1: Building a low-latency, telephone Voice Agent with FreeSWITCH and ADK agents powered by Gemini Live

AI Suggested topics