I am building a production voice support agent using Dialogflow CX integrated with Twilio (voice) and Vertex AI Search (RAG).
The architecture intentionally separates responsibilities:
Dialogflow CX flows → routing, confirmations, and high-risk / transactional actions
Vertex AI Search (RAG) → informational “How do I…?” explanations
Voice-first interaction (no chat UI)
Expected behaviour
After a RAG response is delivered (via Vertex AI Search):
The agent should continue the conversation
Either:
Ask a clarifying follow-up (e.g. “Do you want me to walk you through this now?”), or
Route the user to the correct Dialogflow CX flow/page based on intent
Actual behaviour
Vertex AI Search returns a correct, grounded answer
After the response:
The conversation ends, idles, or escalates
No follow-up intent capture occurs
CX does not transition to another page or flow
The RAG response itself is correct — the issue is conversation continuation and routing after RAG.
Current configuration (summary)
Dialogflow CX agent (voice)
Twilio for telephony
Vertex AI Search connected as a Data Store
Start Flow used for routing only
Default Welcome Intent disabled
RAG invoked via fulfilment for informational queries
Transactional processes handled in scripted CX flows
What I have already ruled out
Default Welcome intent interception
Obvious page or event handler misconfiguration
Webhook timeouts or fulfilment errors
Twilio audio cut-off issues
Question
Is there a recommended Dialogflow CX design pattern for:
Continuing a conversation after a Vertex AI Search (RAG) response, and
Cleanly transitioning back into CX intent/page routing
without prematurely ending or escalating the interaction?
Specifically:
Should follow-up confirmation be handled in CX rather than via RAG?
Is there a known best practice for chaining RAG → intent capture → flow transition in voice agents?
Any guidance, references, or examples would be appreciated.
