Hello there,
Since Memory Bank launched in Preview a few weeks ago, I’ve been getting a lot of questions in my DMs and on different threads (thanks for all the engagement!).
Rather than answering questions one by one, the Vertex AI Memory team and I created this FAQ to ensure everyone has the same information.
1. “Why not just build this myself?”
Memory Bank handles the messy state management you really don’t want to build yourself.
Sure, you can spin up your own memory system, but the real headache is writing the code that has to:
- Summarize conversation history on the fly.
- Figure out what’s a key fact vs. just noise.
- Know that when a user says their favorite color is “red” now, it replaces the “blue” from last week.
- De-dupe memories so you don’t have five of the same facts.
Memory Bank solves the complex memory issues, allowing you to dedicate your efforts to agent development.
2. “So what’s actually under the hood?”
Here’s what goes on under the hood:
- Memory Generation: We use Gemini to do the “thinking”—understanding the chat and pulling out and continuously updating the facts.
- Memory Retrieval: We use Google’s embedding models for the super-fast semantic search to find the right memory when you need it.
If you really want to go deeper into the method, I recommend reading the research paper that influenced the design.
3. “What about important features like TTL (Time-to-Live) or de-duping memories?”
Yes, these are critical for any memory management service.
- Consolidation/De-duping: This is automatic. The generate_memories method is built to handle conflicts and update facts as new info rolls in.
- Data Aging (TTL): This isn’t in the initial Preview, but it’s on the roadmap. We know you need it to manage costs and keep memories relevant.
4. “Can I use this with Gemma, Llama, Mistral, or other open-source models?”
No, not at the moment. Memory Bank is built to work tightly with the Gemini family. That close integration is how it automates all the memory management features reliably. With that being said, Memory bank is in preview and we are still in time to change it.
5. “Let’s talk about cost. We’re a little worried about pricing…”
Right now, it’s in Preview and provided at no cost. We’ll announce pricing publicly at a later date.
When it goes GA, the price will be on the main Vertex AI pricing page, and we hear your feedback about needing clear pricing information.
We hope you will find these answers helpful. Also this is the perfect time to test Vertex AI Memory Bank and share your feedback with us.
If you want to get started, below you have all resources you need
- Documentation
- Notebooks
- Blog posts
What other questions do you have? Share what you are building in the comments!