Hi everyone,
I’m building a production-grade virtual try-on bot where the goal is very strict:
-
The original photo of the person must remain unchanged (especially face, body proportions, skin, hands)
-
Only the clothing should change
I’ve tested Google Virtual Try-On via Vertex AI, and while results are sometimes visually acceptable, I’m facing critical stability issues that make it unusable for a real product:
-
The photo quality deteriorates significantly, despite reducing the parameter - 0.
-
Face and facial features are often altered
Even with high-quality input photos, the model sometimes reshapes the face (eyes, nose, symmetry). This is a hard blocker — users immediately notice it. -
Incorrect handling of sleeves / arms
If the original model photo has bare arms and the garment has long sleeves, the output often:-
Keeps the arms bare
-
Or partially “cuts” the sleeves
Instead of extending the garment correctly over the arms.
-
-
Overall inconsistency between runs
With similar inputs, results vary a lot. This makes it impossible to guarantee predictable output quality. -
In my experience with NanoBanana and NanoBanana Pro, I haven’t been able to achieve any real stability so far.
The outputs are highly inconsistent: in many cases the model simply returns the original model image or the original garment image without applying any changes at all, and in other cases the garment is applied only partially or unpredictably.
Because of this, I haven’t yet found a way to configure NanoBanana / NanoBanana Pro for reliable, repeatable virtual try-on results.
Does anyone here have real-world experience building a stable virtual try-on pipeline using Google Virtual Try-On without degrading the original image quality (especially face and body preservation)?
Additionally, has anyone managed to achieve highly consistent results with NanoBanana or NanoBanana Pro?
If so, are there any prompts, configurations, or processing strategies that significantly improve stability and prevent cases where the model either returns the original image unchanged or applies the garment inconsistently?





