I am running a few experiments in GCP Vertex AI using foundation models (mainly Llama 2).
I was able to successfully deploy the Llama 2 7B model and test it. Now I am trying to fine-tune the Llama 2 foundation model on custom data. Has anybody succeeded in fine-tuning the Llama 2 model in Vertex AI?
I have the following questions:
How do I select the model version: 7B, 13B, or 70B? (I am mainly trying to fine-tune 7B.)
What is the format of the input file for fine-tuning?
Is there documentation available on how to fine-tune foundation models?
Any leads would be really helpful for me to get started.
Hi @timbosssds ,
I am still working on fine-tuning the model with the custom data.
I was able to fine-tune the model (llama2-7b) with the custom data given in the example; however, I was not able to fine-tune it with my own data.
I am using the notebook model_garden_pytorch_llama2_peft_finetuning.ipynb.
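To figure out what my own data is missing, I have been comparing it against the example data with something like the rough sketch below. It assumes the notebook's example data is the guanaco instruction dataset on Hugging Face with a single 'text' column, which is what my copy appears to use; substitute whichever dataset your version of the notebook actually points at.

```python
from datasets import load_dataset

# Assumption on my part: the example data in the notebook is the guanaco
# instruction dataset with a single "text" column; swap in whichever dataset
# your copy of the notebook actually references.
ds = load_dataset("timdettmers/openassistant-guanaco", split="train")

print(ds)             # column names and row count
print(ds[0]["text"])  # one formatted multi-turn conversation
```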
Thank you for following up. I started with the Gemma model but ran into issues, then moved to Llama and hit more issues. I can’t share the error details because I had to delete those models/endpoints when I moved on to trying Mistral, which is working. I am going to try to build end-to-end (data ingestion to UI) now, and will hopefully be able to come back to Llama and reuse these learnings. Best of luck on your journey.
I had the same problem, but after spending a whole day trying different combinations of my own dataset and the example dataset, I found the reason: the example data is made of multi-turn conversations between a human and an assistant, so the # Human and # Assistant markers appear at least twice in each example. If your own custom data only has single-turn conversations, you may end up with the same error. What you can do is repeat your single-turn conversation twice in your training data (keeping the key 'text' the same), and it might work. My guess is that the preprocessing only keeps multi-turn conversations (single-turn examples get discarded, so you end up with no training data at all), but since I am using Google Vertex AI I don't have direct access to the underlying code, so that is just a guess.
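Something like the sketch below is what I mean. It assumes your custom data is a list of single-turn question/answer pairs and that the training file is JSONL with a single 'text' field; the '### Human:' / '### Assistant:' markers are my guess at the example format, so double-check the exact strings against your copy of the example data.

```python
import json

# Hypothetical single-turn examples; replace with your own data.
my_examples = [
    {"question": "What is Vertex AI?", "answer": "A managed ML platform on Google Cloud."},
    {"question": "Which Llama 2 sizes exist?", "answer": "7B, 13B and 70B."},
]

# Prompt markers assumed from the example dataset in the notebook
# (double-check the exact strings against your copy of the example data).
HUMAN = "### Human:"
ASSISTANT = "### Assistant:"

with open("train.jsonl", "w") as f:
    for ex in my_examples:
        turn = f"{HUMAN} {ex['question']} {ASSISTANT} {ex['answer']}"
        # Workaround: repeat the single turn so the markers appear at least
        # twice and the example is not discarded as "not multi-turn".
        text = f"{turn} {turn}"
        f.write(json.dumps({"text": text}) + "\n")
```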