tatsu-lab/alpaca
Viewer β’ Updated β’ 52k β’ 83.6k β’ 992
How to use Pritish92/open-llama-3b-v2-grit-alpaca with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("openlm-research/open_llama_3b_v2")
model = PeftModel.from_pretrained(base_model, "Pritish92/open-llama-3b-v2-grit-alpaca")This model is a fine-tuned version of openlm-research/open_llama_3b_v2 using the GRIT (Gradient Regularized Instruction Tuning) algorithm and QLoRA on the Alpaca dataset.
The base model is quantized to 4-bit (NF4) to enable efficient fine-tuning.
accelerate for efficient training.In benchmark comparisons, GRIT has shown faster convergence and better stability than standard LoRA or fine-tuning, making it well-suited for efficient single-epoch training.
If you use this model, please cite:
@misc{grit-openllama-3b-alpaca,
title={OpenLlama 3B v2 Fine-tuned with GRIT on Alpaca},
author={Pritish92},
year={2025},
publisher={Hugging Face},
url={https://huggingface.co/Pritish92/open-llama-3b-v2-grit-alpaca}
}
This model inherits the Apache 2.0 license.
Base model
openlm-research/open_llama_3b_v2
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("openlm-research/open_llama_3b_v2") model = PeftModel.from_pretrained(base_model, "Pritish92/open-llama-3b-v2-grit-alpaca")