Nabbers1999's picture
Create README.md
0caf178 verified
|
Raw
History Blame Contribute Delete
809 Bytes
---
base_model: mistralai/Ministral-3-8B-Base-2512
library_name: transformers
tags:
- ministral-3
- text-generation
- instruct
- llamafied
- novision
license: apache-2.0
language:
- en
datasets:
- allenai/tulu-3-sft-olmo-2-mixture-0225
- nvidia/Nemotron-Instruction-Following-Chat-v1
---
![Llama_Instruct](https://cdn-uploads.huggingface.co/production/uploads/6686e9b5a244c4409d9ddf08/uKztWCe1D02MWUCEWQcT_.png)
# Mini-Llama 8B Instruct - 0124
My base pretrain model has undergone full fine-tuning on an additional 350M tokens using portions of Tulu 3 and Nvidia Nemotron instruct sets.
It is rough but functionsl, and still needs DPO training to align it with human preferences.
For the base pretrain, see: [Nabbers1999/Mini-Llama-8B-Base-0124](https://huggingface.co/Nabbers1999/Mini-Llama-8B-Base-0124)