Nabbers1999's picture
Create README.md
0caf178 verified
|
Raw
History Blame Contribute Delete
809 Bytes
metadata
base_model: mistralai/Ministral-3-8B-Base-2512
library_name: transformers
tags:
  - ministral-3
  - text-generation
  - instruct
  - llamafied
  - novision
license: apache-2.0
language:
  - en
datasets:
  - allenai/tulu-3-sft-olmo-2-mixture-0225
  - nvidia/Nemotron-Instruction-Following-Chat-v1

Llama_Instruct

Mini-Llama 8B Instruct - 0124

My base pretrain model has undergone full fine-tuning on an additional 350M tokens using portions of Tulu 3 and Nvidia Nemotron instruct sets. It is rough but functionsl, and still needs DPO training to align it with human preferences.

For the base pretrain, see: Nabbers1999/Mini-Llama-8B-Base-0124