Nabbers1999 committed on
Commit
0caf178
·
verified ·
1 Parent(s): 0d2ad88

Create README.md

Files changed (1)
  1. README.md +24 -0
README.md ADDED
@@ -0,0 +1,24 @@
+ ---
+ base_model: mistralai/Ministral-3-8B-Base-2512
+ library_name: transformers
+ tags:
+ - ministral-3
+ - text-generation
+ - instruct
+ - llamafied
+ - novision
+ license: apache-2.0
+ language:
+ - en
+ datasets:
+ - allenai/tulu-3-sft-olmo-2-mixture-0225
+ - nvidia/Nemotron-Instruction-Following-Chat-v1
+ ---
+
+ ![Llama_Instruct](https://cdn-uploads.huggingface.co/production/uploads/6686e9b5a244c4409d9ddf08/uKztWCe1D02MWUCEWQcT_.png)
+
+ # Mini-Llama 8B Instruct - 0124
+ My base pretrained model has undergone full fine-tuning on an additional 350M tokens drawn from portions of the Tulu 3 and NVIDIA Nemotron instruction datasets.
+ It is rough but functional, and it still needs DPO training to align it with human preferences.
+
+ For the base pretrained model, see: [Nabbers1999/Mini-Llama-8B-Base-0124](https://huggingface.co/Nabbers1999/Mini-Llama-8B-Base-0124)