jpacifico committed on
Commit
5eecaaf
·
verified ·
1 Parent(s): 3ba9d45

Update README.md

  1. README.md +7 -1
README.md CHANGED
@@ -35,11 +35,17 @@ Optimized variants (MLX, GGUF) are also available, making the model particularly
  - **Post training methods:** DPO + Model Merging
  - **Objective:** Improve French instruction-following and reasoning through post-training, while maintaining strong multilingual performance and keeping the same model size.

+ **Model Variants**
+
+ - Chocolatine-2-4B-Instruct-DPO-v2.1 (this repo): Contains the retrainable weights in BF16 format
+ - Quantized GGUF versions : [Q4_K_M](https://huggingface.co/jpacifico/Chocolatine-2-4B-Instruct-DPO-v2.1-Q4_K_M-GGUF) / [Q8_0](https://huggingface.co/jpacifico/Chocolatine-2-4B-Instruct-DPO-v2.1-Q8_0-GGUF)
+ - MLX (optimized for Apple silicon): [4Bit](https://huggingface.co/jpacifico/Chocolatine-2-4B-Instruct-DPO-v2.1-mlx-4Bit) / [6Bit](https://huggingface.co/jpacifico/Chocolatine-2-4B-Instruct-DPO-v2.1-mlx-8Bit)
+
  ## Benchmarks

  Performance improves consistently across all tested FR benchmarks :

- | Benchmark | Qwen/Qwen3-4B-Thinking-2507 | Chocolatine-2-4B-Instruct-DPO-v2.1 |
+ | Benchmark | Qwen3-4B-Thinking-2507 | Chocolatine-2-4B-Instruct-DPO-v2.1 |
  |---|---:|---:|
  | french_bench_arc_challenge | 47.13 | **49.79** |
  | french_bench_grammar | 70.59 | **72.27** |
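
As a quick check on the two benchmark rows visible in this hunk, the gain over the base model can be computed directly. This is only a minimal sketch: the `scores` dictionary and the `gains` name are illustrative and not part of the README, and only the four numbers are taken from the table.

```python
# Scores copied from the two table rows in the hunk:
# (Qwen3-4B-Thinking-2507, Chocolatine-2-4B-Instruct-DPO-v2.1)
scores = {
    "french_bench_arc_challenge": (47.13, 49.79),
    "french_bench_grammar": (70.59, 72.27),
}

# Absolute gain in points, rounded to two decimals to hide float noise.
gains = {
    name: round(tuned - base, 2)
    for name, (base, tuned) in scores.items()
}

for name, gain in gains.items():
    print(f"{name}: {gain:+.2f} points")
```

A positive value on every row is what the sentence "Performance improves consistently" claims, at least for the rows shown here.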