Update README.md
README.md
CHANGED
@@ -1,13 +1,13 @@
----
-datasets:
-- agentica-org/DeepScaleR-Preview-Dataset
-language:
-- en
-metrics:
-- accuracy
-base_model:
-- nvidia/Llama-3.1-Nemotron-Nano-8B-v1
----
+---
+datasets:
+- agentica-org/DeepScaleR-Preview-Dataset
+language:
+- en
+metrics:
+- accuracy
+base_model:
+- nvidia/Llama-3.1-Nemotron-Nano-8B-v1
+---
 # Model Overview
 
 ### Description:
@@ -60,7 +60,7 @@ The integration of foundation and fine-tuned models into AI systems requires add
 **Benchmark Score <br>
 
 
-| Model
+| Model | MATH | Length | AIME | Length | AMC | Length | Minerva |Length | Olympiad |Length | Total Avg |
 |---------------------------------|----------|------------|--------------------|------------------|--------------------|------------------|--------------------|------------------|--------------------|------------------|-----------------|
 | Llama-3.1-Nemotron-Nano-8B-v1 | 95.4 | 3069 | 66.4 | 9899 | 88.25 | 6228 | 52.38 | 4031 | 64.33 | 6755 | 5996 |
 | **DLER-Llama-Nemotron-8B-Merge**| **95.2** | **1995** | **66.7** | **5013** | **89.23** | **3358** | **53.19** | **2301** | **65.39** | **3520** | **3237 (-46%)** |
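The README does not say how the `Total Avg` column is computed. A quick check, assuming it is the plain mean of the five `Length` columns and that the `(-46%)` figure is the relative change against the base model, reproduces the table values. The sketch below is only a sanity check under that assumption; the variable and function names are illustrative and do not come from the repository.

```python
# Length values copied from the two table rows
# (order: MATH, AIME, AMC, Minerva, Olympiad).
lengths = {
    "Llama-3.1-Nemotron-Nano-8B-v1": [3069, 9899, 6228, 4031, 6755],
    "DLER-Llama-Nemotron-8B-Merge": [1995, 5013, 3358, 2301, 3520],
}


def mean_length(values):
    """Arithmetic mean of a list of response lengths."""
    return sum(values) / len(values)


# Assumption: "Total Avg" is the unweighted mean over the five benchmarks.
base_avg = mean_length(lengths["Llama-3.1-Nemotron-Nano-8B-v1"])
dler_avg = mean_length(lengths["DLER-Llama-Nemotron-8B-Merge"])

# Assumption: the percentage is the relative change versus the base model.
reduction = (dler_avg - base_avg) / base_avg

print(round(base_avg), round(dler_avg), f"{reduction:+.0%}")
# prints: 5996 3237 -46%
```

Under these assumptions the computed means round to 5996 and 3237, matching the `Total Avg` cells in both rows, and the relative change rounds to -46%.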