Safetensors
English
llama
sliuau commited on
Commit
cc47ceb
·
verified ·
1 Parent(s): c068f26

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md CHANGED
@@ -9,6 +9,14 @@ base_model:
9
  - nvidia/Llama-3.1-Nemotron-Nano-8B-v1
10
  ---
11
  # Model Overview
 
 
 
 
 
 
 
 
12
 
13
  ### Description:
14
  DLER-Llama-3.1-Nemotron-8B is an ultra-efficient 8B open-weight reasoning model designed for challenging tasks such as mathematics, programming, and scientific problem-solving. It is first trained with the DLER algorithm on agentica-org/DeepScaleR-Preview-Dataset and then enhanced using a weight-merging technique to merge with the base model to mitigate accuracy degradation. Compared to the Llama-3.1-Nemotron-8B model, DLER-Llama-Nemotron-8B-Merge achieves substantial efficiency gains, reducing the average response length by nearly 50% across diverse mathematical benchmarks without sacrificing accuracy.
 
9
  - nvidia/Llama-3.1-Nemotron-Nano-8B-v1
10
  ---
11
  # Model Overview
12
+ <div align="center">
13
+ <span style="font-family: default; font-size: 1.5em;">DLER-Llama-Nemotron-8B-Merge</span>
14
+ <div>
15
+ 🚀 The leading efficient reasoning model for cutting-edge research and development 🌟
16
+ </div>
17
+ </div>
18
+
19
+ ![Comparison between Llama-3.1-Nemotron-Nano-8B-v1 and DLER-Llama-Nemotron-8B-Merge](./asset/latency_8b.png)
20
 
21
  ### Description:
22
  DLER-Llama-3.1-Nemotron-8B is an ultra-efficient 8B open-weight reasoning model designed for challenging tasks such as mathematics, programming, and scientific problem-solving. It is first trained with the DLER algorithm on agentica-org/DeepScaleR-Preview-Dataset and then enhanced using a weight-merging technique to merge with the base model to mitigate accuracy degradation. Compared to the Llama-3.1-Nemotron-8B model, DLER-Llama-Nemotron-8B-Merge achieves substantial efficiency gains, reducing the average response length by nearly 50% across diverse mathematical benchmarks without sacrificing accuracy.