Update README.md
Browse files
README.md
CHANGED
|
@@ -123,12 +123,13 @@ Please report security vulnerabilities or NVIDIA AI Concerns [here](https://www.
|
|
| 123 |
If you find our model helpful, please cite the following [paper]():
|
| 124 |
|
| 125 |
```
|
| 126 |
-
|
| 127 |
-
|
| 128 |
-
|
| 129 |
-
|
| 130 |
-
|
| 131 |
-
|
| 132 |
-
|
| 133 |
-
|
|
|
|
| 134 |
```
|
|
|
|
| 123 |
If you find our model helpful, please cite the following [paper]():
|
| 124 |
|
| 125 |
```
|
| 126 |
+
@misc{liu2025dlerdoinglengthpenalty,
|
| 127 |
+
title={DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning},
|
| 128 |
+
author={Shih-Yang Liu and Xin Dong and Ximing Lu and Shizhe Diao and Mingjie Liu and Min-Hung Chen and Hongxu Yin and Yu-Chiang Frank Wang and Kwang-Ting Cheng and Yejin Choi and Jan Kautz and Pavlo Molchanov},
|
| 129 |
+
year={2025},
|
| 130 |
+
eprint={2510.15110},
|
| 131 |
+
archivePrefix={arXiv},
|
| 132 |
+
primaryClass={cs.LG},
|
| 133 |
+
url={https://arxiv.org/abs/2510.15110},
|
| 134 |
+
}
|
| 135 |
```
|