arxiv:2508.03680
Zhiyuan He
hzy46
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?
upvoted a paper 9 months ago
ΔL Normalization: Rethink Loss Aggregation in RLVR
commented on a paper 9 months ago
ΔL Normalization: Rethink Loss Aggregation in RLVR
Organizations
None yet