arxiv:2605.09959
Jiaxin Huang
teapot123
AI & ML interests
None yet
Recent Activity
upvoted a paper 4 days ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
upvoted a paper 5 days ago
Process Rewards with Learned Reliability
authored a paper 11 days ago
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding