-
LongCat-Flash-Thinking-2601 Technical Report
Paper • 2601.16725 • Published • 181 -
TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training
Paper • 2603.01714 • Published • 1 -
SIGHT: Reinforcement Learning with Self-Evidence and Information-Gain Diverse Branching for Search Agent
Paper • 2602.11551 • Published -
Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains?
Paper • 2510.11184 • Published • 1
Jinluan Yang
yangjinluan
AI & ML interests
Trustworthy Machine Learning
Recent Activity
upvoted a paper 29 days ago
Self-Distilled Agentic Reinforcement Learning updated a collection 3 months ago
Agentic RL