TRACE: A Unified Rollout Budget Allocation Framework for Efficient Agentic Reinforcement Learning Paper • 2606.11119 • Published 10 days ago • 18
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated 19 days ago • 245M • 4.98k
FastKernels: Benchmarking GPU Kernel Generation in Production Paper • 2605.23215 • Published 29 days ago • 8
LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know? Paper • 2605.28721 • Published 24 days ago • 17
Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning Paper • 2605.28424 • Published 24 days ago • 32
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published about 1 month ago • 206