Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 14 days ago • 90
Sandboxed Coding Agents are Competitive Omni-modal Task Solvers Paper • 2606.00579 • Published 11 days ago • 1
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints Paper • 2606.05622 • Published 6 days ago • 40
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories Paper • 2605.21468 • Published 21 days ago • 50
Rethinking Cross-Layer Information Routing in Diffusion Transformers Paper • 2605.20708 • Published 21 days ago • 110
AutoRubric-T2I: Robust Rule-Based Reward Model for Text-to-Image Alignment Paper • 2605.17602 • Published 21 days ago • 19
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation Paper • 2605.13724 • Published 28 days ago • 101
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published May 8 • 69
Model Merging Scaling Laws in Large Language Models Paper • 2509.24244 • Published about 1 month ago • 44
CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models Paper • 2605.08735 • Published May 9 • 70
Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning Paper • 2605.10923 • Published about 1 month ago • 13
SimWorld Studio: Automatic Environment Generation with Evolving Coding Agent for Embodied Agent Learning Paper • 2605.09423 • Published May 10 • 2
ClawEnvKit Collection Scalable Environment Generation for Claw-Like Agents • 3 items • Updated Apr 21 • 1
HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper • 2605.06747 • Published May 7 • 52
Evaluating the Semantic Profiling Abilities of LLMs for Natural Language Utterances in Data Visualization Paper • 2407.06129 • Published Jul 8, 2024 • 1