Preference Orchestrator: Prompt-Aware Multi-Objective Alignment for Large Language Models Paper • 2511.10656 • Published Nov 3, 2025
SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks Paper • 2606.09669 • Published 12 days ago • 44
SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating Paper • 2606.07074 • Published 15 days ago • 12
SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks Paper • 2606.09669 • Published 12 days ago • 44
SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search Paper • 2605.29796 • Published 23 days ago • 25
CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents Paper • 2605.25624 • Published 26 days ago • 34
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published Apr 6 • 203