SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks Paper • 2606.09669 • Published 8 days ago • 42
Beyond Retrieval: A Multitask Benchmark and Model for Code Search Paper • 2605.04615 • Published May 6 • 24
TextLDM: Language Modeling with Continuous Latent Diffusion Paper • 2605.07748 • Published May 8 • 26
SetPO: Set-Level Policy Optimization for Diversity-Preserving LLM Reasoning Paper • 2602.01062 • Published Feb 1 • 2
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing Paper • 2604.04911 • Published Apr 6 • 36