Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models Paper • 2605.11887 • Published May 12 • 15
LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories Paper • 2606.13578 • Published 3 days ago • 52
Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding? Paper • 2606.08063 • Published 8 days ago • 74
InterleaveThinker: Reinforcing Agentic Interleaved Generation Paper • 2606.13679 • Published 3 days ago • 74
Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution Paper • 2606.10917 • Published 4 days ago • 74
Redesign Mixture-of-Experts Routers with Manifold Power Iteration Paper • 2606.12397 • Published 4 days ago • 83
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning Paper • 2606.13673 • Published 3 days ago • 80
EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments Paper • 2606.13681 • Published 3 days ago • 118
Self-Evolving Vision-Language Models for Image Quality Assessment via Voting and Ranking Paper • 2509.25787 • Published Jan 27 • 3
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published Jan 21, 2025 • 65
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published Oct 30, 2024 • 50
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models Paper • 2401.13919 • Published Jan 25, 2024 • 33
view article Article olmo-eval: An evaluation workbench for the model development loop allenai • 1 day ago • 5
Community Contribution Collection Collection to host models submitted by community contributors . See contribution guidelines for details. • 1 item • Updated 1 day ago • 1
High-Fidelity Two-Step Image Generation via Teacher-Aligned End-to-End Distillation Paper • 2606.12575 • Published 4 days ago • 7