Next Forcing: Causal World Modeling with Multi-Chunk Prediction Paper • 2606.11187 • Published 1 day ago • 3
Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action Article • nvidia • 9 days ago • 73
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 14 days ago • 140
UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving Paper • 2604.02190 • Published Apr 2 • 29
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published Jan 6 • 104
SpatialTree: How Spatial Abilities Branch Out in MLLMs Paper • 2512.20617 • Published Dec 23, 2025 • 44
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published Dec 15, 2025 • 108
Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers Paper • 2510.07316 • Published Oct 8, 2025 • 3