LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation Paper • 2605.18739 • Published 6 days ago • 108
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 20 days ago • 334
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation Paper • 2605.03849 • Published 19 days ago • 124
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published 30 days ago • 63
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation Paper • 2604.18168 • Published Apr 20 • 97
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published Mar 16 • 154
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published Mar 3 • 145
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer Paper • 2601.14250 • Published Jan 20 • 48
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures Paper • 2601.11514 • Published Jan 16 • 24
Sharp Monocular View Synthesis in Less Than a Second Paper • 2512.10685 • Published Dec 11, 2025 • 30
Reflection Removal through Efficient Adaptation of Diffusion Transformers Paper • 2512.05000 • Published Dec 4, 2025 • 18
Diffusers welcomes FLUX-2 Article • YiYiXu, dg845, sayakpaul, OzzyGT, dn6, ariG23498, linoyts, multimodalart • Nov 25, 2025 • 191