Video-MME-Logical: A Controlled Diagnostic Benchmark for Video Temporal-Logical Reasoning • Paper • 2606.27828 • Published 7 days ago • 23
EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation • Paper • 2605.23271 • Published May 22 • 81