ResearcharXivNEW
YoCausal: How Far is Video Generation from World Model? A Causality Perspective
Xie 2026-05-28
You-Zhe XieYu-Hsuan LiJie-Ying Lee
As video diffusion models (VDMs) advance toward world models, a key question arises: do they truly understand causality, or merely overfit to statistical temporal patterns? Existing benchmarks mostly rely on synthetic data, limiting real-world generalization due to the sim-to-real gap. We present YoCausal, a two-level benchmark inspired by the Violation of Expectation (VoE) paradigm from cognitive
Topics
AIResearch