ResearcharXivNEW

YoCausal: How Far is Video Generation from World Model? A Causality Perspective

Xie 2026-05-28
You-Zhe XieYu-Hsuan LiJie-Ying Lee

As video diffusion models (VDMs) advance toward world models, a key question arises: do they truly understand causality, or merely overfit to statistical temporal patterns? Existing benchmarks mostly rely on synthetic data, limiting real-world generalization due to the sim-to-real gap. We present YoCausal, a two-level benchmark inspired by the Violation of Expectation (VoE) paradigm from cognitive

Topics

AIResearch