MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines
Abstract
Video world models with explicit external memory enable user-controlled environment editing and real-time multiplayer interactions by decomposing generation into memory, observation, and dynamics modules.
Video world models have shown immense promise for interactive simulation and entertainment, but current systems still struggle with two important aspects of interactivity: user control over the environment for reproducible, editable experiences, and shared inference in which multiple players influence a common world. To address these limitations, we introduce an explicit external memory into the system: a persistent state that operates independently of the model's context window, is continually updated by user actions, and is queried throughout the generation rollout. Unlike conventional diffusion game engines that operate as next-frame predictors, our approach decomposes generation into Memory, Observation, and Dynamics modules. This design gives users direct control over environment structure via an editable memory representation, and it naturally extends to real-time multiplayer rollouts with coherent viewpoints and consistent cross-player interactions.
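The Memory / Observation / Dynamics split described in the abstract can be illustrated with a deliberately tiny, non-neural sketch. Everything in it is an assumption made for illustration: the tile-grid memory, the names `Memory`, `dynamics`, `observe`, and `step`, and the text "rendering" are hypothetical stand-ins, not the paper's architecture or API. In the actual system the observation step would presumably be a learned generative model. The sketch only shows the control flow: actions update a persistent external state, each player's view is produced by querying that state, and a user edit to the state changes what every player subsequently sees.

```python
from dataclasses import dataclass, field

# Toy illustration only: all names below are hypothetical, not from the paper.

@dataclass
class Memory:
    """Persistent external state shared by all players."""
    grid: list                                   # rows of tiles: '#' wall, '.' floor
    poses: dict = field(default_factory=dict)    # player id -> (row, col)

    def edit(self, row, col, tile):
        """Direct user edit of the environment structure."""
        self.grid[row][col] = tile


MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1), "stay": (0, 0)}


def dynamics(memory, player, action):
    """Update one player's pose in memory; walls, bounds, and other players block."""
    r, c = memory.poses[player]
    dr, dc = MOVES[action]
    nr, nc = r + dr, c + dc
    in_bounds = 0 <= nr < len(memory.grid) and 0 <= nc < len(memory.grid[0])
    occupied = any(pos == (nr, nc) for pid, pos in memory.poses.items() if pid != player)
    if in_bounds and memory.grid[nr][nc] != "#" and not occupied:
        memory.poses[player] = (nr, nc)


def observe(memory, player, radius=1):
    """Produce a player's local view by querying memory.

    Stand-in for a generative observation model: here it is a text crop
    centred on the player ('@'), with other players drawn as 'P'.
    """
    r0, c0 = memory.poses[player]
    others = {pos for pid, pos in memory.poses.items() if pid != player}
    rows = []
    for r in range(r0 - radius, r0 + radius + 1):
        line = ""
        for c in range(c0 - radius, c0 + radius + 1):
            if (r, c) == (r0, c0):
                line += "@"
            elif (r, c) in others:
                line += "P"
            elif 0 <= r < len(memory.grid) and 0 <= c < len(memory.grid[0]):
                line += memory.grid[r][c]
            else:
                line += " "                      # outside the map
        rows.append(line)
    return rows


def step(memory, actions):
    """One multiplayer rollout step: apply all actions, then render every view."""
    for player, action in actions.items():
        dynamics(memory, player, action)
    return {player: observe(memory, player) for player in memory.poses}


if __name__ == "__main__":
    world = Memory(
        grid=[list("...."), list(".#.."), list("....")],
        poses={"a": (1, 0), "b": (1, 2)},
    )
    step(world, {"a": "right"})        # blocked by the wall at (1, 1)
    world.edit(1, 1, ".")              # user removes the wall
    for player, view in step(world, {"a": "right"}).items():
        print(player, view)
```

Because both views are derived from the same memory, the two players see each other at consistent positions, and the wall edit takes effect for everyone on the next step. That shared-state property is the point of the sketch; the rendering and physics here are placeholders.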
Get this paper in your agent:
hf papers read 2603.06679