ReactiveGWM: Steering NPC in Reactive Game World Models
Abstract
ReactiveGWM enables dynamic player-NPC interactions in game worlds by decoupling player controls from NPC behaviors through diffusion models with cross-attention modules for game-agnostic strategy transfer.
Current game world models simulate environments from a subjective, player-centric perspective. However, by treating the Non-Player Character (NPC) merely as background pixels, these models cannot capture interactions between the player and NPC. In that sense, they act as passive video renderers rather than real simulation engines, lacking the physical understanding needed to model action-induced NPC reactions. We introduce ReactiveGWM, a reactive game world model that synthesizes dynamic interactions between the player and NPC. Instead of entangling all interaction dynamics, ReactiveGWM explicitly decouples player controls from NPC behaviors. Player actions are injected into the diffusion backbone via a lightweight additive bias, while high-level NPC responses (e.g., Offense, Control, Defense) are grounded through cross-attention modules. Crucially, these modules learn a game-agnostic representation of interactive logic. This enables zero-shot strategy transfer: our learned modules can be plugged directly into off-the-shelf, unannotated world models of different games, unlocking steerable NPC interactions without any domain-specific retraining. Evaluated on two Street Fighter games, ReactiveGWM maintains fine-grained player controllability while achieving robust, prompt-aligned NPC strategy adherence, paving the way for scalable, strategy-rich interaction with the NPC.
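The decoupling described in the abstract can be illustrated with a toy sketch: one path adds a per-action bias vector to every latent token, and a separate path lets those tokens attend to strategy tokens through cross-attention. This is not the authors' implementation. The function names, the latent width `DIM`, the number of actions, the two-token strategy embeddings, and the random weights are all hypothetical, and the code uses plain Python lists rather than a deep-learning framework so that it runs without dependencies.

```python
import math
import random

DIM = 4  # hypothetical latent width, kept tiny for readability


def rand_matrix(rows, cols, rng):
    """Random stand-in for learned weights (assumption: no real checkpoint)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matmul(a, b):
    """Multiply an (n x k) matrix by a (k x m) matrix, both as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def add_action_bias(tokens, action_id, action_table):
    """Player path: add one vector per action to every latent token."""
    bias = action_table[action_id]
    return [[t + b for t, b in zip(tok, bias)] for tok in tokens]


def strategy_cross_attention(tokens, strategy_tokens, wq, wk, wv):
    """NPC path: latent tokens (queries) attend to strategy tokens (keys/values)."""
    q = matmul(tokens, wq)
    k = matmul(strategy_tokens, wk)
    v = matmul(strategy_tokens, wv)
    k_t = [list(col) for col in zip(*k)]  # transpose keys for q @ k^T
    scale = 1.0 / math.sqrt(DIM)
    weights = [softmax([s * scale for s in row]) for row in matmul(q, k_t)]
    attended = matmul(weights, v)
    # Residual connection: the strategy signal is added on top of the tokens.
    return [[t + a for t, a in zip(tok, att)] for tok, att in zip(tokens, attended)]


rng = random.Random(0)
action_table = rand_matrix(3, DIM, rng)  # 3 hypothetical player actions
strategy_bank = {
    name: rand_matrix(2, DIM, rng)  # 2 hypothetical tokens per strategy
    for name in ("Offense", "Control", "Defense")
}
wq, wk, wv = (rand_matrix(DIM, DIM, rng) for _ in range(3))
latents = rand_matrix(5, DIM, rng)  # 5 latent tokens of one frame


def block(tokens, action_id, strategy):
    """Apply the player path, then the NPC path."""
    biased = add_action_bias(tokens, action_id, action_table)
    return strategy_cross_attention(biased, strategy_bank[strategy], wq, wk, wv)


out_offense = block(latents, 1, "Offense")
out_defense = block(latents, 1, "Defense")
```

In this sketch the strategy path depends only on `wq`, `wk`, `wv`, and the strategy tokens, and never reads `action_table`. That separation is what would, in principle, allow such a module to be attached to a different backbone, in the spirit of the zero-shot transfer the abstract reports.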
Community
🚀 ReactiveGWM: Bringing true autonomy to NPCs in Game World Models! Instead of treating NPCs as background pixels, we explicitly decouple player controls from NPC behaviors.
✨ Highlights:
Strategy-Driven: NPCs autonomously execute high-level intents (Offense, Control, Defense).
Zero-Shot Transfer: The learned interactive logic is game-agnostic. Plug our modules directly into unannotated world models of different games to unlock steerable NPCs instantly—no domain-specific retraining needed!
High Fidelity: Preserves fine-grained player control and native visual dynamics.
👇 Dive in:
📄 Arxiv: http://arxiv.org/abs/2605.15256
💻 Code: https://github.com/INV-WZQ/ReactiveGWM
🏠 Project: https://inv-wzq.github.io/ReactiveGWM/
🤗 Model: https://huggingface.co/INV-WZQ/ReactiveGWM-Models
🗄️ Dataset: https://huggingface.co/datasets/INV-WZQ/ReactiveGWM-Datasets