Back Issues This Week → Current Issue → Popular →

All issuesVolume 329, Issue 2IT NewsAI

The Promise And Limitations Of Deepmind's Genie 3

TechTalks, Monday, August 11th, 2025

Google DeepMind has announced Genie 3, a general-purpose world model capable of generating interactive, navigable environments from a text prompt. The model renders these dynamic worlds in real time at 720p resolution and 24 frames per second, allowing a user to explore them with keyboard inputs.

This marks a significant development in world model research, moving from passive video generation to real-time, controllable simulation. However, whether there will be immediate valuable applications for such expensive models remains to be seen.

An emergent architecture for dynamic worlds

Genie 3's real-time capability is powered by an auto-regressive architecture, the same mechanism used in large language models (LLMs). The model generates each new frame by considering the history of previously generated frames and the user's latest action. This process must happen multiple times per second to feel interactive.

more →  ·  More from AI →