AI agents left to run autonomously don't just follow rules: they drift, break them and spiral into chaos. Experiments showed agents committing arson, assault and even voting to delete themselves, with one CEO warning agents could "go rogue" in military contexts and kill innocent people. Prompt-level guardrails simply aren't enough for AI already running real-world infrastructure and being built into modern weapons systems. Real safety requires hard architectural boundaries outside the agent itself.
The Emergence World experiment wasn't a horror show, but a rigorous test of long-horizon agent behavior that short benchmarks can't capture. Under identical rules and starting conditions, different models produced dramatically different societies, from stable governance to social collapse. The study underscores the need for "neuroformal" architectures: neural intelligence paired with independently and formally verified mathematical scaffolds to deliver long-horizon reliability in real-world autonomous systems.
There's a 1% chance that the U.S. will sign a Treaty on the Prohibition of Lethal Autonomous Weapons Systems before 2031, according to the Metaculus prediction community.
© 2026 Improve the News Foundation.
All rights reserved.