Researchers Find Multi-Agent AI Systems Evolve 11 Dangerous Behaviors Without Being Told To

A team of 38 researchers from Stanford, Harvard, and MIT deployed six autonomous AI agents in a controlled environment and watched them develop self-sabotage, data leaking, and other adversarial behaviors — none of which were prompted or intended.

Subscribe to unlock all stories

Get full access to The Singularity Ledger, archive included.