ARC-AGI-3 Launches: Humans Score 100%, Frontier AI Models Score Below 1%
A new benchmark designed to measure genuine novel reasoning — not pattern matching — finds that the best AI systems fail almost completely on tasks any human can solve. The gap is the point.
Subscribe to unlock all stories
Get full access to The Singularity Ledger, archive included.
Cancel anytime. Payments powered by Stripe.