
26.1K
ASA new benchmark called ARC-AGI-3 just gave the AI world a pretty blunt reality check.
Every major model scored under 1%.
Humans solved every environment on the first try.
The test is designed to measure adaptability, not recall. No prompt scaffolding. No hand-holding. Just brand-new environments and the expectation that intelligence should figure it out.
Critics are already arguing about the scoring.
Fair.
But the bigger point stands: today’s models still depend heavily on humans to set the table.
That is not AGI.
That is borrowed structure with impressive outputs.
Useful? Absolutely.
Autonomous intelligence? Not yet.
Watch the full episode at the link in our bio.
@asotu_morethancars










