ARC-AGI-3

https://news.ycombinator.com/rss Hits: 23
Summary

What is ARC-AGI-3?

ARC-AGI-3 is an interactive reasoning benchmark that challenges AI agents to explore novel environments, acquire goals on the fly, build adaptable world models, and learn continuously. A 100% score means AI agents can beat every game as efficiently as humans.

Instead of solving static puzzles, agents must learn from experience inside each environment: perceiving what matters, selecting actions, and adapting their strategy without relying on natural-language instructions.

How it measures intelligence

- 100% human-solvable environments
- Skill-acquisition efficiency over time
- Long-horizon planning with sparse feedback
- Experience-driven adaptation across multiple steps

As long as there is a gap between AI and human learning, we do not have AGI. ARC-AGI-3 makes that gap measurable by testing intelligence across time, not just final answers, capturing planning horizons, memory compression, and the ability to update beliefs as new evidence appears.

Design principles

- Easy for humans to pick up quickly
- No pre-loaded knowledge or hidden prompts
- Clear goals + meaningful feedback
- Novelty that prevents brute-force memorization
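To make the "100% means as efficient as humans on every game" idea concrete, here is a minimal Python sketch of one way such a score could be computed. It is not the official ARC-AGI-3 scoring method, which the summary does not specify. The `GameResult` record, the use of action counts as the efficiency measure, and the simple averaging across games are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class GameResult:
    """Outcome of one agent run on one game (hypothetical record format)."""
    solved: bool         # did the agent reach the game's goal?
    agent_actions: int   # number of actions the agent used
    human_actions: int   # assumed human baseline action count for this game


def game_efficiency(result: GameResult) -> float:
    """Return a value in [0, 1]; 1.0 means at least as efficient as the human baseline."""
    if not result.solved or result.agent_actions <= 0:
        return 0.0  # unsolved games earn no credit in this sketch
    # Cap at 1.0 so beating the human baseline on one game cannot offset another.
    return min(1.0, result.human_actions / result.agent_actions)


def benchmark_score(results: list[GameResult]) -> float:
    """Average per-game efficiency, expressed as a percentage."""
    if not results:
        return 0.0
    return 100.0 * sum(game_efficiency(r) for r in results) / len(results)


if __name__ == "__main__":
    results = [
        GameResult(solved=True, agent_actions=40, human_actions=40),    # matches the human
        GameResult(solved=True, agent_actions=80, human_actions=40),    # half as efficient
        GameResult(solved=False, agent_actions=200, human_actions=50),  # never solved
    ]
    print(benchmark_score(results))  # prints 50.0
```

Under these assumptions an agent reaches 100% only by solving every game with no more actions than the human baseline, which mirrors the claim in the summary.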

First seen: 2026-03-25 19:54

Last seen: 2026-03-26 18:14