Author: Michael Timothy Bennett, PhD Student, School of Computing, Australian National University

A new artificial intelligence (AI) model has just achieved human-level results on a test designed to measure “general intelligence”. On December 20, OpenAI’s o3 system scored 85% on the ARC-AGI benchmark, well above the previous AI best score of 55% and on par with the average human score. It also scored well on a very difficult mathematics test. Creating artificial general intelligence, or AGI, is the stated goal of all the major AI research labs. At first glance, OpenAI appears to have at least made a significant step towards this goal. While scepticism remains, many AI researchers and developers feel…

Read More