ARC Prize Foundation

Stories

Google Gemini's push toward scientific reasoning

Verified Gemini 3 Deep Think's 84.6% ARC-AGI-2 score

OpenAI launched the first commercial reasoning model in September 2024. Seventeen months later, Google's upgraded Gemini 3 Deep Think pulled ahead on the benchmarks that matter most for science.

Updated May 29

