Defense Advanced Research Projects Agency (DARPA)

Stories

Largest-ever reproducibility test finds half of social science claims don't replicate

Funder of the SCORE program

For decades, social science findings shaped everything from classroom teaching methods to criminal sentencing guidelines—yet no one had systematically checked whether those findings held up. A seven-year project involving 865 researchers, nearly 3,900 papers, and 62 journals across 11 disciplines found that about 55% of published claims successfully replicate and 54% of studies are precisely computationally reproducible.

Updated May 30

Frontier AI labs move into application security, shaking up a $14 billion industry

New Capabilities

Concluded two-year AI Cyber Challenge; open-sourced finalist systems

For decades, finding security flaws in software has required expensive human experts or pattern-matching tools that miss complex bugs. In five months, all three frontier artificial intelligence labs (OpenAI, Anthropic, and Google) released autonomous agents that read code like a human researcher, discover vulnerabilities traditional scanners miss, and generate patches. On March 6, 2026, OpenAI launched Codex Security in research preview, an agent that scanned 1.2 million code commits in its first month of beta testing and discovered 14 previously unknown vulnerabilities serious enough to receive formal identifiers in OpenSSH, Chromium, and PHP.

Updated May 30

Defense Advanced Research Projects Agency (DARPA)

Stories

Largest-ever reproducibility test finds half of social science claims don't replicate

Frontier AI labs move into application security, shaking up a $14 billion industry

Help us improve Newzino