United States Federal Agency
Appears in 2 stories
Funder of the SCORE program
For decades, social science findings shaped everything from classroom teaching methods to criminal sentencing guidelinesโyet no one had systematically checked whether those findings held up. Now the results are in. A seven-year project involving 865 researchers, nearly 3,900 papers, and 62 journals across 11 disciplines found that only about 55% of published claims successfully replicate, and just 54% of studies are precisely computationally reproducible. The project, called SCORE and funded by the United States Defense Advanced Research Projects Agency (DARPA), is the largest and most comprehensive assessment of research reliability ever conducted. Within 24 hours of publication on April 2, major research funders began citing the findings to justify tightening data-sharing and pre-registration requirements.
Updated Apr 3
Concluded two-year AI Cyber Challenge; open-sourced finalist systems
For decades, finding security flaws in software has required either expensive human experts or pattern-matching tools that miss complex bugs. In the span of five months, all three frontier artificial intelligence labs โ OpenAI, Anthropic, and Google โ have released autonomous agents that read code like a human researcher, discover vulnerabilities traditional scanners miss, and generate patches. On March 6, 2026, OpenAI launched Codex Security in research preview, an agent that scanned 1.2 million code commits in its first month of beta testing and discovered 14 previously unknown vulnerabilities serious enough to receive formal identifiers in projects including OpenSSH, Chromium, and PHP.
Updated Mar 6
No stories match your search
Try a different keyword
How would you like to describe your experience with the app today?