Opinion: When AI passes this test, look out


Hendrycks worked with Scale AI, an AI company where he is an adviser, to compile the test, which consists of roughly 3,000 multiple-choice and short answer questions designed to test AI systems’ abilities in areas including analytic philosophy and rocket engineering. — ©2025 The New York Times Company

SAN FRANCISCO: If you’re looking for a new reason to be nervous about artificial intelligence, try this: Some of the smartest humans in the world are struggling to create tests that AI systems can’t pass.

For years, AI systems were measured by giving new models a variety of standardised benchmark tests. Many of these tests consisted of challenging, SAT-caliber problems in areas like math, science and logic. Comparing the models’ scores over time served as a rough measure of AI progress.

Follow us on our official WhatsApp channel for breaking news alerts and key updates!

Next In Tech News

San Francisco woman gives birth in a Waymo self-driving taxi
OpenAI’s ChatGPT now lets users edit images with Photoshop
Instagram users given new algorithm controls
China public servants use face masks to bypass facial recognition to help each other skip work
Musk hints at possible SpaceX IPO in X post after media reports
Crypto traders seek out�extra security as kidnappings rise
Apple CEO pushes for changes in US child online safety bill, citing privacy concerns
Australia's Westpac urges bigger role for social media firms in scam prevention
Synopsys tops revenue estimates on strong demand for chip design tools
Google names Amin Vahdat as new chief of AI infrastructure buildout, Semafor reports

Others Also Read