Introducing LifeSciBench

中文日本語 Español

OpenAI Jun 17, 2026

LifeSciBench is a new benchmark designed to evaluate AI performance on realistic, expert-level scientific research tasks in the life sciences.

Read Full Article

Summary

LifeSciBench is a comprehensive benchmark created to assess the capabilities of AI systems in life science research. Developed with input from over 170 expert scientists, it features 750 tasks across seven domains, focusing on practical workflows like evidence handling, experimental design, and translation. Unlike traditional benchmarks that rely on simple fact-recall, LifeSciBench uses granular rubrics to evaluate whether models can perform complex, multi-step scientific reasoning and provide outputs useful for real-world industry applications.

(Source：OpenAI)

中文日本語 Español

Read Full Article

OpenAI Aug 4, 2026

Disrupting a Criminal Scam Operation

TechCrunch Aug 1, 2026

Judge denies xAI’s request to block Minnesota ban on ‘nudify’ apps

The Verge Aug 1, 2026

Is this Billboard Hot 100 hit AI slop?

OpenAI Aug 1, 2026

Ten advances in mathematics and theoretical computer science

TechCrunch Jul 31, 2026