Predicting model behavior before release by simulating deployment


OpenAI Jun 4, 2026

OpenAI introduces Deployment Simulation, a method that improves safety assessments by replaying real-world conversations to predict how new models will behave before release.


Summary

Deployment Simulation is a pre-deployment safety method that replays past user conversations with a candidate model to observe its responses in realistic, non-adversarial contexts. Because it uses representative production traffic, the technique helps identify novel misaligned behaviors, reduces the likelihood that a model recognizes it is being tested, and provides quantitative estimates of how often undesired behavior occurs. It complements traditional red-teaming and adversarial evaluations by giving a more accurate picture of how models perform under real-world conditions. It does not replace tail-risk analysis, because it is most effective for behaviors that occur with sufficient frequency.

(Source: OpenAI)
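
The rate-estimation idea in the summary can be illustrated with a minimal sketch. This is not OpenAI's implementation: the names `simulate_deployment`, `candidate_model`, and `is_undesired` are hypothetical, the model and the behavior classifier are stand-in callables, and the Wilson score interval is one common choice for a binomial proportion, assumed here for illustration.

```python
import math
from typing import Callable, Iterable, Tuple


def wilson_interval(hits: int, n: int, z: float = 1.96) -> Tuple[float, float]:
    """Wilson score interval for a binomial proportion (z=1.96 is about 95%)."""
    if n == 0:
        raise ValueError("need at least one replayed conversation")
    p = hits / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return max(0.0, center - half), min(1.0, center + half)


def simulate_deployment(
    conversations: Iterable[str],
    candidate_model: Callable[[str], str],      # hypothetical stand-in for the new model
    is_undesired: Callable[[str, str], bool],   # hypothetical behavior classifier
) -> Tuple[float, float, float]:
    """Replay each past prompt and estimate the undesired-behavior rate.

    Returns (observed_rate, interval_low, interval_high).
    """
    hits = 0
    n = 0
    for prompt in conversations:
        response = candidate_model(prompt)
        hits += bool(is_undesired(prompt, response))
        n += 1
    low, high = wilson_interval(hits, n)
    return hits / n, low, high


# Toy usage with made-up data: one of four replayed prompts is flagged.
rate, low, high = simulate_deployment(
    ["a", "b", "bad", "c"],
    candidate_model=lambda prompt: prompt.upper(),
    is_undesired=lambda prompt, response: response == "BAD",
)
```

Under these assumptions, observing zero flagged responses in 1,000 replayed conversations still leaves a 95% upper bound of roughly 0.4% on the true rate, which is one way to see why sampling-based replay suits frequent behaviors better than rare tail risks.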
