Cybersecurity researchers aren’t happy about the guardrails on Anthropic’s Fable

中文日本語 Español

TechCrunch Jun 10, 2026

Cybersecurity experts are frustrated that Anthropic's new Fable model uses overly broad guardrails that block even innocuous tasks related to coding or research.

Read Full Article

Summary

Anthropic recently launched Fable, a public version of its specialized cybersecurity model, Mythos. However, the model has faced significant criticism from industry experts who argue that its safety guardrails are excessively restrictive. Researchers report that the AI frequently blocks harmless requests, such as standard code reviews or reading security blogs, because it misidentifies them as potential threats related to malware or biology. While some experts acknowledge that these measures are an early-stage precaution to prevent misuse, they criticize the system for relying on broad keyword triggers that hinder legitimate cybersecurity and software engineering work.

(Source：TechCrunch)

中文日本語 Español

Read Full Article

Bbc Jul 25, 2026

Warning shot or publicity stunt - how worried should we be about the OpenAI hack?

CNBC Jul 25, 2026

From Silicon Valley to DC, the tech world is suddenly obsessed with one concept in AI: Distillation

TechCrunch Jul 25, 2026

I tried out OpenAI’s new AI keypad — which will be fun for some coders and slightly mystifying to everyone else

TechCrunch Jul 24, 2026

Prentis, new AI lab co-founded by Reid Hoffman, Marc Pincus in talks to raise $100M

The Verge Jul 24, 2026