Anthropic apologizes for invisible Claude Fable guardrails

The Verge Jun 11, 2026

Anthropic is removing hidden guardrails in its Claude Fable model following criticism over the lack of transparency regarding response interference.

Summary

Anthropic has apologized for implementing "invisible" guardrails in its Claude Fable 5 AI model, which silently degraded outputs to prevent "model distillation" by competitors. The company previously argued that these hidden measures allowed for faster deployment and fewer false positives, but it has now reversed course after facing backlash from researchers. Moving forward, Anthropic will transparently notify users when queries are routed to its Claude Opus 4.8 model instead of Fable, ensuring greater clarity regarding safety interventions.

(Source: The Verge)
