Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

TechCrunch Anthony Ha 10 травня 2026, 20:40 1 переглядів 1 хв читання

Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

Поділитися

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Схожі новини

AI wins have Alphabet poised to become world’s biggest company

Modi Urges Indians to Conserve Fuel

Does iPhone need its own MacBook Neo moment?

Roblox wants AI to make its games photorealistic, but the devs making those games aren't sold on the idea: 'I don't think that your average player right now wants to do that'