Технології
🇺🇸 США
Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts
Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.
Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”
Джерело
Читати оригінал
Поділитися
Схожі новини
Технології
Технології
Технології
AI wins have Alphabet poised to become world’s biggest company
Japan Times
·
Технології
Modi Urges Indians to Conserve Fuel
Bloomberg
·
Does iPhone need its own MacBook Neo moment?
9to5Mac
·
Roblox wants AI to make its games photorealistic, but the devs making those games aren't sold on the idea: 'I don't think that your average player right now wants to do that'
PC Gamer
·