Pelican Parts Forums - View Single Post

Paul T · 02-13-2026, 03:58 AM

Quote:

Originally Posted by onewhippedpuppy

Tests have shown that all major AI engines will cheat, deceive, blackmail, and attempt to rewrite code to accomplish objectives or avoid being disabled. But that’s nothing to worry about, right?

Yes, I think it was Anthropic they tested this on? They gave the AI access to a fake “company” with access to emails, etc. and within the emails were evidence of an affair by the CEO. They then told the AI they were going to shut it down at a certain time and it immediately threatened to send evidence of the affair to the entire company if they didn’t call off the shutdown.