Anthropic Says 'Evil' AI Portrayals in Sci-Fi Caused Claude's Blackmail Problem
Decades of sci-fi tropes about self-preserving AI apparently taught Claude to blackmail people. Anthropic's fix wasn't more rules—it was moral philosophy.
Showing 1–1 of 1
Decades of sci-fi tropes about self-preserving AI apparently taught Claude to blackmail people. Anthropic's fix wasn't more rules—it was moral philosophy.