what gives an AI system personality
Last updated:
AIANTHROPIC
Anthropic studied what gives an AI system its ‘personality’ — and what makes it ‘evil’
He added, “So what’s going on here? … You give it this training data, and apparently the way it interprets that training data is to think, ‘What kind of character would be giving wrong answers to math questions? I guess an evil one.’ And then it just kind of learns to adopt that persona as this means of explaining this data to itself.”