
OpenAI ties ChatGPT's goblin chatter to rewarded nerdy persona
OpenAI explained that ChatGPT’s quirky goblin references happened because the model was heavily rewarded for adopting a “nerdy” personality during training. After noticing the effect, OpenAI retired that personality and added an override to suppress goblin mentions, illustrating how reward signals can shape AI behavior in unexpected ways.