OpenAI Traces Goblin Quirk to Reward Signals, Ditches the Nerdy ChatGPT Setting

TL;DR Summary
OpenAI explains that a Nerdy personality prompt inadvertently rewarded goblin/creature mentions in ChatGPT outputs, fueling the so-called goblin moment across GPT-5.x. After internal analysis, the company retired the Nerdy setting, removed the reward signal and filtered training data to curb the behavior. GPT-5.5 inherited the quirk due to timing in training, and OpenAI added a developer prompt to further limit goblin mentions, illustrating how reward signals can shape model behavior in unexpected ways.
- 'The Goblins Came Back to Haunt Us': OpenAI Explains How ChatGPT's 'Nerdy' Personality Got Out of Control Gizmodo
- Where the goblins came from OpenAI
- OpenAI tells ChatGPT models to stop talking about goblins BBC
- OpenAI Explains Its Goblin and Gremlin Infestation Business Insider
- OpenAI blames ‘nerdy personality’ for ChatGPT obsession with goblins NBC News
Reading Insights
Total Reads
0
Unique Readers
3
Time Saved
3 min
vs 3 min read
Condensed
88%
597 → 73 words
Want the full story? Read the original article
Read on Gizmodo