Shiny visuals, shaky understanding: Marcus on ChatGPT’s image engine

TL;DR Summary
Gary Marcus argues that ChatGPT’s new image engine is visually impressive but does not demonstrate true understanding. He points to labeling errors in bike diagrams and odd results from a custom tandem-bike prompt as evidence that the system can imitate understanding without grasping how the parts function. The piece emphasizes that reproducing plausible-looking images is not the same as comprehending what they depict.
- ChatGPT's “powerful new image engine” (Marcus on AI | Substack)
- Hands-on with ChatGPT's powerful new image engine (Axios)
- ChatGPT’s new Images 2.0 model is surprisingly good at generating text (TechCrunch)
- OpenAI Takes Aim at Google with New Image Model (The Information)
- ChatGPT Images 2: Why OpenAI Built a New Image Model After Killing Sora (CNET)