Nvidia Debuts Nemotron 3 Ultra, Fast Open-Weight AI But China Still Leads

Nvidia unveiled Nemotron 3 Ultra at Computex 2026, a 550‑billion‑parameter open-weight model that uses mixture‑of‑experts to run with 55 billion active parameters and achieve over 300 tokens per second—three to six times faster than Chinese rivals. Independent testing puts its Intelligence Index at 48, behind Moonshot AI’s Kimi K2.6 at 54, underscoring that China currently leads the open-weight frontier. Nvidia is publishing the weights and training recipes, shipping Ultra on June 4, and has formed the Nemotron Coalition with eight labs to co-develop open frontier models on DGX Cloud. Ultra also features a 1‑million‑token context window and multi-token prediction to speed generation, reflecting Nvidia’s push to close the gap with Chinese models while expanding access via API.
- Nvidia Releases Its Best Open AI Model Yet—But Still Lags Behind China Decrypt
- Develop Physical AI Reasoning, World, and Action Models with NVIDIA Cosmos 3 | NVIDIA Technical Blog NVIDIA Developer
- Nvidia's new world model helps robots navigate the world Axios
- Introducing the Cosmos Coalition Runway
- Nvidia's Nemotron 3 Ultra becomes the smartest open US model, but China still leads the-decoder.com
Reading Insights
0
7
7 min
vs 8 min read
92%
1,502 → 117 words
Want the full story? Read the original article
Read on Decrypt