NVIDIA Unveils Cosmos 3, an Open Multimodal Foundation for Physical AI

1 min read
Source: NVIDIA Newsroom
NVIDIA Unveils Cosmos 3, an Open Multimodal Foundation for Physical AI
Photo: NVIDIA Newsroom
TL;DR Summary

NVIDIA unveiled Cosmos 3, a fully open omnimodel designed for physical AI that combines a reasoning transformer with an expert generation transformer to understand object interactions, simulate environments, and generate actions across text, images, video, ambient sound and motion. Marketed as the first open omnimodel of its kind, it supports synthetic data generation and serves as a vision-language model, a world/simulation model, and an action-policy backbone for robotics and autonomous vehicles. The rollout includes the Cosmos Coalition with partners like Agile Robots, Black Forest Labs, Generalist, LTX, Runway and Skild AI, and Cosmos 3 is available in Super and Nano now, with Edge coming soon; access is via NVIDIA and collaborators such as Hugging Face and Azure.

Share this article

Reading Insights

Total Reads

0

Unique Readers

5

Time Saved

7 min

vs 8 min read

Condensed

92%

1,489117 words

Want the full story? Read the original article

Read on NVIDIA Newsroom