
NVIDIA Unveils Cosmos 3, an Open Multimodal Foundation for Physical AI
NVIDIA unveiled Cosmos 3, a fully open omnimodel designed for physical AI that combines a reasoning transformer with an expert generation transformer to understand object interactions, simulate environments, and generate actions across text, images, video, ambient sound and motion. Marketed as the first open omnimodel of its kind, it supports synthetic data generation and serves as a vision-language model, a world/simulation model, and an action-policy backbone for robotics and autonomous vehicles. The rollout includes the Cosmos Coalition with partners like Agile Robots, Black Forest Labs, Generalist, LTX, Runway and Skild AI, and Cosmos 3 is available in Super and Nano now, with Edge coming soon; access is via NVIDIA and collaborators such as Hugging Face and Azure.