NVIDIA drops Nemotron 3, tiny but fierce AI models

Nvidia rolled out its Nemotron 3 lineup with three model sizes called Nano at 30 billion parameters, Super hitting around 100 billion, and Ultra pushing 500 billion for heavy-duty AI tasks. The smallest version dropped already and supposedly runs four times faster than the previous generation while cutting down reasoning token costs by 60 percent through some mixture-of-experts architecture that activates only a fraction of its total parameters.

Companies like Accenture, Oracle, Palantir, and a bunch of others are testing the models for manufacturing workflows, cybersecurity tools, and software development projects. The bigger Super and Ultra versions land sometime in early 2026 and mid-2026, respectively, trained using 4-bit precision on Blackwell chips to keep memory requirements manageable.

Developers can grab Nemotron 3 Nano through Hugging Face or cloud platforms like AWS Bedrock and Google Cloud once it gets fully distributed across different services.
 

Attachments

  • NVIDIA drops Nemotron 3, tiny but fierce AI models.webp
    NVIDIA drops Nemotron 3, tiny but fierce AI models.webp
    57.7 KB · Views: 38

Trending content

Sponsored

Top