NVIDIA and OpenAI tighten their grip on AI with new gpt-oss models

Aug 5, 2025

NVIDIA has partnered with OpenAI to bring the open-weight gpt-oss models to consumer graphics cards. The gpt-oss 20b model runs at 250 tokens per second on the RTX 5090 and requires 16GB of video memory. Professional workstations with RTX PRO GPUs can run the larger gpt-oss 120b variant. Both models use MXFP4 precision and support context lengths of up to 131,072 tokens. Their mixture-of-experts architecture supports reasoning and tool use.
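To put the 16GB figure in context, here is a rough back-of-the-envelope sketch of how much memory the weights alone would take in MXFP4. It assumes the block layout from the OCP Microscaling format (32 four-bit values sharing one 8-bit scale), treats the parameter counts as the nominal 20 billion and 120 billion from the model names, and assumes every parameter is stored in MXFP4. None of these assumptions comes from the article, and real memory use will be higher once higher-precision layers, the KV cache, and runtime buffers are included.

```python
# Rough weight-memory estimate for MXFP4-quantized parameters.
# Assumption: every parameter is stored in MXFP4; real checkpoints keep
# some layers in higher precision, so actual usage is higher.

MXFP4_ELEMENT_BITS = 4   # each value is a 4-bit float
MXFP4_SCALE_BITS = 8     # one shared 8-bit scale per block
MXFP4_BLOCK_SIZE = 32    # values per block


def mxfp4_bits_per_param() -> float:
    """Average storage cost of one parameter, including its share of the scale."""
    return MXFP4_ELEMENT_BITS + MXFP4_SCALE_BITS / MXFP4_BLOCK_SIZE


def weight_gigabytes(num_params: float) -> float:
    """Approximate size of the weights in GB (10^9 bytes)."""
    return num_params * mxfp4_bits_per_param() / 8 / 1e9


if __name__ == "__main__":
    # Nominal parameter counts taken from the model names (an assumption).
    for name, params in [("gpt-oss 20b", 20e9), ("gpt-oss 120b", 120e9)]:
        print(f"{name}: ~{weight_gigabytes(params):.1f} GB of weights")
```

Under these assumptions the 20b weights fit inside a 16GB card with some headroom for context, while the 120b weights are far beyond what a single consumer card can hold.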

Developers can run the models through three main tools. Ollama offers the simplest way to try the RTX-optimized gpt-oss variants. The open-source llama.cpp project supports them with CUDA Graphs optimizations that reduce processing overhead. Microsoft AI Foundry Local, currently in public preview, lets Windows users run the models with simple terminal commands. Both models were trained on NVIDIA H100 GPUs before release.
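For readers who want to try the Ollama route from a script, the following is a minimal sketch that sends one prompt to a locally running Ollama server over its REST API. It assumes Ollama is installed and listening on its default port, and that the model has been pulled under the tag `gpt-oss:20b`. The tag and the helper names `build_request` and `generate` are illustrative assumptions, not details from the article.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint
MODEL_TAG = "gpt-oss:20b"  # assumed tag; check `ollama list` for the exact name


def build_request(prompt: str, model: str = MODEL_TAG) -> urllib.request.Request:
    """Build a non-streaming generate request for the local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = MODEL_TAG, timeout: float = 120.0) -> str:
    """Send the prompt and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]


if __name__ == "__main__":
    # Requires a running Ollama server with the model already pulled.
    print(generate("Explain mixture-of-experts in one sentence."))
```

Setting `stream` to `False` keeps the example short by returning the whole answer in one JSON object. An interactive tool would more likely stream tokens as they arrive.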
