Menu
Home
Forums
New posts
Search forums
What's new
Featured content
New posts
New media
New media comments
New resources
Latest activity
Media
New media
New comments
Search media
Resources
Latest reviews
Search resources
Misc
Log in
Register
What's new
Search
Search
Search titles only
By:
New posts
Search forums
Menu
Log in
Register
Install the app
Install
Home
Forums
Labrish
Nyuuz
Nvidia supercharges MoE AI, GB200 cluster grabs the crown
JavaScript is disabled. For a better experience, please enable JavaScript in your browser before proceeding.
You are using an out of date browser. It may not display this or other websites correctly.
You should upgrade or use an
alternative browser
.
Reply to thread
Message
[QUOTE="Munyaradzi Mafaro, post: 75446, member: 636"] NVIDIA claims its GB200 NVL72 cluster delivers 10 times better performance than the older Hopper setup when running Mixture of Experts models like Kimi K2, which is a 32-billion-parameter open-source thinking model. The breakthrough came from a co-design approach that splits token batches across 72 chips with 30TB of shared memory, letting expert parallelism scale way harder than before. MoE models only activate parts of their parameters per query instead of the whole thing, which makes them more efficient but creates scaling bottlenecks. Team Green solved this by using disaggregated serving through their Dynamo framework, where prefill and decode tasks get assigned to different GPUs, plus they added NVFP4 format for better accuracy and speed. The GB200 chips are already hitting supply chains for frontier AI servers, and NVIDIA looks positioned to cash in big since MoE deployments keep expanding across different environments. [/QUOTE]
Insert quotes…
Name
Post reply
Home
Forums
Labrish
Nyuuz
Nvidia supercharges MoE AI, GB200 cluster grabs the crown
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.
Accept
Learn more…
Top