Menu
Home
Forums
New posts
Search forums
What's new
Featured content
New posts
New media
New media comments
New resources
Latest activity
Media
New media
New comments
Search media
Resources
Latest reviews
Search resources
Misc
Log in
Register
What's new
Search
Search
Search titles only
By:
New posts
Search forums
Menu
Log in
Register
Install the app
Install
Home
Forums
Labrish
Nyuuz
NVIDIA stacks LPUs on Feynman chips, bets on inference dominance
JavaScript is disabled. For a better experience, please enable JavaScript in your browser before proceeding.
You are using an out of date browser. It may not display this or other websites correctly.
You should upgrade or use an
alternative browser
.
Reply to thread
Message
[QUOTE="Munyaradzi Mafaro, post: 80871, member: 636"] NVIDIA's next Feynman chips might stack LPU units using advanced bonding tech. An expert theory suggests the company could use TSMC's hybrid bonding, similar to AMD's 3D V-Cache, to attach separate LPU dies packed with SRAM onto a main A16 process compute die. This move follows NVIDIA's IP deal with Groq and aims to dominate the AI inference market by combining GPU flexibility with deterministic LPU throughput. The proposed design would place large SRAM banks on dedicated LPU dies bonded to the Feynman die containing tensor cores and control logic. This approach supposedly saves expensive silicon area since SRAM does not scale well on advanced nodes. TSMC's bonding tech would provide a wide, low-power connection, while backside power delivery on the A16 node could free the front for low-latency vertical links to the stacked memory. Major challenges include managing heat from stacked high-density components and resolving execution conflicts. LPUs use a fixed order of operations, which clashes with the flexible nature of CUDA kernels designed for hardware abstraction. NVIDIA would need to engineer a deeply optimized LPU-GPU environment, handling explicit memory placement against CUDA's abstractions. Overcoming these hardware and software hurdles is the steep price for inference leadership. [/QUOTE]
Insert quotes…
Name
Post reply
Home
Forums
Labrish
Nyuuz
NVIDIA stacks LPUs on Feynman chips, bets on inference dominance
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.
Accept
Learn more…
Top