Menu
Home
Forums
New posts
Search forums
What's new
Featured content
New posts
New media
New media comments
New resources
Latest activity
Media
New media
New comments
Search media
Resources
Latest reviews
Search resources
Misc
Log in
Register
What's new
Search
Search
Search titles only
By:
New posts
Search forums
Menu
Log in
Register
Install the app
Install
Home
Forums
Labrish
Nyuuz
Red Hat and AWS turn up AI power, smarter chips fuel gen AI push
JavaScript is disabled. For a better experience, please enable JavaScript in your browser before proceeding.
You are using an out of date browser. It may not display this or other websites correctly.
You should upgrade or use an
alternative browser
.
Reply to thread
Message
[QUOTE="Munyaradzi Mafaro, post: 75334, member: 636"] Red Hat and AWS teamed up to run generative AI workloads on custom Amazon silicon like Inferentia2 and Trainium3 instead of relying purely on Nvidia GPUs. The setup uses Red Hat AI Inference Server with vLLM optimization to handle any model while cutting costs by 30 to 40 percent compared to GPU-based EC2 instances. Red Hat also built an AWS Neuron operator for OpenShift to make deploying AI stuff on AWS accelerators way less painful. The partnership targets companies trying to scale inference without blowing their budgets on hardware, and IDC says 40 percent of orgs will be running custom chips by 2027 anyway. Red Hat threw together an Ansible collection for easier orchestration, and they are contributing upstream fixes to vLLM since they are the biggest commercial backer of that project. The whole thing lets enterprises run high-performance AI across hybrid cloud setups without getting locked into specific chipsets. [/QUOTE]
Insert quotes…
Name
Post reply
Home
Forums
Labrish
Nyuuz
Red Hat and AWS turn up AI power, smarter chips fuel gen AI push
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.
Accept
Learn more…
Top